Author

Saideep Gona

Published

September 5, 2023

Code
suppressMessages(library(tidyverse))
suppressMessages(library(glue))
PRE = "/Users/saideepgona/Library/CloudStorage/Box-Box/imlab-data/data-Github/Daily-Blog-Sai"

## COPY THE DATE AND SLUG fields FROM THE HEADER
SLUG="ref_to_aracena_mlp" ## copy the slug from the header
bDATE='2023-09-05' ## copy the date from the blog's header here
DATA = glue("{PRE}/{bDATE}-{SLUG}")
#if(!file.exists(DATA)) system(glue::glue("mkdir {DATA}"))
WORK=DATA

For interactive GPU session on beagle:

sinteractive –account=pi-haky –partition=beagle3 –gres=gpu:1 –mem=60GB –time=2:00:00 conda activate /beagle3/haky/users/temi/software/conda_envs/dl-tools export LD_LIBRARY_PATH=\(LD_LIBRARY_PATH:/beagle3/haky/users/temi/software/conda_envs/dl-tools/lib my_ip_address=\)( /sbin/ip route get 8.8.8.8 | awk ’{print \(7;exit}' ) jupyter-notebook --no-browser --ip=\)my_ip_address –port=15005

Bugs

The Kernel crashed while executing code in the the current cell or a previous cell. Please review the code in the cell(

This is caused by not having enough RAM to initially load the model.

CUDA error: uncorrectable ECC error encountered CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

This seems to be a hardware issue. It resolved itself the next day.