Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
AI Containers
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
dataanalytics-public
AI Containers
Commits
70765b34
Commit
70765b34
authored
1 month ago
by
Nastassya Horlava
Browse files
Options
Downloads
Patches
Plain Diff
upd
parent
36d79165
Branches
Branches containing commit
No related tags found
1 merge request
!4
Docs tensorflow
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
tensorflow/viper/run_undistributed.slurm
+6
-5
6 additions, 5 deletions
tensorflow/viper/run_undistributed.slurm
tensorflow/viper/tf-2.16-recipe.def
+1
-0
1 addition, 0 deletions
tensorflow/viper/tf-2.16-recipe.def
with
7 additions
and
5 deletions
tensorflow/viper/run_undistributed.slurm
+
6
−
5
View file @
70765b34
...
@@ -15,12 +15,13 @@ module purge
...
@@ -15,12 +15,13 @@ module purge
module load apptainer/1.4.1
module load apptainer/1.4.1
# Get the command line from the job info
# Get the command line from the job info
command_line
=
$(
scontrol show job
"
${
SLURM_JOBID
}
"
|
grep
-oP
'Command=\K.*'
)
export
SBATCH_SCRIPT_PATH
=
$(
scontrol show job
"
$SLURM_JOBID
"
|
grep
-oP
'Command=\K.*'
|
sed
-E
"s|/
$(
echo
"
$SLURM_CELL
"
|
tr
'[:upper:]'
'[:lower:]'
)
||"
|
sed
-E
's|^/u[12]/|/u/|'
)
lower_cell
=
$(
echo
"
$SLURM_CELL
"
|
tr
'[:upper:]'
'[:lower:]'
)
# Convert SLURM_CELL to lowercase
export
SBATCH_SCRIPT_PATH
=
"
${
command_line
//\/
$lower_cell
/
}
"
# Remove /$lower_cell from the path
sif_file
=
"
$(
dirname
"
$SBATCH_SCRIPT_PATH
"
)
/raven/tf-2.16.sif"
code_dir
=
"
$(
dirname
"
$SBATCH_SCRIPT_PATH
"
)
/src"
sif_file
=
"
$(
dirname
"
$SBATCH_SCRIPT_PATH
"
)
/tf-2.16.sif"
code_dir
=
"
$(
dirname
$(
dirname
"
$SBATCH_SCRIPT_PATH
"
))
/src"
echo
sif_file:
$sif_file
echo
code_dir:
$code_dir
export
TF_FORCE_GPU_ALLOW_GROWTH
=
true
export
TF_FORCE_GPU_ALLOW_GROWTH
=
true
export
batch_size_per_device
=
"--batch_size_per_device 256"
export
batch_size_per_device
=
"--batch_size_per_device 256"
...
...
This diff is collapsed.
Click to expand it.
tensorflow/viper/tf-2.16-recipe.def
+
1
−
0
View file @
70765b34
...
@@ -4,6 +4,7 @@ From: rocm/tensorflow:rocm6.3.3-py3.12-tf2.16-dev
...
@@ -4,6 +4,7 @@ From: rocm/tensorflow:rocm6.3.3-py3.12-tf2.16-dev
%post
%post
python -m pip install --upgrade pip
python -m pip install --upgrade pip
pip install ipython ipykernel
pip install ipython ipykernel
pip install click
%environment
%environment
export ROCBLAS_TENSILE_LIBPATH=/opt/rocm-6.3.3/lib/rocblas/library
export ROCBLAS_TENSILE_LIBPATH=/opt/rocm-6.3.3/lib/rocblas/library
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment