The File System¶
Objectives
- Learn about the file system on Kebnekaise
- Find the project storage for this course and create your own subdirectory
Overview¶
Project storage | $HOME | /scratch | |
---|---|---|---|
Recommended for batch jobs |
Yes | No (size) | Yes |
Backed up | No | Yes | No |
Accessible by batch system |
Yes | Yes | Yes (node only) |
Performance | High | High | Medium |
Default readability | Group only | Owner | Owner |
Permissions management |
chmod, chgrp, ACL | chmod, chgrp, ACL | N/A for batch jobs |
Notes | Storage your group get allocated through the storage projects |
Your home-directory | Per node |
$HOME¶
This is your home-directory (pointed to by the $HOME
variable). It has a quota limit of 25GB per default. Your home directory is backed up regularly.
Note
Since the home directory is quite small, it should not be used for most production jobs. These should instead be run from project storage directories.
To find the path to your home directory, either run pwd
just after logging in, or do the following:
Project storage¶
Project storage is where a project’s members have the majority of their storage. It is applied for through SUPR, as a storage project. While storage projects needs to be applied for separately, they are usually linked to a compute project.
This is where you should keep your data and run your batch jobs from. It offers high performance when accessed from the nodes making it suitable for storage that are to be accessed from parallel jobs, and your home directory (usually) has too little space.
Project storage is located below /proj/nobackup/
in the directory name selected during the creation of the proposal.
Note
The project storage is not intended for permanent storage and there is NO BACKUP of /proj/nobackup
.
Using project storage¶
- If you have a storage project, you should use that to run your jobs.
- You (your PI) will either choose a directory name when you/they apply for the storage project or get the project id as default name.
- The location of the storage project in the file system is
/proj/nobackup/NAME-YOU-PICKED
- Since the storage project is shared between all users of the project, you should go to that directory and create a subdirectory for your things, which you will then be using.- For this course the storage is in
Exercise
Go to the course project storage and create a subdirectory for yourself.
Now is a good time to prepare the course material and download the exercises, if you have not already done so. The easiest way to do so is by cloning the whole intro-course repository from GitHub.
Exercise
- Go to the subdirectory you created under
/proj/nobackup/intro-hpc2n
- Clone the repository for the course:
git clone https://github.com/hpc2n/intro-course.git
You will get a directory called intro-course
. Below it you will find a directory called “exercises” where the majority of the exercises for the batch system section is located.
Quota¶
The size of the storage depends on the allocation. There are small, medium, and large storage projects, each with their own requirements. You can read about this on SUPR. The quota limits are specific for the project as such, there are no user level quotas on that space.
/scratch¶
Our recommendation is that you use the project storage instead of /scratch
when working on Compute nodes or Login nodes.
On the computers at HPC2N there is a directory called /scratch
. It is a small local area split between the users using the node and it can be used for saving (temporary) files you create or need during your computations. Please do not save files in /scratch
you don’t need when not running jobs on the machine, and please make sure your job removes any temporary files it creates.
Note
When anybody need more space than available on /scratch
, we will remove the oldest/largest files without any notices.
More information about the file system, as well as archiving and compressing files, at the HPC2N documentation about File Systems.
Keypoints
- When you login to Kebnekaise, you will end up in your home-directory.
- Your home-directory is in
/home/u/username
and is pointed to by the environment variable$HOME
. - Your project storage is located in
/proj/nobackup/NAME-YOU-PICKED
- For this course it is
/proj/nobackup/intro-hpc2n
. - The project storage is NOT backed up.
- For this course it is
- You should run the batch jobs from your project storage.