Showing posts 109–116 of 116. Each entry opens locally on this site; legacy Hexo posts link back to their original article at the bottom for reference.
2020
- EN
Slurm-Day5
Slurm cluster management notes — best practices for large-scale training jobs and multi-node distributed setups.
- EN
Slurm-Day4
Slurm cluster management notes — monitoring, accounting, and troubleshooting common cluster issues.
- EN
Slurm-Day3
Slurm cluster management notes — advanced job management with dependencies, priorities, and QOS configurations.
- EN
Slurm-Day2
Slurm cluster management notes — resource allocation, GPU scheduling, and job arrays for parallel workloads.
- EN
Slurm-Day1
Slurm cluster management notes — introduction to job scheduling, partitions, and basic sbatch/srun commands.
- EN
Reinforcement Learning-Principle-Day1
Reinforcement learning study notes — introducing MDPs, value functions, Bellman equations, and the fundamental framework of RL.
2019
- EN
Tensorflow-Day1-DNN Explain
Deep learning fundamentals with TensorFlow — covering DNN architecture, forward/backward propagation, activation functions, and gradient descent.
- EN
Reinforcement Learning\_WatermelonBook\_Summary
Summary of reinforcement learning concepts from Zhou Zhihua's Machine Learning textbook (Watermelon Book), covering core RL theory and algorithms.