Page 10 / 10

116 posts in total. Keep on posting.

Showing posts 109–116 of 116. Each entry opens locally on this site; legacy Hexo posts link back to their original article at the bottom for reference.

2020

  • EN

    Slurm-Day5

    Slurm cluster management notes — best practices for large-scale training jobs and multi-node distributed setups.

  • EN

    Slurm-Day4

    Slurm cluster management notes — monitoring, accounting, and troubleshooting common cluster issues.

  • EN

    Slurm-Day3

    Slurm cluster management notes — advanced job management with dependencies, priorities, and QOS configurations.

  • EN

    Slurm-Day2

    Slurm cluster management notes — resource allocation, GPU scheduling, and job arrays for parallel workloads.

  • EN

    Slurm-Day1

    Slurm cluster management notes — introduction to job scheduling, partitions, and basic sbatch/srun commands.

  • EN

    Reinforcement Learning-Principle-Day1

    Reinforcement learning study notes — introducing MDPs, value functions, Bellman equations, and the fundamental framework of RL.

2019

  • EN

    Tensorflow-Day1-DNN Explain

    Deep learning fundamentals with TensorFlow — covering DNN architecture, forward/backward propagation, activation functions, and gradient descent.

  • EN

    Reinforcement Learning\_WatermelonBook\_Summary

    Summary of reinforcement learning concepts from Zhou Zhihua's Machine Learning textbook (Watermelon Book), covering core RL theory and algorithms.