Building Flash Attention from Source

Notes from compiling Flash Attention on an A800 box. If you’re hitting endless build times or OOM “killed” errors, the key env vars and pitfalls here may save you time.

2025-07-26    610 words    3 min

VS Code for Remote Python Development

A quick guide to using VS Code with SSH for remote development, plus a plug for notebooks. Once you know the flow, spinning up a remote Python/Notebook workflow is incredibly convenient.

2025-01-14    2025-01-19    1267 words    6 min

Back Up and Migrate Conda Environments

I recently needed to move a Conda environment. Deep learning stacks are usually tightly coupled to driver versions and package versions, so being able to package a working environment and drop it onto another box saves a lot of time.

2024-06-29    516 words    3 min

Install CUDA and an NLP Stack with Conda (No Root)

How a non-root user can install a newer version of the transformers suite without being able to change the version of the installed cuda driver.

2024-03-10    684 words    4 min

Install the R Kernel for Jupyter with Conda

Due to the author's frequent use of conda to configure the R kernel jupyter notebook on Linux remote servers, some steps are forgotten over time (the main problem is that the same pit will be stepped on differently every time). After summarizing the relevant tutorials and personal experience on the internet, a guide is provided for everyone and future me to refer to.

2022-10-10    1485 words    7 min

BiliBili Automation on Qinglong Panel

Following the basic Qinglong setup and JD coins automation, here’s a walkthrough for the BiliTools project. I added notes based on the original docs—use as a reference.

2022-09-07    2023-10-01    1274 words    3 min