sorryhyun

Knowledge gains its value when shared.

CUDA, Claude Code, Development, Diffusion, LLM Research, LoRA, Machine Learning, Opinion, Paper, Persona, Prompt, Rust
2026-05-29

Training a Turbo LoRA on a Single Consumer GPU

Few-step distillation usually means holding a teacher and a student model in memory at once. We distilled a 28-step diffusion teacher into a 4-step LoRA on one 16 GB card by running three roles off a single frozen backbone.

LoRA, Diffusion, Development
2026-04-25

Training HydraLoRA for a Diffusion Model

How I got an MoE-style LoRA to actually specialize on a T2I model — through a cold-start deadlock, a failed jitter attempt, orthogonalized experts via SVD slicing, expert warmup, and σ-conditional routing borrowed from T-LoRA.

LoRA, Development, Diffusion
2026-04-18

ChitChats: A Persona Prompt Recipe for Opus 4.7

Sharing a persona prompt, memory layout, and the 'excuse tool' — patterns I kept refining from Opus 4.1 through 4.7.

LLM Research, Persona, Prompt
2026-04-03

Making torch.compile Actually Work for DiT LoRA Training

DiT training has three sources of shape dynamism that cause torch.compile to recompile every step. We eliminated all three and got stable compiled training on a consumer GPU.

Development, CUDA
2026-03-30

Getting Flash Attention 4 to Work on SM120 (RTX 5060 Ti)

Flash Attention 4 doesn't support consumer Blackwell GPUs yet. We fixed three critical bugs and got it running on the RTX 5060 Ti.

CUDA, Development
2026-03-14

What C++ proves: the failure of 'understand every line of code'

A C++ critique video as a lens into vibe coding and the myth of total code comprehension.

Development, Opinion
2026-03-12

LoRA Training Precision Improvements

Two precision-oriented features for the LoRA training pipeline: lora_fp32_accumulation and attn_softmax_scale.

Development, LoRA, Machine Learning
2026-01-24

Why Epiplexity matters

Personal opinion on the paper 'Epiplexity'

Paper, Opinion
2026-01-22

Building Windows Applications with Claude Code CLI (Rust)

How to build native Windows desktop applications that integrate with the Claude Code CLI using a pure Rust backend.

Claude Code, Rust, Development
2026-01-19

Should AI slop not be respected?

Personal opinion on AI slop

Opinion
2026-01-09

Why Long Context Research Is Difficult

An exploration of the challenges in long context language model research.

LLM Research
2026-01-03

Recent Papers That Look Good To Me

A personal pick of recently published papers that show strong potential.

LLM Research, Paper, Opinion
2025-12-30

Why Persona Will Be the Next Agentic LLM

A personal take on the role of persona in the agentic LLM paradigm.

Persona, LLM Research, Opinion
2025-12-04

Building Windows Applications with Claude Code CLI (Rust)

How to build native Windows desktop applications that integrate with the Claude Code CLI using a pure Rust backend.

Opinion