#pytorch

Hugging Face aims to reduce a tricky step in PyTorch deployment by turning optimized GPU code into a Hub-native artifact. Clement Delangue wrote that the new Kernels flow delivers precompiled binaries matched to the GPU, PyTorch build, and OS, and targets a 1.7x to 2.5x speedup over the PyTorch baseline.

#hugging-face #kernels #pytorch

AI X/Twitter Apr 10, 2026 1 min read

PyTorch presents faster diffusion inference on Blackwell with Diffusers and TorchAO quantization

On April 8, 2026, PyTorch said on X that MXFP8/NVFP4 quantization built on Diffusers and TorchAO can reduce diffusion latency on the NVIDIA B200. The accompanying blog presents selective quantization and regional compilation as a practical combination for optimizing latency and memory.

#pytorch #torchao #blackwell

AI X/Twitter Apr 9, 2026 1 min read

PyTorch Foundation adds Safetensors and Helion, expanding governance of open-source AI infrastructure tools

On April 9, 2026, PyTorch said on X that Safetensors and Helion have joined the PyTorch Foundation as foundation-hosted projects. With this change, the foundation takes on a larger role in model distribution safety and low-level kernel tooling.

#pytorch #safetensors #helion

LLM Hacker News Apr 7, 2026 1 min read

GuppyLM, an 8.7M-parameter Show HN project that demystifies language models

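GuppyLM, which drew attention on Hacker News's Show HN, exposes the full LLM training process using 60K synthetic conversations and a simple transformer architecture. Its main appeal is that it is a tiny educational model that runs directly in Colab and in the browser.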

#llm #education #pytorch

AI Reddit Mar 17, 2026 1 min read

r/MachineLearning: preflight, a PyTorch pre-training validator that catches label leakage and NaNs before training

On March 15, 2026, a post introducing preflight on r/MachineLearning reached 56 points and 13 comments. This lightweight CLI runs 10 checks before PyTorch training, including label leakage, NaNs, channel ordering, dead gradients, class imbalance, and VRAM risk.

#pytorch #mlops #data-validation

AI Reddit Mar 17, 2026 1 min read

r/MachineLearning: GraphZero, a C++ engine that handles large graphs without loading them into RAM, using mmap and zero-copy tensors

On March 15, 2026, a post introducing GraphZero v0.2 on r/MachineLearning gathered 334 points and 27 comments. The post and the GitHub README describe how it uses SSD mmap, a custom binary format, and a nanobind bridge to handle graphs with 100M+ nodes on consumer hardware.

#graph-neural-networks #pytorch #c++

LLM Reddit Mar 10, 2026 1 min read

The overnight loop for autonomous LLM research that caught r/LocalLLaMA's attention

karpathy/autoresearch, which drew attention on r/LocalLLaMA, is a small open-source research loop in which an agent edits a single training file and runs repeated 5-minute experiments, searching for changes that lower val_bpb.

#ai-agents #research-automation #pytorch

LLM Reddit Mar 9, 2026 1 min read

Karpathy's autoresearch, a research loop where an AI agent repeats PyTorch experiments overnight

autoresearch, shared on LocalLLaMA, is a minimal research framework designed so that an agent modifies PyTorch training code and repeats 5-minute experiments to find a better val_bpb.

#llm #ai-agents #pytorch