Tags auction1 cross-attention1 distributed system2 distributed-training2 embeddings2 feature1 fsdp1 generative-recommender2 job scheduler1 kernel1 llm2 llm training1 llm4rec5 machine learning design5 machine-learning-design2 message queue2 model-parallelism1 open source1 pytorch2 reading1 realtime system2 recommendation-system5 rl-infra1 semantic-id2 sparse features2 sparse-attention1 stateful service2 system design4 training-optimization1 user-sequence-modeling5 verl1 webhook1 websocket1 读书笔记1