<feed xmlns="http://www.w3.org/2005/Atom"> <id>https://pyemma.github.io/</id><title>Coding Monkey</title><subtitle>I am a coding monkey, and I am proud of it. I have done lots of work in machine learning area, especially recommendation system and AutoML. This blog summarize my journey to become an expert monkey in distributed system and LLM.</subtitle> <updated>2026-06-03T05:51:23+00:00</updated> <author> <name>Coding Monkey</name> <uri>https://pyemma.github.io/</uri> </author><link rel="self" type="application/atom+xml" href="https://pyemma.github.io/feed.xml"/><link rel="alternate" type="text/html" hreflang="en" href="https://pyemma.github.io/"/> <generator uri="https://jekyllrb.com/" version="4.4.1">Jekyll</generator> <rights> © 2026 Coding Monkey </rights> <icon>/assets/img/favicons/favicon.ico</icon> <logo>/assets/img/favicons/favicon-96x96.png</logo> <entry><title>A Random Walk Down Recsys - Part 6</title><link href="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-6/" rel="alternate" type="text/html" title="A Random Walk Down Recsys - Part 6" /><published>2026-06-01T00:00:00+00:00</published> <updated>2026-06-01T00:00:00+00:00</updated> <id>https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-6/</id> <content src="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-6/" /> <author> <name>pyemma</name> </author> <category term="Machine Learning" /> <summary>Welcome back to the sixth installment of A Random Walk Down Recsys. This batch is smaller than the last — four papers — but two of them line up neatly enough that they deserve a side-by-side read: Industrial distillation pipelines — ByteDance’s Rec-Distill and Meta’s LoopFM are both attacking the same problem (transferring knowledge from an expensive teacher / foundation model into the prod...</summary> </entry> <entry><title>A Random Walk Down Recsys - Part 5</title><link href="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-5/" rel="alternate" type="text/html" title="A Random Walk Down Recsys - Part 5" /><published>2026-05-10T00:00:00+00:00</published> <updated>2026-05-10T00:00:00+00:00</updated> <id>https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-5/</id> <content src="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-5/" /> <author> <name>pyemma</name> </author> <category term="Machine Learning" /> <summary>Welcome back to the fifth installment of A Random Walk Down Recsys. This batch of papers reflects how quickly the generative recommendation playbook is being adapted to new verticals and tightened for production. Six papers, four themes: Generative Recommendation for Ads — two concurrent works, one from Kuaishou and one from Tencent, both attacking the same gap: today’s GR systems are heavi...</summary> </entry> <entry><title>A Random Walk Down Recsys - Part 4</title><link href="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-4/" rel="alternate" type="text/html" title="A Random Walk Down Recsys - Part 4" /><published>2026-03-07T00:00:00+00:00</published> <updated>2026-03-07T00:00:00+00:00</updated> <id>https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-4/</id> <content src="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-4/" /> <author> <name>pyemma</name> </author> <category term="Machine Learning" /> <summary>Welcome back to the fourth installment of A Random Walk Down Recsys. This time, the three papers span a range of practical challenges in generative recommendation: efficiently compressing long user sequences through recurrent memory, accelerating constrained decoding on hardware accelerators via trie vectorization, and rethinking how semantic IDs are trained and maintained with a dynamic, end-t...</summary> </entry> <entry><title>A Random Walk Down Recsys - Part 3</title><link href="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-3/" rel="alternate" type="text/html" title="A Random Walk Down Recsys - Part 3" /><published>2026-02-22T00:00:00+00:00</published> <updated>2026-02-22T00:00:00+00:00</updated> <id>https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-3/</id> <content src="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-3/" /> <author> <name>pyemma</name> </author> <category term="Machine Learning" /> <summary>Welcome back to the third installment of A Random Walk Down Recsys. This time, all five papers revolve around a single theme: Semantic IDs (SIDs) — how to generate them, how to improve their quality, and how to leverage them effectively in generative recommender (GR) models. The papers span a wide range of ideas: compressing long user sequences through SID hierarchies, injecting reasoning capab...</summary> </entry> <entry><title>A Random Walk Down Recsys - Part 2</title><link href="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-2/" rel="alternate" type="text/html" title="A Random Walk Down Recsys - Part 2" /><published>2026-02-05T00:00:00+00:00</published> <updated>2026-02-05T00:00:00+00:00</updated> <id>https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-2/</id> <content src="https://pyemma.github.io/A-Random-Walk-Down-Recsys-Part-2/" /> <author> <name>pyemma</name> </author> <category term="Machine Learning" /> <summary>Welcome back to the second installment of A Random Walk Down Recsys. In this post, I continue surveying interesting papers from the Arxiv IR section, covering five recent works: HyFormer, Token-level Collaborative Alignment, OneMall, a Sparse Attention approach for long-term user behaviors, and Farewell to Item IDs. HyFormer: Hybrid Cross-Attention for Sequential and Non-Sequential Features ...</summary> </entry> </feed>
