2026-06-02 日报

主题: 工业级推荐落地：Semantic ID 量化与生成式建模双线突破

标签: semantic-id · quantization · industrial · cold-start

📊 统计: 共 15 篇 · 精读 4 · 🏢 工业界 3 · 🎓 学术 12 · discriminative-rec 6 · generative-rec 2 · llm 7 · other 2

综述

今日共 15 篇，判别式推荐 6 篇、LLM 7 篇、生成式推荐 2 篇，4 篇精读全部出自工业界。快手 FlowTime 提出“连续生成式回归”新范式，以一步式 VAE 加基于观看时长分位数历史的条件 Normalizing Flow 把高斯先验 warp 成个性化多峰分布，避免点回归均值坍缩与离散化误差，线上视频播放时长 +1.044% 并开源 TimeRec 基准。LinkedIn 的 RQ-FSQ 首次实证跨域 viewer Semantic ID，用残差 VAE 加逐维有限标量量化把用户 embedding 压缩 30-280 倍且 AUC 不降，最冷启动用户段 +1.522% AUC，确立“行为活跃度决定跨域迁移质量”。Shopee 的 DRQ 给出诊断 tokenizer 失效的量化框架（期望重叠率 O_π / 有效码本容量 K_eff）与解耦残差量化，揭示符号容量、重构保真与软匹配的多目标权衡。Bilibili 的 LeAP 把 O(N) 置换特征重要性改造成 O(1) 批内可微门控，在十亿级搜索排序无损剪掉 3600+ 冗余维度。趋势上，Semantic ID 量化与生成式分布建模正成为工业推荐主线，另有一簇 LLM 多智能体协作与稀疏模型 scaling law 研究值得关注。

重点论文

FlowTime · ⭐ 8/10

FlowTime: Towards Continuous Generative Watch Time Prediction via Flow-based Personalized Priors

🏢 Kuaishou · 判别式推荐

提出连续生成式回归(Continuous Generative Regression)新范式与 FlowTime——用一步式 VAE 学连续潜空间、再用基于用户/物品观看时长分位数历史的条件 Normalizing Flow 把高斯先验 warp 成个性化多峰流形，建模观看时长的多峰异质分布，避免点回归均值坍缩与离散化量化误差；快手线上 A/B 视频播放时长 +1.044%，并开源 WTP 基准库 TimeRec。

RQ-FSQ · ⭐ 7/10

Quantizing Intent: Cross-Domain Semantic IDs from Organic Activity for Industrial Ranking

🏢 LinkedIn · 判别式推荐

LinkedIn 首次实证跨域 viewer Semantic ID:把 organic feed 行为派生的预训练用户 embedding 用 RQ-FSQ(残差 VAE + 逐维有限标量量化)离散成 30-280× 更小的 SID,经 prefix n-gram 的 HDE 模块端到端注入广告 CTR 排序器,确立'行为活跃度丰富性决定跨域迁移质量',最冷启动用户段 +1.522% AUC。

LeAP · ⭐ 7/10

LeAP: Learnable Adaptive Permutation for Feature Selection in Heterogeneous and Sparse Recommender Systems

🏢 Bilibili · 判别式推荐

LeAP 把传统 O(N) 的置换特征重要性改造成 O(1) 批内可微门控,并用置换散度自适应正则化解异构维度与极端稀疏的评估偏差,在 Bilibili 十亿级请求、12000+ 维搜索排序模型上无损剪掉 3600+ 冗余维度。

DRQ · ⭐ 7/10

Decoupled Residual Quantization for Robust Semantic IDs in Recommendation

🎓 学术 · 生成式推荐

提出诊断 Semantic ID tokenizer 失效的量化框架（期望重叠率 O_π / 有效码本容量 K_eff，把失效拆成分布惩罚与几何惩罚），并以解耦残差量化 DRQ（无 STE 的 VAE 连续重塑 + 事后层次 K-Means）作为概念验证，在 1500 万物品的工业短视频数据上揭示 tokenizer 质量是符号容量/重构保真/软匹配三者的多目标权衡。

全部论文

模型	标题	类别	公司	摘要分	精读分
FlowTime	FlowTime: Towards Continuous Generative Watch Time Prediction via Flow-based Personalized Priors	判别式	🏢 Kuaishou	7	8
RQ-FSQ	Quantizing Intent: Cross-Domain Semantic IDs from Organic Activity for Industrial Ranking	判别式	🏢 LinkedIn	8	7
LeAP	LeAP: Learnable Adaptive Permutation for Feature Selection in Heterogeneous and Sparse Recommender Systems	判别式	🏢 Bilibili	7	7
DRQ	Decoupled Residual Quantization for Robust Semantic IDs in Recommendation	生成式	🎓 学术	7	7
TDPM	Time-Aware Diffusion based on Preference Disentanglement for Generative Recommendation	生成式	🎓 学术	6	—
—	LLM-Assisted Reranking to Operationalize Nuanced Objectives in Recommender Systems	判别式 / LLM	🎓 学术	5	—
SPHERE	Breaking the Information Silo: Semantic Personas for Cross-Domain Recommendation	判别式 / LLM	🎓 学术	5	—
DART	Test-Time Training for Zero-Resource Dense Retrieval Reranking	其他	🎓 学术	5	—
—	Trustworthy Recommendation in the Era of Large Language Models: Opportunities and Challenges	LLM	🎓 学术	5	—
—	Attention Calibration for Position-Fair Dense Information Retrieval	其他	🎓 学术	5	—
—	When Data Is Scarce: Scaling Sparse Language Models with Repeated Training	LLM	🎓 学术	5	—
—	The Ringelmann Effect in Multi-Agent LLM Systems: A Scaling Law for Effective Team Size	LLM	🎓 学术	4	—
Group RC-DMC	Rank-Constrained Deep Matrix Completion for Group Recommendation	判别式	🎓 学术	4	—
SIMAS	Scaling Behavior of Single LLM-Driven Multi-Agent Systems	LLM	🎓 学术	4	—
—	Information-Theoretic Lower Bounds for Bit-Constrained Stochastic Optimization via a Reduction to Compressed Gaussian Mean Estimation	LLM	🎓 学术	4	—