ZHUHAI

工作经历

阿里国际 AliExpress · 高级推荐算法工程师 2026.04 - 至今

负责Normalization优化，提出UniNorm工作，主导推荐系统特征归一化方法设计与落地。
负责生成式排序Transformer模型迭代优化

阿里国际 AliExpress · 推荐算法工程师 2024.07 - 2026.04

专注推荐精排模型商品建模方向，主攻物流偏好建模，深度参与多价模型等工作，覆盖多个核心业务场景，服务线上用户。
主导语义ID在排序模型中的检索与落地，设计并实现GateSID优化方案，提升新商品冷启动曝光率。
参与生成式排序Transformer模型（SORT）的设计与落地实现订单+6.35%、GMV+5.47%的全场景业务提升。

阿里国际 Lazada · 搜索算法工程师（实习） 2023.07 - 2024.07

针对传统搜索相关性计算仅基于文本信息的局限，负责利用多模态大模型提升搜索相关性，提出Query-LIFE方法。
实现基于查询感知的图文模态融合，有效结合图像和标题信息，增强商品表示准确性。
线上取得订单UV提升3.06%、GMV提升3.19%，成果发表于COLING 2025。

项目经历

统一归一化（UniNorm） 2026.01 - 2026.05

背景：推荐系统中特征分布差异大，传统Normalization方法难以同时兼顾泛化性与表达能力。
任务：设计统一归一化方法，提升模型收敛速度和泛化能力。
行动：提出UniNorm框架，针对不同特征域设计自适应归一化策略。
结果：有效提升模型训练稳定性与线上效果。

语义ID建模（GateSID） 2025.07 - 2026.01

背景：推荐冷启动场景中，排序模型面临协同信号与语言信号权衡难题，新商品因缺少协同信息难以获得足够曝光。
任务：设计能根据商品成熟度动态平衡语言信息与协同信号的推荐框架，提升曝光效率。
行动：提出GateSID框架，利用VQ-VAE将多模态特征离散化为分层语言ID，并构建用户语言ID行为序列；设计门控融合机制，对冷启动商品依据语言信息，对热门商品保留协同信号。
结果：GateSID在所有指标上均优于SOTA基线（COINS、SaviorRec），CTCVR AUC+0.4%、GMV+2.6%、订单量+1.6%，成果投稿SIGIR 2026。成果投稿SIGIR2026

业务级推荐系统排序Transformer模型（SORT） 2025.07 - 至今

背景：单领域数据独立训练导致用户全域意图建模缺失，传统排序模型scaling up收益递减。
任务：设计并落地基于生成式架构的transformer排序大模型，突破传统模型天花板。
行动：实现请求级样本组织、局部注意力；改进transformer架构，引入特殊token、QKNorm、Attention Gate和Sparse MoE模块；优化训练-推理系统，联动工效团队开发注意力算子、算子融合，MFU从3%提升至12%。
结果：线上3万/B测试订单+6.35%、GMV+5.47%；服务效率提升：延迟降低44.67%，吞吐量+121.33%；模型性能：CTR-AUC提升2.1pt，在数据/模型/序列长度维度均展现优异扩展性。详见

搜索相关性多模态（Query-LIFE） 2023.07 - 2024.07

背景：传统相似度计算方法仅基于产品标题和查询建模，无法充分利用图像信息，影响搜索准确度。
任务：设计多模态相似度计算方法，提升搜索结果的准确性和用户体验。
行动：提出Query-LIFE方法，采用基于查询的图文模态融合，有效结合图像和标题信息，利用查询感知的模态对增强商品表示准确度。
结果：线上3万/B测试订单UV提升3.06%、GMV提升3.19%，成果发表于COLING 2025

论文发表

Zhu H, Guo Y, Dou R, et al. Query-LIFE: Query-aware Language Image Fusion Embedding for E-Commerce Relevance[C]//Proceedings of the 31st International Conference on Computational Linguistics: Industry Track. 2025: 21-28.
Zhu H, Zhao Q, Shang W, et al. Limeattack: Local explainable method for textual hard-label adversarial attack[C]//Proceedings of the AAAI conference on artificial intelligence. 2024, 38(17): 19759-19767.
Zhu H, Zhao Q, Wu Y. Beamattack: Generating high-quality textual adversarial examples through beam search and mixed semantic spaces[C]//Pacific-Asia Conference on Knowledge Discovery and Data Mining. Cham: Springer Nature Switzerland, 2023: 454-465.
Wang Z, Zheng Y, Zhu H, et al. Transferable adversarial examples can efficiently fool topic models[J]. Computers & Security, 2022, 118: 102749.

教育背景

中国科学技术大学 · 计算机技术（硕士） 2021.09 - 2024.06

省优秀毕业生、中科院院长奖学金、国家奖学金

杭州电子科技大学 · 信息安全（本科） 2017.09 - 2021.06

省优秀毕业生、省政府奖学金、大学生数模竞赛国家二等奖/深圳杯数模国二

Work Experience

AliExpress, Alibaba International · Senior Recommendation Algorithm Engineer 2026.04 - Present

Lead Normalization optimization, propose UniNorm, design and implement feature normalization methods for recommendation systems.
Continuously focus on product modeling in recommendation ranking models, specializing in logistics preference modeling, multi-price models, covering multiple core business scenarios, serving online users.
Lead the retrieval and implementation of Semantic ID in ranking models, design and implement GateSID optimization scheme, improving exposure rate of new products in cold start.
Participate in the design and implementation of generative ranking Transformer model (SORT), achieving +6.35% orders and +5.47% GMV improvement across all scenarios.

AliExpress, Alibaba International · Recommendation Algorithm Engineer 2024.07 - 2026.04

Focus on product modeling in recommendation ranking models, specializing in logistics preference modeling, deeply involved in multi-price models, covering multiple core business scenarios, serving online users.
Lead the retrieval and implementation of Semantic ID in ranking models, design and implement GateSID optimization scheme, improving exposure rate of new products in cold start.
Participate in the design and implementation of generative ranking Transformer model (SORT), achieving +6.35% orders and +5.47% GMV improvement across all scenarios.

Lazada, Alibaba International · Search Algorithm Engineer (Intern) 2023.07 - 2024.07

Addressing the limitation of traditional search relevance calculation based only on text information, responsible for leveraging multimodal large models to improve search relevance, proposing Query-LIFE method.
Implement query-aware image-text modal fusion, effectively combining image and title information to enhance product representation accuracy.
Achieved 3.06% UV order improvement and 3.19% GMV improvement online, results published at COLING 2025.

Project Experience

Unified Normalization (UniNorm) 2026.01 - 2026.05

Background: In recommendation systems, features have large distribution differences, making it difficult for traditional normalization methods to balance generalization and expressiveness.
Task: Design unified normalization method to improve model convergence and generalization.
Action: Propose UniNorm framework, design adaptive normalization strategies for different feature domains.
Result: Effectively improved model training stability and online performance.

Semantic ID Modeling (GateSID) 2025.07 - 2026.01

Background: In recommendation cold start scenarios, ranking models face the trade-off challenge between collaborative signals and linguistic signals. New products struggle to get sufficient exposure due to lack of collaborative information.
Task: Design a recommendation framework that dynamically balances linguistic information and collaborative signals based on product maturity to improve exposure efficiency.
Action: Propose GateSID framework, use VQ-VAE to discretize multimodal features into hierarchical language IDs, and build user language ID behavior sequences; design gating fusion mechanism to rely on linguistic information for cold start products and retain collaborative signals for popular products.
Result: GateSID outperforms SOTA baselines (COINS, SaviorRec) on all metrics, CTCVR AUC +0.4%, GMV +2.6%, orders +1.6%. Submitted to SIGIR 2026 and KDD 2026. Paper

Business-level Recommendation System Ranking Transformer Model (SORT) 2025.07 - Present

Background: Single-domain independent training leads to lack of user global intent modeling, and traditional ranking models show diminishing returns in scaling up.
Task: Design and deploy a generative architecture-based transformer ranking large model to break through traditional model ceilings.
Action: Implement request-level sample organization and local attention; improve transformer architecture by introducing special tokens, QKNorm, Attention Gate, and Sparse MoE modules; optimize training-inference system, collaborate with efficiency team to develop attention operators and operator fusion, improving MFU from 3% to 12%.
Result: Online 30k/B test shows orders +6.35%, GMV +5.47%; Service efficiency: latency reduced by 44.67%, throughput increased by 121.33%; Model performance: CTR-AUC improved by 2.1pt, showing excellent scalability in data/model/sequence length dimensions. Paper

Search Relevance Multimodality (Query-LIFE) 2023.07 - 2024.07

Background: Traditional similarity calculation methods model only based on product titles and queries, unable to fully utilize image information, affecting search accuracy.
Task: Design multimodal similarity calculation method to improve search result accuracy and user experience.
Action: Propose Query-LIFE method, adopt query-based image-text modal fusion, effectively combine image and title information, and use query-aware modal pairs to enhance product representation accuracy.
Result: Online 30k/B test shows 3.06% UV order improvement and 3.19% GMV improvement, results published at COLING 2025

Publications

Zhu H, Guo Y, Dou R, et al. Query-LIFE: Query-aware Language Image Fusion Embedding for E-Commerce Relevance[C]//Proceedings of the 31st International Conference on Computational Linguistics: Industry Track. 2025: 21-28.
Zhu H, Zhao Q, Shang W, et al. Limeattack: Local explainable method for textual hard-label adversarial attack[C]//Proceedings of the AAAI conference on artificial intelligence. 2024, 38(17): 19759-19767.
Zhu H, Zhao Q, Wu Y. Beamattack: Generating high-quality textual adversarial examples through beam search and mixed semantic spaces[C]//Pacific-Asia Conference on Knowledge Discovery and Data Mining. Cham: Springer Nature Switzerland, 2023: 454-465.
Wang Z, Zheng Y, Zhu H, et al. Transferable adversarial examples can efficiently fool topic models[J]. Computers & Security, 2022, 118: 102749.

Education

University of Science and Technology of China (USTC) · Computer Technology (Master) 2021.09 - 2024.06

Provincial Outstanding Graduate, CAS President Scholarship, National Scholarship

Hangzhou Dianzi University · Information Security (Bachelor) 2017.09 - 2021.06

Provincial Outstanding Graduate, Provincial Government Scholarship, National Second Prize in Undergraduate Mathematical Modeling Competition/Shenzhen Cup National Second Prize