I am a Principal Research Scientist specializing in Multimodal Large Language Models (LLMs). Previously, I was a Research Scientist and Tech Lead at ByteDance Seed, where I focused on multimodal LLMs and Reinforcement Learning. I received my PhD from Nanyang Technological University (NTU, Singapore) in 2022, with a thesis on “Adversarial Robustness of Deep Reinforcement Learning” supervised by Prof Yew-Soon Ong and Dr. Abhishek Gupta.

I have published 50+ papers Citations at top-tier AI conferences such as NeurIPS, ICML, and ICLR.

“Your time is limited, so don’t waste it living someone else’s life.” — Steve Jobs

Recent news

  • 🎉 Two papers accepted by ICLR 2026. ParaS2S, VLM Model Merge.
  • 🏆 Received Outstanding Team award (Spot Bonus in Q3) from Bytedance Seed.
  • 🎉 Our Bytedance Seed tech report about Multimodal LLM Agent has been released.
  • 🎉 Two papers accepted by NeurIPS-2025. “Robust SuperAlignment” selected as spotlight.
  • 🎉 Google scholar citation achieved 1000.
  • 🎉 Two papers accepted by ICCV-2025; One ICCV paper was selected as highlight.
  • 🏆 Received Spot Bonus award for Breakthrough In New Area from Bytedance Seed (rate=2/366).
  • 🎉 One paper accepted by ICML-2025 as spotlight (acceptance rate=2.6%).
  • 🎉 One paper accepted by ICLR-2025.
  • 🚀 [Feb 2025] As the tech lead and main contributor, our speech to speech LLM trained using Reinforcement Learning has been released on Doubao 豆包.
  • 🎉 One paper accepted by AAAI-2025.
  • 🎉 One paper accepted by KDD-2025.
  • 🔥 Our paper was nominated as the best paper in SIGIR-2024.
  • 🎉 Two papers accepted by SIGIR-2024.
  • 🏆 Received outstanding employee award from Shanda gruop (rate=4/360).
  • 🔥 My research work on audio watermarking was recently featured on Linkedin
  • 🎉 One paper accepted by NeurIPS-2023.
  • 🔥✨ [Aug 2023] I was invited by AI-TIME to give a talk.
  • 🔥 Our work on ChatGPT driven Voice base Conversational Recommender Systems has been posted by several Chinese media platforms, such as 语音之家, 火山语音
  • 🔥 Our work about Audio QR Code has been highlighted by IJCAI-2023 officially 🔬Linked-in, Facebook, Twitter.
  • 🎉 One paper accepted by InterSpeech-2023.
  • 🎉 One paper accepted by IJCAI-2023.
  • 🎉 One paper accepted by SIGIR-2023.
  • 🎉 One paper accepted by IEEE TPAMI.
  • 🎉 One paper accepted by CIKM-2022.
  • 🔥 My research papers are reported by several Chinese media platforms, such as PaperWeekly, 语音之家,深科技,火山引擎.
  • 🎉 One paper accepted by IEEE TPAMI.
  • 🎉 Two paper accepted by SIGKDD-2022.
  • 🎉 One paper accepted by IJCAI-2022.
  • 🎉 One paper accepted by SIGIR-2022.
  • 🎉 One paper accepted by AAMAS-2022.
  • 🔥 My research on “adversarial robustness of deep reinforcement learning” has been featured with title “Expecting the Unexpected from AI”.
  • 🏆 Received Research Highlight Award from ASTAR Singapore.

Experience

  • 💼 [Jun 2024] I returned to Bytedance Seed as a research scientist and tech lead to build up the reinforcement learning pipeline for speech to speech LLMs.
  • 💼 [Aug 2023] I joined Shanda Group as an AI scientist inspired by Mr Chen Tianqiao’s ambition on AI, working on internal start-up and venture capital investment consultation on AI.
  • 🎓 [Jan 2022]I finished my oral defense and received my PhD degree. Many thanks to my supervisor Prof Yew-Soon Ong and many other collaborators.
  • 💼 [June 2021] I joined Bytedance AI lab (Singapore) as a research scientist
  • 🎓📜 [May 2021] PhD thesis submission in NTU