Dinghao YANG   杨丁豪

Algorithm Engineer, Lepton AI


Hangzhou, China

Email: dinghow@lepton.ai;
             dinghowyang@gmail.com;
Google Scholar: Google Scholar Link
Github: https://github.com/Dinghow
Personal Blog: https://dinghow.site/dinghow-blog

Biography

I am an algorithm engineer at Lepton AI, focusing on the inference optimization of deep learning models (e.g. LLM). I'm lucky to work with Jiaxin Cao, Junjie Bai and many other talented geeks. Before that, I worked at the Machine Learning Platform for AI, Alibaba Cloud, majored in the development of a fancy LLM inference engine for Qwen. I received the M.S degree from the School of Electronic and Computer Engineering, Peking University, in 2023, and the B.E. degree from the School of Software Engineerning, Tongji University, in 2020. I used to be a research intern at Data and Computing Platform, SenseTime Research, during 2020-2022, supervised by Bin Wang and Weijia Li.

My research interests include AI infra (training/inference optimization), point cloud analysis, and image segmentation/matting.

You can find my Curriculum Vitae here (may not be updated in time).

Experience

Publications

Preprint Papers

Unified Interactive Image Matting
Dinghao Yang, Bin Wang, Weijia Li, Yiqi Lin, Conghui He
Under Review
[PDF] [Code]

Published Papers

PointCHD: A Point Cloud Benchmark for Congenital Heart Disease Classification and Segmentation
Dinghao Yang, Wei Gao
IEEE Journal Of Biomedical & Health Informatics (JBHI)
[PDF] [Code]
Exploring the User Guidance for More Accurate Building Segmentation from High-Resolution Remote Sensing Images
Dinghao Yang, Bin Wang, Conghui He, Weijia Li
International Journal of Applied Earth Observation and Geoinformation (JAG)
[PDF] [Code]
Exploiting Manifold Feature Representation for Efficient Classification of 3D Point Clouds
Dinghao Yang, Wei Gao, Ge Li, Hui Yuan, Junhui Hou, Sam Kwong
ACM Transactions on Multimedia Computing, Communications and Applications (TOMM) [PDF] [Code]

Projects

BladeLLM
BladeLLM is a high-performance LLM inference engine of Alibaba Cloud, used for the deployment of Qwen series. I am responsible for the modeling of LLM models, optimization of position embedding, interface development and triton kernel development.
[Report (CN)]
Intelligent Annotation for SenseBee
I am principally responsible for the interactive data annotation algorithms for SenseBee, the data platform for SenseTime AI research. We propose and deploy deep learning-based algorithms to accelerate manual data annotations (i.e. image segmentation, image matting, 3D object detection).
[Code]
OpenPointCloud
I participate in OpenPointCloud, an open-source algorithm library of deep learning-based point cloud compression & processing, mainly contrbute in the manifold learning-based point cloud representation learning method.
[Code]

Awards

Academic Activities