Dinghao YANG 杨丁豪Algorithm Engineer, Lepton AIHangzhou, China Email: dinghow@lepton.ai; dinghowyang@gmail.com; Google Scholar: Google Scholar Link Github: https://github.com/Dinghow Personal Blog: https://dinghow.site/dinghow-blog |
|
I am an algorithm engineer at Lepton AI, focusing on the inference optimization of deep learning models (e.g. LLM). I'm lucky to work with Jiaxin Cao, Junjie Bai and many other talented geeks. Before that, I worked at the Machine Learning Platform for AI, Alibaba Cloud, majored in the development of a fancy LLM inference engine for Qwen. I received the M.S degree from the School of Electronic and Computer Engineering, Peking University, in 2023, and the B.E. degree from the School of Software Engineerning, Tongji University, in 2020. I used to be a research intern at Data and Computing Platform, SenseTime Research, during 2020-2022, supervised by Bin Wang and Weijia Li.
My research interests include AI infra (training/inference optimization), point cloud analysis, and image segmentation/matting.
You can find my Curriculum Vitae here (may not be updated in time).
Unified Interactive Image Matting
Dinghao Yang, Bin Wang, Weijia Li, Yiqi Lin, Conghui He Under Review [PDF] [Code] |
PointCHD: A Point Cloud Benchmark for Congenital Heart Disease Classification and Segmentation
Dinghao Yang, Wei Gao IEEE Journal Of Biomedical & Health Informatics (JBHI) [PDF] [Code] |
|
Exploring the User Guidance for More Accurate Building Segmentation from High-Resolution Remote Sensing Images
Dinghao Yang, Bin Wang, Conghui He, Weijia Li International Journal of Applied Earth Observation and Geoinformation (JAG) [PDF] [Code] |
|
Exploiting Manifold Feature Representation for Efficient Classification of 3D Point Clouds
Dinghao Yang, Wei Gao, Ge Li, Hui Yuan, Junhui Hou, Sam Kwong ACM Transactions on Multimedia Computing, Communications and Applications (TOMM) [PDF] [Code] |
BladeLLM
BladeLLM is a high-performance LLM inference engine of Alibaba Cloud, used for the deployment of Qwen series. I am responsible for the modeling of LLM models, optimization of position embedding, interface development and triton kernel development. [Report (CN)] |
|
Intelligent Annotation for SenseBee
I am principally responsible for the interactive data annotation algorithms for SenseBee, the data platform for SenseTime AI research. We propose and deploy deep learning-based algorithms to accelerate manual data annotations (i.e. image segmentation, image matting, 3D object detection). [Code] |
|
OpenPointCloud
I participate in OpenPointCloud, an open-source algorithm library of deep learning-based point cloud compression & processing, mainly contrbute in the manifold learning-based point cloud representation learning method. [Code] |