Muxuan Li

B.Sc., University of Electronic Science and Technology of China

InEase28@gmail.com

I am a self-motivated and fast learner who enjoys solving problems and learning new skills. I have strong communication and teamwork skills, as well as a keen eye for design and user experience. I am always eager to take on new challenges and explore new opportunities in the software engineering field.

Also, I am fascinated by the power and potential of large language models (LLMs). I have been following the latest developments and innovations in this field, and I am eager to contribute to the advancement of LLMs and their applications. I believe that LLMs are the future of natural language processing, and I want to be part of this exciting journey.

Interests

Large Language Models
Machine Learning
Computer Vision
Front-end & Back-end Development

Projects

You can also check my github for the complete list

2023-05-27, Zero-shot Video Generation Application Based on Diffusion Model, Pose Control and Cross-frame Attention

Diffusion Models ControlNet Video Generation Edge Deployment Gradio

Proposed a text-to-video generation method based on diffusion model. This method can achieve zero-shot video generation without any training by just adjusting the existing diffusion model.

Used cross-frame attention mechanism to maintain the consistency of scene and background in the generated video, improving the video quality and coherence
Used ControlNet to control the generation process. It can control the shape, position, size and other attributes of objects in the video by using edge, pose, semantic mask, image depth and other ways
Highlight Contribution - Implemented edge deployment on NVIDIA Jetson Orin NV development board and built a front-end interactive page using Gradio, demonstrating the feasibility and effect of the model on low-power devices

2023-03-15, Alumni Data Platform

Vue Feishu Open Platform Data Analysis

As the main person in charge of the alumni data platform project in our department, I was responsible for requirement analysis, solution design, data processing, platform development and testing. Using Feishu system as a data storage and visualization tool, I successfully cleaned, integrated and standardized the original data, eliminating errors, omissions and inconsistencies. I created a user-friendly website using Vue framework. The website provides some special features, such as batch import/export, data analysis and statistics. The final product received high praise from our teachers and alumni, achieving data compatibility, relevance, sharing, security and low development maintenance cost. It now has nearly 100,000 alumni data collected from the founding of the school to the present.

Highlight Contribution - Developed front-end interactive interface for alumni data platform project, using Vue + Flask to complete image upload, image annotation, model prediction, feedback and other functions.

2022-09-12, Kaggle: Multi-Organ Functional Tissue Segmentation

Kaggle Segmentation Transfer Learning Data Augmentation Computer Vision

Built a highly generalizable model and performed transfer learning to cope with the challenges of different datasets from different institutions, different annotation methods, and different datasets in the competition.

Used Resnext-50, SegFormer, mit-b2, coat and other models for multi-level feature extraction and semantic segmentation
Used various data augmentation methods such as Cutmix, staining and normalization tools to expand the dataset, as well as manual annotation.
Used various techniques and innovations such as manual annotation of lung dataset, staining and normalization tools, multiple initialization, pseudo-labeling etc. to improve the accuracy and efficiency of the model
Used pseudo-labeling. TTA, threshold adjustment, ensemble learning and other methods to optimize the model parameters and results
Improved from 0.68 to 0.82 on the private leaderboard similarity score, ranking 20/1175 (as of September 12th, 2022)

kaggle

2021-05-30, Pneumonia Detection System / Fundus Disease Detection

Medical Image Computer Vision Deep Learning Flask Vue

In the summer research project at NTU, for the pneumonia CT medical image detection problem, used ResNet model to identify and detect pneumonia symptoms, and used Grad-CAM and Guided Grad-CAM methods to generate attention heat maps and explain the model’s decision process, achieving 90% accuracy

Highlight Contribution - Responsible for the development of the front-end interactive interface, using Vue + Flask to complete the functions of image upload, image annotation, model prediction, result feedback and so on

code