Specialization: Computer Vision and Remote Sensing
Research focus: Object Detection, Semantic Segmentation
Bachelor’s in Geomatics Engineering
Tongji University
Relevant courses: Photogrammetry, Digital Image Processing, Communication Principles, GIS Development, Numerical Computing, Error Propagation and Adjustment, Data Structures and Databases, C++ Programming
Project Experience:
Semantic Segmentation of Underground Parking Slot using TDA4 (Team Lead)
Led a project involving open-source dataset collection, data preprocessing, and training semantic segmentation models (UNet, Swin-Transformer, DeeplabV3)
Conducted channel-level compression on the UNet network and tested models of various sizes.
Converted the model format from PyTorch to ONNX to TIDL.
Deployed the model on the TDA4 platform, performed INT 8 quantization compression, and conducted performance testing with models of different sizes.
Future work involves collecting a semantic segmentation dataset using a UGV platform for underground parking HD mapping (Multi-sensor fusion).
Pedestrian Tracking, Counting, and Alert System using Deep Learning (Team Lead)
Implemented pedestrian detection, tracking, line-crossing detection, and counting using YOLOv5 and Deepsort.
Utilized serial communication to control alert devices when pedestrian flow exceeded a threshold.
Deployed the system on the Jetson Nano platform and used TenserRT for inference acceleration(3x speedup).
Intelligent Analysis of Tower Crane Top-view Monitoring Videos (Team Lead)
Detected and Classified tower crane loads and ground materials.
Fused data from tower crane motion sensors and gravity sensors to record load segments, generate daily load reports, stitching panoramic map of the construction site and reconstruct a 3D view of the construction site.
Explored traditional feature matching frame difference methods and implemented rotation box object detection using YOLOv5.
Using ONNXRUNTIME for model deployment(C++)
Bare Soil Recognition using a Boosted Cascade of CNN and Manual Features(Team Lead)
Used semantic segmentation model(ResNet50+UPerNet) and manual features(Contifidence, RGB, Hue) to identify the proportion of bare soil in construction site images.
Implemented the program using Python and C++ (OpenCV).
Unmanned Aerial Vehicle Pose Measurement Using High-Speed Camera(Undergraduate Thesis)
Conducted camera calibration for internal orientation elements.
Used a total station to measure control points for external orientation elements.
Analyzed aircraft position and pose information using Photogrammetry and visualize data using Matlab.
Internship Experience:
ShiZhuang Information Technology Co., Ltd.
Image Algorithm Intern
Optimized object detection models for the Image Search project.
Assisted in data cleaning, labeling, and evaluating model accuracy.
AIGC diffusion model research and image2image model generating test.
China Aerospace Science & Industry Corporation(CASIC)