月薪62776~96390元 台北市南港區 工作經歷不拘 9天前更新
Research on Optimization of Deep Learning Model Inference and Training
The Computer Systems Laboratory - Machine Learning Systems team focuses on research areas including parallel and distributed computing, compilers, and computer architecture. We aim to leverage computer system technologies to accelerate the inference and training of deep learning models and develop optimizations for next-generation AI models. Our research emphasizes the following:
1. AI Model Compression and Optimization
Model compression techniques (e.g., pruning and quantization) reduce the size and computational demands of AI models, which are crucial for resource-constrained platforms such as embedded systems and memory-limited AI accelerators. We aim to explore:
* AI compiler: deployment methods for compressed models across servers, edge devices, and heterogeneous systems.
* High performance computing: efficient execution of compressed models on processors with advanced AI extensions, e.g., Intel AVX512, ARM SVE, RISC-V RVV, and tensor-level accelerations on GPUs and NPUs.
2. AI Accelerator Design
We aim to design AI accelerators for accelerating AI model inference, focusing on software and hardware co-design and co-optimization.
3. Optimization of AI Model Inference in Heterogeneous Environments
Computer architectures are evolving toward heterogeneous multi-processor designs (e.g., CPUs + GPUs + AI accelerators). Integrating heterogeneous processors to execute complex models (e.g., hybrid models, multi-models, and multi-task models) with high computational efficiency poses a critical challenge. We aim to explore:
* Efficient scheduling algorithms.
* Parallel algorithms for the three dimensions: data parallelism, model parallelism, and tensor parallelism.
薪資:博士62776元起聘,該職缺係「適用勞動基準法」
展開 員工餐廳員工電影週休二日勞保健保