You can also find my articles on my Google Scholar profile.


LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning


Efficient Domain-Adaptive Multi-Task Dense Prediction with Vision Foundation Models


QWHA: Quantization-Aware Walsh-Hadamard Adaptation for Parameter-Efficient Fine-Tuning on Large Language Models


Retrospective Sparse Attention for Efficient Long-Context Generation


Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation


Dynamic Graph Structure Estimation for Learning Multivariate Point Process using Spiking Neural Networks


Has the Deep Neural Network Learned the Stochastic Process? An Evaluation Viewpoint