A comprehensive survey of techniques for scaling machine learning systems across distributed computing environments, covering both data and model parallelism approaches.
Papers2024
Efficient Parameter-Efficient Fine-Tuning for Large Language Models
We propose a novel method for parameter-efficient fine-tuning of large language models that reduces memory requirements by 70% while maintaining comparable performance to full fine-tuning.
We investigate adversarial attack vectors in state-of-the-art computer vision models and propose a new defense mechanism that improves robustness by 35%.