Aneesh Shetty
|
I like building efficient systems that solve complex math problems.
I like math that reveals interesting structures.
I like abstractions that are elegant and simplify complex problems.
And I like to write code that make these things go brrr.
|
Updates
Sep 2024 |
Our work on Finetuning using Singular Vectors was accepted at NeuRIPS 2024 |
May 2024 |
I will be joining Amazon as a Software Development Engineer in Annapurna Labs |
May 2023 |
I will be joining Amazon as a Software Development Engineering Intern this Summer |
Aug 2022 |
I will be working with Prof. Isil Dillig and Prof. Joydeep Biswas as a GRA at UT Austin |
Aug 2022 |
I will be starting my MS in Computer Science at UT Austin |
Aug 2021 |
Presented our work on Scope-Bounded Reachability in Valence Systems at CONCUR 2021 |
Aug 2021 |
Graduate from IIT Bombay with a B.Tech in Computer Science and Minor in Statistics |
Jul 2021 |
I will be joining Adobe as a Software Engineer, working on their Core C++ PDF Library |
Jun 2021 |
Our work on Scope-Bounded Reachability in Valence Systems was accepted at CONCUR 2021 |
Aug 2020 |
I started working as a Teaching Assistant for Automata Theory with Prof. Akshay S. |
|
|
SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors
Aneesh Shetty, Vijay Chandra Lingam, Atula Neerkaje, Aditya Vavre, Gautham Krishna Gudur, Joydeep Ghosh, Eunsol Choi, Alex Dimakis, Aleksandar Bojchevski, Sujay Sanghavi
NeurIPS 2024
[webpage]
[abstract]
[bibtex]
[arXiv]
Popular parameter-efficient fine-tuning (PEFT) methods, such as LoRA and its variants, freeze pre-trained model weights and inject learnable matrices .
These matrices are structured for efficient parameterization, often using techniques like low-rank approximations or scaling vectors.
However, these methods typically show a performance gap compared to full fine-tuning.
Although recent PEFT methods have narrowed this gap, they do so at the cost of additional learnable parameters.
We propose SVFT, a simple approach that fundamentally differs from existing methods: the structure imposed on depends on the specific weight matrix .
Specifically, SVFT updates as a sparse combination of outer products of its singular vectors, training only the coefficients (scales) of these sparse combinations.
This approach allows fine-grained control over expressivity through the number of coefficients.
Extensive experiments on language and vision benchmarks show that SVFT recovers up to 96% of full fine-tuning performance while training only 0.006 to 0.25% of parameters, outperforming existing methods that only recover up to 85% performance using 0.03 to 0.8% of the trainable parameter budget.
@inproceedings{NEURIPS2024_48c368f1,
author = {Lingam, Vijay Chandra and Neerkaje, Atula and Vavre, Aditya and Shetty, Aneesh and Gudur, Gautham Krishna and Ghosh, Joydeep and Choi, Eunsol and Dimakis, Alex and Bojchevski, Aleksandar and Sanghavi, Sujay},
booktitle = {Advances in Neural Information Processing Systems},
editor = {A. Globerson and L. Mackey and D. Belgrave and A. Fan and U. Paquet and J. Tomczak and C. Zhang},
pages = {41425--41446},
publisher = {Curran Associates, Inc.},
title = {SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors},
url = {https://proceedings.neurips.cc/paper_files/paper/2024/file/48c368f105e8145b945227b73255635a-Paper-Conference.pdf},
volume = {37},
year = {2024}
}
|
|
Scope-bounded Reachability in Valence Systems
Aneesh Shetty, Krishna S., Georg Zetzsche
CONCUR 2021
[webpage]
[abstract]
[bibtex]
[arXiv]
Multi-pushdown
systems
are a standard model for concurrent recursive programs, but they have an undecidable reachability
problem. Therefore, there have been several proposals to underapproximate their sets of runs so
that
reachability in this underapproximation becomes decidable. One such underapproximation that covers
a
relatively high portion of runs is scope boundedness. In such a run, after each push to stack i,
the
corresponding pop operation must come within a bounded number of visits to stack i.
In this work, we generalize this approach to a large class of infinite-state systems. For this, we
consider the model of valence systems, which consist of a finite-state control and an
infinite-state
storage mechanism that is specified by a finite undirected graph. This framework captures
pushdowns,
vector addition systems, integer vector addition systems, and combinations thereof. For this
framework, we propose a notion of scope boundedness that coincides with the classical notion when
the
storage mechanism happens to be a multi-pushdown.
We show that with this notion, reachability can be decided in PSPACE for every storage mechanism
in
the framework. Moreover, we describe the full complexity landscape of this problem across all
storage
mechanisms, both in the case of (i) the scope bound being given as input and (ii) for fixed scope
bounds. Finally, we provide an almost complete description of the complexity landscape if even a
description of the storage mechanism is part of the input.
@misc{shetty2021scopebounded,
title={Scope-Bounded Reachability in Valence Systems},
author={Aneesh K. Shetty and S. Krishna and Georg Zetzsche},
year={2021},
eprint={2108.00963},
archivePrefix={arXiv},
primaryClass={cs.FL}
}
|
Projects
-
Automatic Visual Question Generation and Answering for Image Descriptions: We used LLM as
question generator and VQA model as answerer to produce substantially descriptive paragraphs for
images,
and ground the generated questions using Image Segmentation.
[report]
-
GNN: A Survey on Architectures and Optimization: We wrote a term paper on different GNN
architectures and various optimzations to speed them up.
[report]
-
Optimizing cp -r: We used io_uring released in Linux 5.1 to wrote a faster implementation of cp
-r.
[report]
|
|