On Fine-Tuning Submodular Functions for Data Subset Selection
| dc.contributor.advisor | Bilmes, Jeffrey | |
| dc.contributor.author | Bhalerao, Megh Manoj | |
| dc.date.accessioned | 2024-09-09T23:08:18Z | |
| dc.date.issued | 2024-09-09 | |
| dc.date.submitted | 2024 | |
| dc.description | Thesis (Master's)--University of Washington, 2024 | |
| dc.description.abstract | We demonstrate that submodular functions, with fine-tuned hyperparameters, serve as extremely effectivedata subset (i.e., summary) selectors, better than the current state-of-the-art, for training machine learning systems on data subsets. To search and reduce the hyperparameter space, we introduce meta-summarization a technique designed to enhance computational efficiency of hyperparameter tuning. Meta-summarization chooses a subset of summaries based on their inter-summary diversity starting from a large set of generated summary candidates. This significantly reduces the summaries to train on relative to training on all of them. This approach enables meta-summarization to find the best performing hyperparameters for a submodular function faster than other hyperparameter search techniques, significantly reducing computation and time. We demonstrate that summaries generated using fine-tuned submodular functions outperform subset selection benchmarks such as DC-Bench (by ≈ 3% absolute) and DeepCore (by ≈ 2% absolute). Fine tuned submodular functions also outperform random and state-of-the-art k-means based subset selection for training a popular ViT-based (vision transformer) architecture, DaViT, on ImageNet, thus setting a new state-of-the-art for supervised subset selection. | |
| dc.embargo.lift | 2025-09-09T23:08:18Z | |
| dc.embargo.terms | Restrict to UW for 1 year -- then make Open Access | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.other | Bhalerao_washington_0250O_26698.pdf | |
| dc.identifier.uri | https://hdl.handle.net/1773/51966 | |
| dc.language.iso | en_US | |
| dc.rights | none | |
| dc.subject | Data Subset Selection | |
| dc.subject | Deep Learning | |
| dc.subject | Machine Learning | |
| dc.subject | Submodular Functions | |
| dc.subject | Computer science | |
| dc.subject.other | Electrical and computer engineering | |
| dc.title | On Fine-Tuning Submodular Functions for Data Subset Selection | |
| dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Bhalerao_washington_0250O_26698.pdf
- Size:
- 2.66 MB
- Format:
- Adobe Portable Document Format
