[GSoC 2025]  Explore Transformer-Based Architectures

## Background
Our current diffusion model for DIA-MS data deconvolution uses a U-Net-based architecture. While effective, recent advances in Vision Transformers (ViTs) and other transformer-based architectures have shown promising results in many domains. This task involves exploring whether transformer-based architectures could provide performance improvements for our specific use case of MS signal deconvolution. We have previously tried a [custom transformer backbone](https://github.com/Roestlab/diffusion-deconvolution-dia-msms-data/blob/53960a5db67b62d8382c745386da6fd74e6aa6d8/dquartic/model/building_blocks.py#L128).

## Task Objectives

- Implement one or more transformer-based architectures (e.g., ViT, Swin Transformer) as alternative backbones for our diffusion model
- Adapt these architectures to work with our 1D/2D MS data representation
- Train and evaluate the transformer-based models against our U-Net baseline
- Analyze trade-offs in performance, training time, and resource requirements

## Technical Details

- MS data has unique characteristics that may require architectural adaptations of standard transformer models
- Consider attention mechanisms specifically suited for spectral data
- Focus initially on smaller transformer variants to enable rapid experimentation

##  Deliverables

- Implementation of at least one transformer-based architecture compatible with our existing pipeline
- Training and evaluation scripts for the new architecture(s)
- Performance comparison with our current U-Net baseline, including:
  - Deconvolution quality metrics
  - Convergence speed
  - Inference time
  - Memory requirements
- Analysis of the strengths and weaknesses of different architectures for our specific problem

## Resources

- Current model implementation in our repository
- [Vision Transformer documentation](https://huggingface.co/docs/transformers/model_doc/vit)

## Difficulty
Advanced - This task requires deep understanding of model architectures and their adaptation to specialized data formats.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GSoC 2025] Explore Transformer-Based Architectures #18

Background

Task Objectives

Technical Details

Deliverables

Resources

Difficulty

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[GSoC 2025] Explore Transformer-Based Architectures #18

Description

Background

Task Objectives

Technical Details

Deliverables

Resources

Difficulty

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions