Skip to content

[Feature Request] Add default */dataset/* lfs tracking #263

@Hannah-Doerpholz

Description

@Hannah-Doerpholz

Is your feature request related to a problem? Please describe.
When an ARC is newly created using arc init, no .gitattributes file is created. While later running arc sync, files below the git lfs threshold (should be 25 MB atm) get tracked and each file is individually added to a new .gitattributes file. However, we recently saw an instance where 15 Gb worth of files were uploaded to GitLab without being tracked by LFS, because while there were many very small files, in total they amounted to a lot of space.

Describe the solution you'd like
It would be good to automatically add a .gitattributes file to track all dataset files regardless of file size, as it already is with the ARCitect. Essentially, this file should already contain the line **/dataset/** filter=lfs diff=lfs merge=lfs -text.

Describe alternatives you've considered
Going back and performing the steps to rewrite the git history to remove the wrongly committed and pushed files is not optimal to track the already pushed files with LFS later on.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    Status

    No status

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions