Releases: lhotse-speech/lhotse

v1.33.0 - Himalayan Vulture

20 Apr 13:11

Dependencies

Recipes

New features

  • Support loading multiple non-overlapping custom recordings in MixedCut by @pzelasko in #1553
  • Respect LHOTSE_IO_BACKEND for reading AudioSource(type='url'); update docs by @pzelasko in #1557
  • Add AudioSamples(mono_downmix=True) to handle mixed single/multi channel batches gracefully by @pzelasko in #1563
  • Chunking functionality by @nune-tadevosyan in #1556
  • Add CutSet.mix(..., tag="noise"), MixTrack.{is_snr_reference,mute} bools, and MixedCut.unmix(tag=...) by @pzelasko in #1559
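The `mono_downmix=True` option from #1563 is described as a way to handle batches that mix single-channel and multi-channel audio. The sketch below illustrates the general idea only, in plain Python with no Lhotse dependency. The helper `downmix_to_mono` is hypothetical, and averaging across channels is an assumption: the release note does not say which downmix rule Lhotse applies.

```python
from typing import List, Sequence


def downmix_to_mono(audio: Sequence[Sequence[float]]) -> List[float]:
    """Average a (channels x samples) signal into one channel.

    Hypothetical helper for illustration; not part of Lhotse.
    """
    if not audio:
        raise ValueError("expected at least one channel")
    num_samples = len(audio[0])
    if any(len(channel) != num_samples for channel in audio):
        raise ValueError("all channels must have the same number of samples")
    num_channels = len(audio)
    # Per-sample mean over channels; a mono input passes through unchanged.
    return [
        sum(channel[i] for channel in audio) / num_channels
        for i in range(num_samples)
    ]


# A batch with one mono and one stereo item ends up with a uniform shape.
mono = [[0.1, 0.2, 0.3]]
stereo = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]
batch = [downmix_to_mono(item) for item in (mono, stereo)]
```

After the downmix, every item is a flat list of samples of the same length, so the items can be stacked without special-casing the channel dimension.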

Fixes

New Contributors

Full Changelog: v1.32.2...v1.33.0

v1.32.2 - Blood Pheasant

14 Jan 15:09
e0a36fc

What's Changed

  • NSF grant acknowledgment in README.md by @pzelasko in #1539
  • Fix CutSampler initialization for newer PyTorch versions by @pzelasko in #1543

Full Changelog: v1.32.1...v1.32.2

v1.32.1 - Blood Pheasant

24 Nov 16:43

Fix issue with importing Lhotse v1.32.0 on Windows.

Full Changelog: v1.32.0...v1.32.1

v1.32.0 - Blood Pheasant

21 Nov 20:04

Recipes

New features

  • Add new augmentation: codec compression (GSM, Opus, Vorbis, MP3) by @racoiaws in #1510
  • Add new augmentation: lowpass using back-and-forth resampling via libsox by @racoiaws in #1511
  • Support cut.load_custom_video() and collate_video(..., recording_field='custom_video') by @pzelasko in #1525
  • Add AISBatchLoader for efficient batch data loading from AIStore by @gaikwadabhishek in #1529
  • Add AIStore batch loading support in AudioSamples class by @gaikwadabhishek in #1534

Fixes and enhancements

New Contributors

Full Changelog: 1.31.0...v1.32.0

v1.31.1 - Expedition 33

18 Sep 21:42
eb06518

(Skipping 1.31.0: I accidentally uploaded and then deleted that version on PyPI in the past, so the version number can't be reused.)

New recipes

New features

Fixes and enhancements

  • minor fix: remove execute permission for all recipes by @yfyeung in #1485
  • Refactor IO to support both files and URLs for Features, Array, and Image by @pzelasko in #1486
  • update edacc download link by @teowenshen in #1495
  • allow reading kaldi text file with utterances having empty references by @KarelVesely84 in #1496
  • Support accessing MixedCut.custom by @pzelasko in #1499
  • Support not slicing custom recordings by @pzelasko in #1494
  • Support left padding in supervision_intervals and supervision_masks by @yfyeung in #1492
  • fix "cannot join current thread" error when drop_last=True by @hoangtran9122 in #1487
  • Allow up to half a second of duration mismatch between audio and manifest by @pzelasko in #1513
  • Allow random reads of files from inside of tar files by @pzelasko in #1514
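For readers unfamiliar with random reads inside tar files (#1514), here is a minimal sketch of the general technique using only Python's standard `tarfile` module. It is not Lhotse's implementation. The helper names (`build_tar`, `index_tar`, `read_member`) are hypothetical, and the sketch assumes an uncompressed archive, because byte offsets are only directly seekable without compression.

```python
import io
import tarfile
from typing import Dict, Tuple


def build_tar(files: Dict[str, bytes]) -> bytes:
    """Create an uncompressed tar archive in memory (demo data only)."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for name, data in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def index_tar(raw: bytes) -> Dict[str, Tuple[int, int]]:
    """Scan the archive once and map member name -> (data offset, size)."""
    with tarfile.open(fileobj=io.BytesIO(raw), mode="r:") as tar:
        return {m.name: (m.offset_data, m.size) for m in tar if m.isfile()}


def read_member(raw: bytes, index: Dict[str, Tuple[int, int]], name: str) -> bytes:
    """Jump straight to one member's bytes without re-scanning the archive."""
    offset, size = index[name]
    stream = io.BytesIO(raw)
    stream.seek(offset)
    return stream.read(size)


archive = build_tar({"a.txt": b"hello", "b.bin": b"\x00\x01\x02"})
index = index_tar(archive)
payload = read_member(archive, index, "b.bin")
```

The one-time index pass is what makes later reads cheap: each lookup is a seek plus a bounded read, regardless of where the member sits in the archive.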

New Contributors

  • @hoangtran9122 made their first contribution in #1487

Full Changelog: v1.30.3...1.31.0

v1.30.3 - Nirvana patch 3

15 May 14:31
8af32e4

What's Changed

  • [fix] avoid importing librosa repeatedly by @yuekaizhang in #1475
  • Support setting initial shard offset for writing by @pzelasko in #1476
  • Testing: CUDA compatibility, remove unused tests, add torchaudio resampling by @pzelasko in #1480

Full Changelog: v1.30.2...v1.30.3

v1.30.2 - Nirvana patch 2

28 Apr 15:51
8c56a3e

What's Changed

  • Handle edge case in .repeat() when the input is empty by @pzelasko in #1474

Thanks for debugging the issue @monica-sekoyan
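The release note does not say what the empty-input edge case in `.repeat()` was. One plausible failure mode for any "repeat forever" iterator, assumed here purely for illustration, is a loop that spins without ever yielding when there is nothing to iterate over. The sketch below uses a hypothetical standalone `repeat` function, not Lhotse's code, to show a guard against that.

```python
from itertools import islice
from typing import Iterable, Iterator, Optional, TypeVar

T = TypeVar("T")


def repeat(items: Iterable[T], times: Optional[int] = None) -> Iterator[T]:
    """Yield `items` repeatedly; `times=None` means repeat indefinitely.

    Hypothetical helper for illustration; `items` must be re-iterable
    (for example a list), not a one-shot generator.
    """
    epoch = 0
    while times is None or epoch < times:
        yielded_any = False
        for item in items:
            yielded_any = True
            yield item
        if not yielded_any:
            # Empty input: stop instead of looping forever without output.
            return
        epoch += 1


first_five = list(islice(repeat([1, 2]), 5))
from_empty = list(repeat([]))
twice = list(repeat(["a"], times=2))
```

Without the `yielded_any` check, `list(repeat([]))` would never return when `times` is `None`, because the outer loop has no other exit condition.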

Full Changelog: v1.30.1...v1.30.2

v1.30.1 - Nirvana patch 1

21 Apr 19:32

Patch release with bug fixes to AIStore and multi-storage-client logic amongst others.

What's Changed

  • Fix the returned value for multi-job to_shar export by @pzelasko in #1470
  • fix: add timeout to aistore client initialization by @gaikwadabhishek in #1465
  • Restore kaldifeat in CI by @pzelasko in #1471
  • fix: add LHOTSE_MSC_BACKEND_FORCED flag to only enforce MSCIOBackend for non-MSC URLs by @jayya2 in #1472

New Contributors

Full Changelog: v1.30.0...v1.30.1

v1.30.0 - Nirvana

19 Mar 14:37

New features

  • add multi-storage-client backend for file open by @jayya2 in #1455

Learn more about multi-storage-client here.

Bug fixes and other enhancements

New Contributors

Full Changelog: v1.29.0...v1.30.0

v1.29.0 - Potion of Everlasting Vigor

13 Dec 15:02
a7f3b90

What's Changed

Recipes

  • Recipe for the Chinese Dysarthric Speech Database by @JinZr in #1423
  • Optimized ReazonSpeech download speed using hf datasets features by @yuta0306 in #1434

New features

  • Option to save audio in the original format when exporting to shar by @anteju in #1422
  • CutSet.from_huggingface_dataset() for importing HF datasets by @pzelasko in #1433
  • Extend AIStore serialization backend to writing by @pzelasko in #1435

Other improvements

New Contributors

Full Changelog: v1.28.0...v1.29.0