Skip to content

karamouche/noisekit

Model Serving & MLOps

Generate degraded speech datasets for noise‑robust ASR benchmarking

Python Latest v0.1.3 · 13h ago Security brief →

Features

  • Create synthetic noisy audio from clean HuggingFace speech datasets
  • Apply six atomic degradation presets (codec, ambient noise, clipping, reverb, etc.)
  • Compose presets into multi‑condition scenarios
  • Score each output with PESQ, SNR, and NISQA metrics
  • Produce a JSONL manifest ready for HuggingFace audiofolder loading

Recent releases

View all 3 releases →
No immediate action
v0.1.3 New feature

MuLawCompanding transform

No immediate action
v0.1.2 New feature

Code of Conduct + Security Policy

No immediate action
v0.1.1 Breaking risk

metadata.jsonl + audio presets

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

About

Stars
10
Forks
0
Language
Python

Beta — feedback welcome: [email protected]