From b51d8deaf0d0e624c6a00d45fa117298e5824232 Mon Sep 17 00:00:00 2001 From: alexanderrichard Date: Sun, 9 Jun 2024 15:33:22 -0400 Subject: [PATCH] Update README.md --- README.md | 43 ++++++++++++++++++++++++++++++++++++++++++- 1 file changed, 42 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 81ab723..686beb8 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,45 @@ -# ears_dataset +# EARS Dataset + +We release the **E**xpressive **A**nechoic **R**ecordings of **S**peech (EARS) dataset. + +If you use the dataset or any derivative of it, please cite our [Paper](https://arxiv.org/) + +``` +@inproceedings{richter2024ears, + title={EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation}, + author={Richter, Julius and Wu, Yi-Chiao and Krenn, Steven and Welker, Simon and Lay, Bunlong and Watanabe, Shinjii and Richard, Alexander and Gerkmann, Timo}, + booktitle={Interspeech}, + year={2024} +} +``` + +## Highlights +* **100 h** of speech data from **107 speakers** +* high-quality recordings at **48 kHz** in an anechoic chamber +* **high speaker diversity** with speakers from different ethnicities and age range from 18 to 75 years +* **full dynamic range** of human speech, ranging from whispering to yelling +* 18 minutes of **freeform monologues** per speaker +* sentence reading in **7 different reading styles** (regular, loud, whisper, high pitch, low pitch, fast, slow) +* emotional reading and freeform tasks covering **22 different emotions** for each speaker + +## Download EARS Dataset + +``` +for X in $(seq -w 001 107); do + curl -L https://github.com/facebookresearch/ears_dataset/releases/download/dataset/p${X}.zip -o p${X}.zip + unzip p${X}.zip + rm p${X}.zip +done +``` + +## Download Blind Testset with Noisy Speech + +``` +curl -L https://github.com/facebookresearch/ears_dataset/releases/download/blind_testset/blind_testset.zip -o blind_testset.zip +mkdir blind_testset +unzip blind_testset.zip -d blind_testset +rm blind_testset.zip +``` # License The code and dataset are released under [CC-NC 4.0 International license](https://github.com/facebookresearch/ears_dataset/blob/main/LICENSE).