
Improved Documentation Of Audio Classification #35368

Merged: 6 commits merged into huggingface:main on Dec 20, 2024

Conversation

@Uvi-12 (Contributor) commented Dec 20, 2024

Fixes #35367

This PR makes several improvements to the audio_classification.md documentation for better clarity and readability.

@stevhliu

@stevhliu (Member) left a comment:


Thanks for your improvements!

```diff
@@ -20,7 +20,7 @@ rendered properly in your Markdown viewer.
 
 <Youtube id="KWwzcmG98Ds"/>
 
-Audio classification - just like with text - assigns a class label output from the input data. The only difference is instead of text inputs, you have raw audio waveforms. Some practical applications of audio classification include identifying speaker intent, language classification, and even animal species by their sounds.
+Audio classification - just like with text - assigns a class label as output from the input data.. The only difference is instead of text inputs, you have raw audio waveforms. Some practical applications of audio classification include identifying speaker intent, language classification, and even animal species by their sounds.
```
Suggested change:

```diff
-Audio classification - just like with text - assigns a class label as output from the input data.. The only difference is instead of text inputs, you have raw audio waveforms. Some practical applications of audio classification include identifying speaker intent, language classification, and even animal species by their sounds.
+Audio classification - just like with text - assigns a class label as output from the input data. The only difference is instead of text inputs, you have raw audio waveforms. Some practical applications of audio classification include identifying speaker intent, language classification, and even animal species by their sounds.
```

````diff
@@ -57,7 +57,7 @@ Start by loading the MInDS-14 dataset from the 🤗 Datasets library:
 >>> minds = load_dataset("PolyAI/minds14", name="en-US", split="train")
 ```
 
-Split the dataset's `train` split into a smaller train and test set with the [`~datasets.Dataset.train_test_split`] method. This'll give you a chance to experiment and make sure everything works before spending more time on the full dataset.
+Split the `train` split of the dataset into smaller train and test sets using the [`~datasets.Dataset.train_test_split`] method. This will give you a chance to experiment and make sure everything works before spending more time on the full dataset.
````
I don't think it's necessary to change the first sentence, but good with This'll --> This will
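For readers following along, the step this hunk documents boils down to the following minimal sketch (the `test_size=0.2` value is an illustrative assumption, not taken from this diff):

```python
from datasets import load_dataset

# Load the English subset of MInDS-14, as shown earlier in the doc
minds = load_dataset("PolyAI/minds14", name="en-US", split="train")

# Carve a smaller train/test pair out of the single `train` split;
# the 80/20 ratio here is an illustrative choice
minds = minds.train_test_split(test_size=0.2)
print(minds)  # DatasetDict with "train" and "test" splits
```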

```diff
@@ -164,7 +164,7 @@ To apply the preprocessing function over the entire dataset, use 🤗 Datasets [
 
 ## Evaluate
 
-Including a metric during training is often helpful for evaluating your model's performance. You can quickly load an evaluation method with the 🤗 [Evaluate](https://huggingface.co/docs/evaluate/index) library. For this task, load the [accuracy](https://huggingface.co/spaces/evaluate-metric/accuracy) metric (see the 🤗 Evaluate [quick tour](https://huggingface.co/docs/evaluate/a_quick_tour) to learn more about how to load and compute a metric):
+Including a metric during training can be helpful for evaluating your model's performance. You can quickly load an evaluation method with the 🤗 [Evaluate](https://huggingface.co/docs/evaluate/index) library. For this task, load the [accuracy](https://huggingface.co/spaces/evaluate-metric/accuracy) metric (see the 🤗 Evaluate [quick tour](https://huggingface.co/docs/evaluate/a_quick_tour) to learn more about how to load and compute a metric):
```
I don't think this change is necessary either; can suggests it's potentially helpful, but in this case I think it is definitely helpful.
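For context, a minimal sketch of how the accuracy metric typically gets wired into training; the `compute_metrics` shape follows the standard 🤗 Trainer convention, and the function name is conventional rather than taken from this diff:

```python
import numpy as np
import evaluate

# Load the accuracy metric from the 🤗 Evaluate library
accuracy = evaluate.load("accuracy")

# Trainer-style hook: eval_pred.predictions are logits, so take the
# argmax to get class ids before comparing against the labels
def compute_metrics(eval_pred):
    predictions = np.argmax(eval_pred.predictions, axis=1)
    return accuracy.compute(predictions=predictions, references=eval_pred.label_ids)
```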

Comment on lines 274 to 278:

```diff
-The simplest way to try out your fine-tuned model for inference is to use it in a [`pipeline`]. Instantiate a `pipeline` for audio classification with your model, and pass your audio file to it:
+To perform inference with your fine-tuned model, use a [pipeline]. Instantiate a `pipeline` for audio classification with your model, and pass your audio file to it:
```

These additions also don't seem necessary (duplicate maybe?)
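For context, the inference step both versions describe amounts to the sketch below; the checkpoint name and audio path are placeholders, not values from this PR:

```python
from transformers import pipeline

# "my_awesome_minds_model" is a placeholder for your fine-tuned checkpoint
classifier = pipeline("audio-classification", model="my_awesome_minds_model")

# Returns a list of {"score": float, "label": str} dicts, highest score first
predictions = classifier("path/to/audio.wav")
print(predictions)
```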

@Uvi-12 (Contributor, Author) commented Dec 20, 2024

I have updated the documentation as per your review. Please let me know if any other adjustments are required.

@Uvi-12 requested a review from stevhliu on December 20, 2024, 16:44
@stevhliu (Member) left a comment:


Thanks, LGTM!

@stevhliu merged commit 94fe0b9 into huggingface:main on Dec 20, 2024
8 checks passed
@Uvi-12 deleted the audio-classification branch on December 31, 2024