Skip to content

Commit 97ff626

Browse files
authored
minor docs changes (#7452)
1 parent e8ee24a commit 97ff626

File tree

2 files changed

+4
-4
lines changed

2 files changed

+4
-4
lines changed

docs/source/use_with_polars.mdx

+1-1
Original file line numberDiff line numberDiff line change
@@ -105,7 +105,7 @@ This also works for [`IterableDataset.map`] and [`IterableDataset.filter`].
105105

106106
Many functions are available in Polars and for any data type: string, floats, integers, etc. You can find the full list [here](https://docs.pola.rs/api/python/stable/reference/expressions/functions.html). Those functions are written in Rust and run on batches of data which enables fast data processing.
107107

108-
Here is an example that shows a 2.5x speed boost using Polars instead of a regular python function to extract solutions from a LLM reasoning dataset:
108+
Here is an example that shows a 5x speed boost using Polars instead of a regular python function to extract solutions from a LLM reasoning dataset:
109109

110110
```python
111111
from datasets import load_dataset

docs/source/video_dataset.mdx

+3-3
Original file line numberDiff line numberDiff line change
@@ -99,9 +99,9 @@ It's possible to point to more than one video in each row in your dataset, for e
9999
You can also define lists of videos. In that case you need to name the field `file_names` or `*_file_names`. Here is an example:
100100

101101
```jsonl
102-
{"frames_file_names": ["0001_t0.mp4", "0001_t1.mp4"], label: "moving_up"}
103-
{"frames_file_names": ["0002_t0.mp4", "0002_t1.mp4"], label: "moving_down"}
104-
{"frames_file_names": ["0003_t0.mp4", "0003_t1.mp4"], label: "moving_right"}
102+
{"videos_file_names": ["0001_left.mp4", "0001_right.mp4"], "label": "moving_up"}
103+
{"videos_file_names": ["0002_left.mp4", "0002_right.mp4"], "label": "moving_down"}
104+
{"videos_file_names": ["0003_left.mp4", "0003_right.mp4"], "label": "moving_right"}
105105
```
106106

107107
### Video captioning

0 commit comments

Comments
 (0)