main (2 files)
begemot.parquet |
1.71GB |
README.md |
1.12kB |
Type: Dataset
Bibtex:
Tags:
Bibtex:
@article{,
title= {Begemot.ai Dataset},
journal= {},
author= {nyuuzyou},
year= {},
url= {https://huggingface.co/datasets/nyuuzyou/begemot},
abstract= {
# Dataset Card for Begemot.ai
### Dataset Summary
This dataset has 2,728,999 educational project descriptions in Russian. They were generated using AI on the Begemot.ai website. The content includes project titles, descriptions, chapters and chapter content on various educational topics.
### Languages
The dataset is primarily in Russian (ru).
## Dataset Structure
### Data Fields
This dataset includes the following fields:
- `id`: Unique identifier for the project (integer)
- `url`: URL of the project page (string)
- `title`: Title of the educational project (string)
- `type`: Type of project (string)
- `description`: Detailed description of the project (string)
- `chapters`: List of chapter titles (list of strings)
- `chapter_content`: JSON string mapping chapter titles to their content
### Data Splits
All examples are in a single split.
},
keywords= {},
terms= {},
license= {},
superseded= {}
}
begemot.parquet