datasets
2024
93,860
119,924,722,855
Rank↕ | Subreddit↕ | Occurrences in Subreddit | Total Words in Subreddit | Rate in Subreddit (per 1M words) | Ratio (Sub Rate / Reddit Rate) |
|---|---|---|---|---|---|
| #1 | r/datasets | 3,069 | 369,718 | 8300.9 | 10606.07 |
| #2 | r/spiceai | 112 | 40,709 | 2751.2 | 3515.25 |
| #3 | r/Open_Diffusion | 111 | 52,680 | 2107.1 | 2692.19 |
| #4 | r/xbeat_ml | 103 | 63,641 | 1618.5 | 2067.89 |
| #5 | r/kaggle | 56 | 36,235 | 1545.5 | 1974.64 |
| #6 | r/OpenSourceeAI | 119 | 94,482 | 1259.5 | 1609.26 |
| #7 | r/AcademicMarvelsHub | 103 | 86,271 | 1193.9 | 1525.46 |
| #8 | r/StatisticsPorn | 400 | 335,482 | 1192.3 | 1523.42 |
| #9 | r/unsloth | 39 | 36,721 | 1062.1 | 1357.00 |
| #10 | r/machinelearningnews | 281 | 272,471 | 1031.3 | 1317.69 |
| #11 | r/u_ibm | 167 | 162,666 | 1026.6 | 1311.74 |
| #12 | r/datascienceproject | 46 | 47,728 | 963.8 | 1231.44 |
| #13 | r/OpenAIevals | 202 | 221,037 | 913.9 | 1167.66 |
| #14 | r/remotesensing | 104 | 120,415 | 863.7 | 1103.52 |
| #15 | r/huggingface | 104 | 126,020 | 825.3 | 1054.44 |
| #16 | r/data | 147 | 204,189 | 719.9 | 919.84 |
| #17 | r/Ultralytics | 43 | 63,271 | 679.6 | 868.34 |
| #18 | r/lifecycleassessment | 49 | 84,087 | 582.7 | 744.55 |
| #19 | r/u_XquantumIn | 175 | 302,458 | 578.6 | 739.27 |
| #20 | r/datavisualization | 58 | 100,403 | 577.7 | 738.09 |
| #21 | r/zfs | 691 | 1,245,169 | 554.9 | 709.05 |
| #22 | r/CodeHero | 92 | 179,603 | 512.2 | 654.49 |
| #23 | r/sportsanalytics | 51 | 101,382 | 503.0 | 642.74 |
| #24 | r/computervision | 920 | 1,837,355 | 500.7 | 639.77 |
| #25 | r/MLQuestions | 454 | 970,215 | 467.9 | 597.88 |
| #26 | r/dataanalytics | 83 | 189,543 | 437.9 | 559.50 |
| #27 | r/IT4Research | 65 | 155,041 | 419.2 | 535.67 |
| #28 | r/LargeLanguageModels | 31 | 74,021 | 418.8 | 535.10 |
| #29 | r/algoprojects | 33 | 82,789 | 398.6 | 509.30 |
| #30 | r/LanguageTechnology | 171 | 433,374 | 394.6 | 504.15 |
| #31 | r/tensorflow | 33 | 84,864 | 388.9 | 496.84 |
| #32 | r/deeplearning | 561 | 1,455,712 | 385.4 | 492.40 |
| #33 | r/bigdata | 35 | 90,856 | 385.2 | 492.20 |
| #34 | r/BanSubvertingBois | 97 | 263,520 | 368.1 | 470.31 |
| #35 | r/CompSocial | 33 | 90,454 | 364.8 | 466.14 |
| #36 | r/mlscaling | 104 | 296,895 | 350.3 | 447.57 |
| #37 | r/truenas | 1,189 | 3,457,268 | 343.9 | 439.42 |
| #38 | r/dataanalysiscareers | 126 | 369,908 | 340.6 | 435.22 |
| #39 | r/neuralnetworks | 35 | 103,592 | 337.9 | 431.69 |
| #40 | r/bigquery | 74 | 219,865 | 336.6 | 430.03 |
| #41 | r/MachineLearning | 2,731 | 8,126,293 | 336.1 | 429.40 |
| #42 | r/PygmalionAI | 40 | 126,035 | 317.4 | 405.51 |
| #43 | r/webscraping | 392 | 1,239,623 | 316.2 | 404.04 |
| #44 | r/learnmachinelearning | 1,686 | 5,351,975 | 315.0 | 402.51 |
| #45 | r/bioinformatics | 866 | 2,766,493 | 313.0 | 399.96 |
| #46 | r/ChatGPTautomation | 117 | 379,292 | 308.5 | 394.13 |
| #47 | r/MistralAI | 41 | 134,954 | 303.8 | 388.17 |
| #48 | r/QuantumTrove | 33 | 109,572 | 301.2 | 384.81 |
| #49 | r/LiDAR | 31 | 106,720 | 290.5 | 371.15 |
| #50 | r/WGU_MSDA | 106 | 365,832 | 289.8 | 370.21 |