dataset
2024
169,618
119,924,722,855
Rank↕ | Subreddit↕ | Occurrences in Subreddit | Total Words in Subreddit | Rate in Subreddit (per 1M words) | Ratio (Sub Rate / Reddit Rate) |
|---|---|---|---|---|---|
| #1 | r/datasets | 2,873 | 369,718 | 7770.8 | 5494.17 |
| #2 | r/unsloth | 174 | 36,721 | 4738.4 | 3350.21 |
| #3 | r/kaggle | 169 | 36,235 | 4664.0 | 3297.58 |
| #4 | r/Open_Diffusion | 244 | 52,680 | 4631.7 | 3274.77 |
| #5 | r/tensorflow | 283 | 84,864 | 3334.7 | 2357.76 |
| #6 | r/spiceai | 115 | 40,709 | 2824.9 | 1997.30 |
| #7 | r/aprendaIA | 109 | 39,073 | 2789.7 | 1972.36 |
| #8 | r/SECourses | 100 | 40,610 | 2462.4 | 1741.02 |
| #9 | r/datascienceproject | 99 | 47,728 | 2074.3 | 1466.56 |
| #10 | r/DreamBooth | 156 | 79,149 | 1971.0 | 1393.53 |
| #11 | r/huggingface | 226 | 126,020 | 1793.4 | 1267.96 |
| #12 | r/OpenSourceeAI | 163 | 94,482 | 1725.2 | 1219.76 |
| #13 | r/pytorch | 329 | 192,724 | 1707.1 | 1206.97 |
| #14 | r/aesdr | 92 | 55,046 | 1671.3 | 1181.68 |
| #15 | r/Ultralytics | 103 | 63,271 | 1627.9 | 1150.98 |
| #16 | r/MLQuestions | 1,566 | 970,215 | 1614.1 | 1141.20 |
| #17 | r/computervision | 2,958 | 1,837,355 | 1609.9 | 1138.26 |
| #18 | r/deeplearning | 2,174 | 1,455,712 | 1493.4 | 1055.90 |
| #19 | r/machinelearningnews | 384 | 272,471 | 1409.3 | 996.43 |
| #20 | r/StatisticsPorn | 471 | 335,482 | 1404.0 | 992.63 |
| #21 | r/Statistics_Class_help | 69 | 57,835 | 1193.0 | 843.52 |
| #22 | r/bigquery | 262 | 219,865 | 1191.6 | 842.52 |
| #23 | r/zfs | 1,463 | 1,245,169 | 1174.9 | 830.72 |
| #24 | r/stata | 375 | 321,744 | 1165.5 | 824.06 |
| #25 | r/neuralnetworks | 115 | 103,592 | 1110.1 | 784.89 |
| #26 | r/LargeLanguageModels | 79 | 74,021 | 1067.3 | 754.59 |
| #27 | r/WGU_MSDA | 388 | 365,832 | 1060.6 | 749.87 |
| #28 | r/learnmachinelearning | 5,610 | 5,351,975 | 1048.2 | 741.11 |
| #29 | r/rprogramming | 216 | 206,938 | 1043.8 | 737.99 |
| #30 | r/LanguageTechnology | 440 | 433,374 | 1015.3 | 717.84 |
| #31 | r/learndatascience | 74 | 74,922 | 987.7 | 698.33 |
| #32 | r/MistralAI | 129 | 134,954 | 955.9 | 675.84 |
| #33 | r/truenas | 3,255 | 3,457,268 | 941.5 | 665.66 |
| #34 | r/data | 192 | 204,189 | 940.3 | 664.82 |
| #35 | r/askdatascience | 40 | 42,877 | 932.9 | 659.59 |
| #36 | r/apachespark | 161 | 179,886 | 895.0 | 632.80 |
| #37 | r/apache_airflow | 41 | 46,247 | 886.5 | 626.81 |
| #38 | r/spss | 238 | 284,174 | 837.5 | 592.15 |
| #39 | r/RStudio | 791 | 968,792 | 816.5 | 577.27 |
| #40 | r/GoogleColab | 57 | 72,236 | 789.1 | 557.90 |
| #41 | r/remotesensing | 92 | 120,415 | 764.0 | 540.19 |
| #42 | r/MachineLearning | 5,823 | 8,126,293 | 716.6 | 506.63 |
| #43 | r/u_XquantumIn | 207 | 302,458 | 684.4 | 483.88 |
| #44 | r/LLMDevs | 458 | 672,448 | 681.1 | 481.55 |
| #45 | r/machinetranslation | 38 | 59,450 | 639.2 | 451.93 |
| #46 | r/sdforall | 83 | 133,670 | 620.9 | 439.02 |
| #47 | r/TOPdesk | 31 | 50,188 | 617.7 | 436.72 |
| #48 | r/mlscaling | 170 | 296,895 | 572.6 | 404.84 |
| #49 | r/rstats | 433 | 756,831 | 572.1 | 404.51 |
| #50 | r/vectordatabase | 56 | 98,221 | 570.1 | 403.11 |