4

Since i needed higher quality data for retoor9b I invested some time in the statistics project. It became quite decent. It's not a silly script anymore.

I also had rank per user regarding popularity (upvotes / post avg). It got lost somehow. I only know that IHateForALiving has second place. Root 10th. Netikras 14th, Lensflare 15th). First place was by not regular user who made one post with six upvotes. That's all. Hihi.

Repository: https://retoor.molodetz.nl/retoor/...

Dataset for LLM: embeddings:https://retoor.molodetz.nl/retoor/...

Graph compilation with ALL users active last few weeks:
https://retoor.molodetz.nl/retoor/...

All generated data by this project: https://retoor.molodetz.nl/retoor/...

Build / latest export status: https://retoor.molodetz.nl/retoor/...

In the LLM dataset you'll see more interesting data for every user like:
Statistics: User(ranter) retoor made 505 contributions to devRant(developer community) what means retoor owns 1.0 percent of contributions on devRant(developer community). The avarage post length of retoor is 219 and total post length is 111037. retoor owns 0.0 percent of content on devRant(developer community).
retoor is 315 times mentioned on devRant(developer comminity).

Comments
  • 1
    lovely
  • 1
    @magicMirror has some good ideas like sentiment analysis and wordcloud. And i realize that the wordcloud is already done when I trained retoor9b. I can just ask it to create a word cloud. It's possible that the quality is bad of course. AI's filter data first before they're gonna do actions on it. So maybe the data is too filtered to make a decent realistic wordcloud on. Will see
Add Comment