Menu
News
All News
Dungeons & Dragons
Level Up: Advanced 5th Edition
Pathfinder
Starfinder
Warhammer
2d20 System
Year Zero Engine
Industry News
Reviews
Dragon Reflections
White Dwarf Reflections
Columns
Weekly Digests
Weekly News Digest
Freebies, Sales & Bundles
RPG Print News
RPG Crowdfunding News
Game Content
ENterplanetary DimENsions
Mythological Figures
Opinion
Worlds of Design
Peregrine's Nest
RPG Evolution
Other Columns
From the Freelancing Frontline
Monster ENcyclopedia
WotC/TSR Alumni Look Back
4 Hours w/RSD (Ryan Dancey)
The Road to 3E (Jonathan Tweet)
Greenwood's Realms (Ed Greenwood)
Drawmij's TSR (Jim Ward)
Community
Forums & Topics
Forum List
Latest Posts
Forum list
*Dungeons & Dragons
Level Up: Advanced 5th Edition
D&D Older Editions
*TTRPGs General
*Pathfinder & Starfinder
EN Publishing
*Geek Talk & Media
Search forums
Chat/Discord
Resources
Wiki
Pages
Latest activity
Media
New media
New comments
Search media
Downloads
Latest reviews
Search resources
EN Publishing
Store
EN5ider
Adventures in ZEITGEIST
Awfully Cheerful Engine
What's OLD is NEW
Judge Dredd & The Worlds Of 2000AD
War of the Burning Sky
Level Up: Advanced 5E
Events & Releases
Upcoming Events
Private Events
Featured Events
Socials!
EN Publishing
Twitter
BlueSky
Facebook
Instagram
EN World
BlueSky
YouTube
Facebook
Twitter
Twitch
Podcast
Features
Top 5 RPGs Compiled Charts 2004-Present
Adventure Game Industry Market Research Summary (RPGs) V1.0
Ryan Dancey: Acquiring TSR
Q&A With Gary Gygax
D&D Rules FAQs
TSR, WotC, & Paizo: A Comparative History
D&D Pronunciation Guide
Million Dollar TTRPG Kickstarters
Tabletop RPG Podcast Hall of Fame
Eric Noah's Unofficial D&D 3rd Edition News
D&D in the Mainstream
D&D & RPG History
About Morrus
Log in
Register
What's new
Search
Search
Search titles only
By:
Forums & Topics
Forum List
Latest Posts
Forum list
*Dungeons & Dragons
Level Up: Advanced 5th Edition
D&D Older Editions
*TTRPGs General
*Pathfinder & Starfinder
EN Publishing
*Geek Talk & Media
Search forums
Chat/Discord
Menu
Log in
Register
Install the app
Install
Community
General Tabletop Discussion
*Dungeons & Dragons
Data from a million DnDBeyond character sheets?
JavaScript is disabled. For a better experience, please enable JavaScript in your browser before proceeding.
You are using an out of date browser. It may not display this or other websites correctly.
You should upgrade or use an
alternative browser
.
Reply to thread
Message
<blockquote data-quote="ichabod" data-source="post: 9067133" data-attributes="member: 1257"><p>So here's some info from fiddling with names:</p><p></p><p>The ten most common names are:</p><p></p><ol> <li data-xf-list-type="ol"><blank/> (4364 PCs)</li> <li data-xf-list-type="ol">Test (1206 PCs)</li> <li data-xf-list-type="ol">Bob (1003 PCs)</li> <li data-xf-list-type="ol">test (505 PCs)</li> <li data-xf-list-type="ol">Varis (469 PCs)</li> <li data-xf-list-type="ol">Rhogar (461 PCs)</li> <li data-xf-list-type="ol">DamienDM's Character (350 PCs)</li> <li data-xf-list-type="ol">Steve (322 PCs)</li> <li data-xf-list-type="ol">Ragnar (320 PCs)</li> <li data-xf-list-type="ol">Jack (299 PCs)</li> </ol><p>From my first look at the data set after downloading, I've been thinking it needs some cleaning. There are several PCs in the first few rows with either all 8's for abilities or all 0's, which I was thinking should be excluded from any analysis. I would also exclude blank and test names. But what about this Damien guy? And how many people name their test PCs Bob?</p><p></p><p>Being a fan of gnomes with lots of name, I next looked at the number of words in each name (actually the number of spaces, so these counts could be a bit off):</p><ol> <li data-xf-list-type="ol">538,739</li> <li data-xf-list-type="ol">565,092</li> <li data-xf-list-type="ol">70,796</li> <li data-xf-list-type="ol">18,805</li> <li data-xf-list-type="ol">6,177</li> <li data-xf-list-type="ol">2,332</li> <li data-xf-list-type="ol">1,102</li> <li data-xf-list-type="ol">492</li> <li data-xf-list-type="ol">284</li> </ol><p>When looking at the PCs with at least five words in their name, I was disappointed to find that gnomes only came in fourth (after humans, elves, and half-elves). So then I just decided to look at the longest names in the dataset by character count. The winner is Gragnok "Bob" Stone Crusher, Last Heir of the Thundering Holds, Keeper of the Tome of Rebirth, Herald of the Returning Steps. So clearly, not everyone named Bob is a test character. What interested me more was the second longest name: Zorkxire, Shield of the Land, Protector of the People, Servant to Skadrea, Folk Hero of the Realm, and Holder of the Shards. What is interesting is that it's in the data set three times. I really wish there was an anonymized player ID so I could see if Gragnok and Zorkxire were made by the same person. There's four more names like that, each with multiple entries, and then you get some names with tons of spaces after them.</p><p></p><p>In terms of data cleaning it might be seem reasonable to keep only one Zorkxire blah blah blah. But it doesn't seem to be a good idea to keep only one Rhogar. How detailed does a name have to be before you can call it a duplicate? And if you automate that, you could take out all of DamienDM's PCs, which may actually be unique.</p><p></p><p>Anyway, further data exploration is needed, and I would be hesitant to analyze this data raw.</p></blockquote><p></p>
[QUOTE="ichabod, post: 9067133, member: 1257"] So here's some info from fiddling with names: The ten most common names are: [LIST=1] [*]<blank/> (4364 PCs) [*]Test (1206 PCs) [*]Bob (1003 PCs) [*]test (505 PCs) [*]Varis (469 PCs) [*]Rhogar (461 PCs) [*]DamienDM's Character (350 PCs) [*]Steve (322 PCs) [*]Ragnar (320 PCs) [*]Jack (299 PCs) [/LIST] From my first look at the data set after downloading, I've been thinking it needs some cleaning. There are several PCs in the first few rows with either all 8's for abilities or all 0's, which I was thinking should be excluded from any analysis. I would also exclude blank and test names. But what about this Damien guy? And how many people name their test PCs Bob? Being a fan of gnomes with lots of name, I next looked at the number of words in each name (actually the number of spaces, so these counts could be a bit off): [LIST=1] [*]538,739 [*]565,092 [*]70,796 [*]18,805 [*]6,177 [*]2,332 [*]1,102 [*]492 [*]284 [/LIST] When looking at the PCs with at least five words in their name, I was disappointed to find that gnomes only came in fourth (after humans, elves, and half-elves). So then I just decided to look at the longest names in the dataset by character count. The winner is Gragnok "Bob" Stone Crusher, Last Heir of the Thundering Holds, Keeper of the Tome of Rebirth, Herald of the Returning Steps. So clearly, not everyone named Bob is a test character. What interested me more was the second longest name: Zorkxire, Shield of the Land, Protector of the People, Servant to Skadrea, Folk Hero of the Realm, and Holder of the Shards. What is interesting is that it's in the data set three times. I really wish there was an anonymized player ID so I could see if Gragnok and Zorkxire were made by the same person. There's four more names like that, each with multiple entries, and then you get some names with tons of spaces after them. In terms of data cleaning it might be seem reasonable to keep only one Zorkxire blah blah blah. But it doesn't seem to be a good idea to keep only one Rhogar. How detailed does a name have to be before you can call it a duplicate? And if you automate that, you could take out all of DamienDM's PCs, which may actually be unique. Anyway, further data exploration is needed, and I would be hesitant to analyze this data raw. [/QUOTE]
Insert quotes…
Verification
Post reply
Community
General Tabletop Discussion
*Dungeons & Dragons
Data from a million DnDBeyond character sheets?
Top