Importing your own dataset - Example with Reddit comments

Published 5 years, 11 months ago

The different imports as here to give you a taste of what Linkage can do but more often than not you're gonna want to customize your dataset.

Linkage can take a .csv as input. It should be formatted as lines of source,target,text for each edge. To escape the text, you can quote it using ", best is to use a csv library instead of doing the csv quoting yourself.

Here we will use reddit comments as our dataset. The nodes being the users and the edges the comments.

Here's the python script we use to make the .csv:

Then we get our clustered network:

Feel free to play with the script and let us know what you find out !