QuickGraph#5 Learning a taxonomy from your tagged data

The Objective Say we have a dataset of multi-tagged items: books with multiple genres, articles with multiple topics, products with multiple categories… We want to organise logically these tags -the genres, the topics, the categories…- in a descriptive but also actionable way. A typical organisation will be hierarchical, like a taxonomy. But rather than building it […]

QuickGraph#4 Explore your browser history in Neo4j

The dataset For this example I am going to use my browser history data. Most browsers store this data in SQLite. This means relational data, easy to access from Neo4j using the apoc.load.jdbc  stored procedure. I’m a Chrome user, and in my Mac, Chrome stores the history db at ~/Library/Application Support/Google/Chrome/Default/History There are two main tables in the […]

QuickGraph#3 A step-by-step example of RDF to Property Graph transformation

The dataset For this example I am going to use a sample movie dataset from the Cayley project. It’s a set of half a million triples about actors, directors and movies that can be downloaded here. Here is what the dataset looks like: </en/meet_the_parents> <name> “Meet the Parents” . </en/meet_the_parents> <type> </film/film> . </en/meet_the_parents> </film/film/directed_by> </en/jay_roach> . […]

QuickGraph#2 How is Wikipedia’s knowledge organised

The dataset For this QuickGraph I’ll use data about Wikipedia Categories. You may have noticed at the bottom of every Wikipedia article a section listing the categories it’s classified under. Every Wikipedia article will have at least one category, and categories branch into subcategories forming overlapping trees. It is sometimes possible for a category (and the Wikipedia hierarchy is an example of […]