CRIMENET maps criminal organizations and their relationships into a single, searchable graph. It covers cartels, mafias, gangs, motorcycle clubs, triads, clans, factions, militias, and terrorist groups, drawn from nearly 1,500 Wikipedia articles in English, Italian, Portuguese, and Spanish.
An LLM pipeline reads each article and extracts every organization mentioned and every alliance, rivalry, or other tie between them. Every edge carries the verbatim Wikipedia sentence that justifies it, a paraphrased description, an extracted time period, and a versioned source URL. Any claim can be audited back to the specific revision of the specific article it came from.
The graph reflects what was found in the articles processed — about 1,500 out of millions on Wikipedia. The LLM pipeline is not perfect and occasionally misses a connection or misclassifies one. More importantly, Wikipedia itself does not document every real-world tie. An absent link does not mean a relationship does not exist; it means it was not recorded in the sources that were processed.
The code, the dataset, the pipeline, and a detailed technical report are all open source. The GitHub repository has the full pipeline, the audit tools, and the build system that turns the data into this site.