Decoration
max planck institut
informatik
mpii logo Minerva of the Max Planck Society
 

PROSPERA: PRospering knOwledge with Scalability, PrEcision, and RecAll

PROSPERA is a Hadoop-based scalable knowledge-harvesting engine which combines pattern-based gathering of
relational fact candidates with weighted MaxSat-based consistency reasoning to identify the most likely
correct facts.

Publications


The code can be found here (works with YAGO 1.0 data formats).
The following data is for the experiments reported in N.Nakashole et. al WSDM2011.

Experiment 1: Scalability Experiment (sports relations)

Precision/Recall per iteration
- Sports precision/recall
Extracted Facts
- Iteration 1
- Iteration 2
- Iteration 3
- Iteration 4
- Iteration 5
- Iteration 6
- PROSPERA Variant: NoReasoner
- PROSPERA Variant: UnWeighted
Labeled Samples
- 1 - 2 - 3 - 4 - 5 - 6 noreasoner unweighted
Constraints
- Sports Constraints
Seeds
- Sports relation seeds

Experiment 2: Constraints experiment (academic relations)

Precision/Recall per iteration
- Academia precision/recall
Extracted Facts
- Iteration 1
- Iteration 2
- PROSPERA Variant: NoReasoner
Labeled Samples
- 1 - 2 noreasoner
Constraints
Academic constraints
Seeds
- From YAGO ontology