forked from dbpedia-spotlight/dbpedia-spotlight
-
Notifications
You must be signed in to change notification settings - Fork 0
Performance
sandroacoelho edited this page Jul 26, 2013
·
3 revisions
You can get DBpedia Spotlight to run faster if:
- you put your index in a SSD (Solid State Drive) or a RAM drive -- anything that makes disk reads to the context index very fast
- put the entire context index in memory (not yet released as of 0.6, but possible from our latest source code)
- warm up the index so that the caches are built with the most common words (you can do this by setting up several queries to the system with the kinds of text you will want to annotate later)
The CoOccurrenceBasedSelector will also speed up the whole annotation process by removing some unnecessary disambiguations at spotting time, but it needs you to download and point the software to some extra models (see configuration file). The files are available here: http://spotlight.dbpedia.org/download/release-0.5/spot_selector.tgz
Another idea is to focus your index. If you're only looking for sports things, you can build a much smaller index with only sports data that will perform much faster.
Project
- Introduction
- Glossary
- User's manual
- Web application
- Installation
- Internationalization
- Licenses
- Researcher
- How to cite
- Support and Feedback
- Troubleshooting
- Team
- Acknowledgements
Statistical backend
Lucene backend
- Introduction
- Downloads
- Architecture
- Internationalization
- Web service parameters / API
- Splitting occurrences into topics
Developers