My fish database

For my diving project, I will need a small fish database.

  • I don’t want an exhaustive database (the most common fish will be enough).
  • I don’t want to replicate an existing database; if one exists, I should pull data from it.
  • If my database is built collaboratively, contributing to it should be easy.
  • I’m a Wikipedia enthusiast, and I want people to contribute to Wikipedia.
  • My website is internationalized, but I want each fish to be shared between languages.

So I think a great system would be:

  1. Is the fish in my database? If yes, use it.
  2. If not, look for its Wikipedia page. If you find it, create the fish in my database using this Wikipedia URL, and go to 1.
  3. If the Wikipedia page doesn’t exist, create it, then go to 2.
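
The three steps above can be sketched in Python roughly as follows. This is only a minimal sketch under stated assumptions: a plain dictionary stands in for the database, and `find_wikipedia_url` and `create_wikipedia_page` are hypothetical callables (not real library functions) that would wrap the actual Wikipedia lookup and the page creation, which in practice is done by a human editor.

```python
from typing import Callable, Dict, Optional


def get_or_create_fish(
    name: str,
    database: Dict[str, str],
    find_wikipedia_url: Callable[[str], Optional[str]],
    create_wikipedia_page: Callable[[str], str],
) -> str:
    """Return the Wikipedia URL stored for `name`, creating the entry if needed."""
    # Step 1: the fish is already in the database, so use it.
    if name in database:
        return database[name]

    # Step 2: look for an existing Wikipedia page (hypothetical helper).
    url = find_wikipedia_url(name)

    # Step 3: no page yet, so it has to be created first (hypothetical helper).
    if url is None:
        url = create_wikipedia_page(name)

    # Second half of step 2: create the fish using its Wikipedia URL.
    database[name] = url
    return url
```

Keying each fish on a single Wikipedia URL is one way to keep a fish shared between languages: translations can hang off that one entry instead of creating one fish per language.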

Let’s analyse this system:

Pros:

  • enrich Wikipedia
  • do not duplicate the information
  • a lot of data already exists

Cons:

  • Wikipedia pages are text and do not provide an easily callable API for entity properties (but fish pages contain a Taxobox, which would facilitate the parsing)
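
To illustrate why a Taxobox helps, here is a naive sketch that collects the `| key = value` lines of a template into a dictionary. The sample text is hand-written in the style of a taxobox, not copied from a real page, and the function name `parse_infobox_params` is made up for this example. Real pages contain nested templates, links, and italic markup that this line-based approach does not handle.

```python
import re
from typing import Dict

# Matches template parameter lines such as "| genus = Amphiprion".
PARAM_LINE = re.compile(r"^\s*\|\s*([A-Za-z0-9_ ]+?)\s*=\s*(.*?)\s*$")


def parse_infobox_params(wikitext: str) -> Dict[str, str]:
    """Collect `| key = value` lines into a dict (naive, line-based)."""
    params: Dict[str, str] = {}
    for line in wikitext.splitlines():
        match = PARAM_LINE.match(line)
        if match:
            key, value = match.groups()
            params[key] = value
    return params


# Hand-written, simplified sample (an assumption, not real page content).
SAMPLE = """{{Taxobox
| name = Ocellaris clownfish
| genus = Amphiprion
| species = ocellaris
}}"""

fish = parse_infobox_params(SAMPLE)
print(fish["genus"], fish["species"])
```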

I still have to investigate:

  • Wikipedia: how do I get the page content?
  • Freebase: is it overkill?
  • DBpedia: is there an API? Does it cover fish?
  • fishbase.org: what is the license? Is there an API?
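
On the first question, one possible route is the MediaWiki Action API, which can return the raw wikitext of a page. The sketch below assumes the English Wikipedia endpoint and a placeholder User-Agent string; the function names are made up for this example. Only `fetch_wikitext` touches the network, so the URL building and the response handling can be checked offline.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional

# Assumption: the English Wikipedia endpoint; other languages use their own host.
API_URL = "https://en.wikipedia.org/w/api.php"


def build_query_url(title: str) -> str:
    """Build an Action API URL asking for the latest wikitext of `title`."""
    params = {
        "action": "query",
        "prop": "revisions",
        "rvprop": "content",
        "rvslots": "main",
        "titles": title,
        "format": "json",
        "formatversion": "2",
    }
    return API_URL + "?" + urllib.parse.urlencode(params)


def extract_wikitext(response: dict) -> Optional[str]:
    """Return the wikitext from a decoded response, or None if the page is missing."""
    pages = response.get("query", {}).get("pages", [])
    if not pages or pages[0].get("missing"):
        return None
    revisions = pages[0].get("revisions")
    if not revisions:
        return None
    return revisions[0]["slots"]["main"]["content"]


def fetch_wikitext(title: str) -> Optional[str]:
    """Download and decode the page; this performs a real HTTP request."""
    request = urllib.request.Request(
        build_query_url(title),
        # Placeholder User-Agent: replace it with one identifying your project.
        headers={"User-Agent": "fish-database-sketch/0.1 (example)"},
    )
    with urllib.request.urlopen(request) as reply:
        return extract_wikitext(json.load(reply))
```

A `None` result corresponds to the "page doesn’t exist" case, in which the page would have to be created first.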

If I rely on a third-party website (such as Freebase or DBpedia), one problem is that I would have to wait for its database to be updated.

One thought on “My fish database”

  1. I like the idea!
    And if Wikipedia does not provide an API of some sort, it could be time to suggest one.
    My guess as to why there is no such API is that the content could too easily be trashed if it were not designed correctly.
