I'm not a mongodb expert. This document gives no recommendation, but describes the choices I made for lichess. Please let me know what can be improved!
Back in 2009 lichess was using mysql, with tables for `game`, `player` and even `piece` models. It required a lot of JOINs and lacked schema flexibility.
So I implemented a custom storage system based on binary model serialization with file persistence... which of course was very hard to query.
Finally, in 2010, I learnt about document databases and chose mongodb for its update performance and SQL-like indexes.
The schema is composed of 25 collections. At the time of writing (Oct 13 2012), the DB contains 10.2 million objects and occupies 12.8 GB on the filesystem. All indexes fit in 824 MB. More than 10,000 games are added every day. Each game is updated 60 times on average, once per chess move. There are also the forum posts, private messages and millions of chat messages.
A game's data is spread across 4 collections:
- `game` contains the frequently needed info such as the clock, usernames, piece positions, game status, number of turns... Here's an example of a game object.
- `room` contains the private chat of the game players.
- `watcher_room` contains the public chat of the game spectators.
- `pgn` contains the game history in PGN format, allowing replays, analysis and takebacks.
So each game results in 4 objects in 4 distinct collections. Everything could fit in a single nested object, but I prefer separating the data based on use cases. I only need the `game` information to display a board or apply a move. Having 4 collections keeps the very busy `game` collection small, so most operations are cheap.
There are more than 10,000 games played every day, so I try to shrink their DB representation as much as possible to reduce data transfer and keep the whole thing in memory.
In mongodb key names are stored for each object, so I reduced them to one or two chars.
I also use custom data encodings to save as many bytes as possible. For instance, here's how piece positions are stored: `1pCkJqJPKNKPJPJNKBJQkRtBtRMPGPPP`.
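The exact format is lichess-internal, but the general idea — packing each piece into two characters — can be sketched roughly like this. This is a hypothetical scheme, not the actual lichess encoding: here a 0-63 square index is offset into the printable ASCII range, followed by the piece's letter (uppercase for white, lowercase for black).

```javascript
// Hypothetical sketch of a compact piece-position encoding, NOT the
// actual lichess format: each piece becomes two chars, one for the
// square (ASCII offset) and one for the piece letter.
function encodePieces(pieces) {
  // pieces: [{ square: 0-63, letter: 'K' | 'q' | ... }]
  return pieces
    .map(p => String.fromCharCode(33 + p.square) + p.letter)
    .join('');
}

function decodePieces(str) {
  const pieces = [];
  for (let i = 0; i < str.length; i += 2) {
    pieces.push({ square: str.charCodeAt(i) - 33, letter: str[i + 1] });
  }
  return pieces;
}
```

Under such a scheme a full position (up to 32 pieces) fits in at most 64 bytes, versus several hundred for a naive array of nested objects.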
On average, a game fits in 1.24 KB:
> db.game4.stats().avgObjSize + db.room.stats().avgObjSize + db.watcher_room.stats().avgObjSize + db.pgn.stats().avgObjSize
1241.775641951236
Here are the indexes I need on the `game` collection for my common queries:
- status (created, started, resigned, checkmate, ...) as an integer
- userIds as a pair of strings; the index is sparse to exclude anonymous games
- winnerUserId, sparse string index
- createdAt, descending
- userIds + createdAt compound index, to show one player's games in chronological order
- bookmark, sparse int: the number of player bookmarks, used to show popular games
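In mongo-shell terms those indexes could be declared roughly as below. This is a sketch written as plain spec objects; the field names are illustrative, since the real documents use the shortened one- or two-char keys mentioned earlier.

```javascript
// Sketch of the game collection's index definitions as plain spec
// objects (field names illustrative; real keys are one or two chars).
function gameIndexes() {
  return [
    { key: { status: 1 }, options: {} },
    { key: { userIds: 1 }, options: { sparse: true } },     // skip anonymous games
    { key: { winnerUserId: 1 }, options: { sparse: true } },
    { key: { createdAt: -1 }, options: {} },                // -1 = descending
    { key: { userIds: 1, createdAt: -1 }, options: {} },    // a player's games, newest first
    { key: { bookmark: 1 }, options: { sparse: true } },
  ];
}
// In the mongo shell each entry would become a
// db.game4.ensureIndex(key, options) call.
```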
The `_id` is a random 8-char string used in URLs. `room`, `watcher_room` and `pgn` objects are linked to the `game` objects by sharing the same `_id`.
User data is split across 4 collections:
- `user` for the frequently accessed data. See an example of a user object. It contains security data, user preferences and a good deal of denormalized game counts. Note that the `_id` field is the lowercased username. It allows readable references to users in other collections' objects. It also allows building HTML links to users without loading them from the database, as the `_id` is enough to make a link like http://lichess.org/@/thibault.
- `config` stores user preferences for AI, friend and lobby games. Here's a config object from the database. The config `_id` is the user `_id`.
- `user_history` remembers every game played by a user, and the corresponding ELO rating variations. The array is indexed by timestamp int and I use it to display ELO charts.
- `security` associates usernames (= user ids) with browser cookies and IP addresses. It is used for authentication and spam prevention.
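The lowercased-username `_id` trick means a profile link can be built from any user reference, with no extra database round trip. A minimal sketch (the helper names are mine):

```javascript
// Because the user _id IS the lowercased username, any document that
// references a user already holds everything needed to link to them.
function userLink(userId) {
  return 'http://lichess.org/@/' + userId;
}

// What the _id of a newly registered user would be.
function normalizeUsername(username) {
  return username.toLowerCase();
}
```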
Like for `game`, I could have it all in one big object, but I like keeping the `user` collection small.
All collections and references:
Capped collections: I use them for the homepage public chat room (`lobby_room`). Only 2 MB worth of silly chatting is stored. Capped collections also rotate moderator action logs.
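Creating such a collection boils down to one option object; the 2 MB size here is my reading of the figure above, not a value taken from the lichess code.

```javascript
// Options for a small capped collection: once the 2 MB cap is hit,
// mongodb overwrites the oldest documents automatically, so old chat
// messages rotate out by themselves.
function lobbyRoomOptions() {
  return { capped: true, size: 2 * 1024 * 1024 };
}
// In the mongo shell:
//   db.createCollection("lobby_room", lobbyRoomOptions())
```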
Sharding or replication? Eeeer no, it's all on the same server. The same one that runs the application. Only the artificial intelligence runs on a separate server.
Speaking of which, the way I do backups is awful. I just rsync the DB directory without locking anything. It works for now but I'm certain it's horribly wrong.
lichess.org is built in scala using the Play 2 web framework. The scala driver is casbah, which wraps the Java mongodb driver.
Most of the time I map the mongodb documents to scala objects using salat, a lightweight serialization library that does not use reflection.
Sometimes a collection grows fat and I must split it and/or compress its data. Then I write mongodb JS scripts like this one or that one.
I never update a collection in place, because it always results in lost disk space: either the documents get smaller and leave extra padding, or they get bigger and mongodb has to move them. Instead, I copy the collection to a new one while performing the modifications. Not only does the new collection use exactly the space it needs on disk, but the old one is still available... you know, just in case.
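The copy-while-transforming pattern looks roughly like this. Plain arrays stand in for collections here; the real scripts run in the mongo shell against actual collections.

```javascript
// Migrate by copying: read every document, transform it, and insert it
// into a fresh collection instead of updating in place. In the mongo
// shell this would be something like:
//   db.game4.find().forEach(function(doc) { db.game5.insert(shrink(doc)); })
function migrate(oldColl, transform) {
  const newColl = [];
  for (const doc of oldColl) {
    newColl.push(transform(doc));
  }
  return newColl;
}
```

The new collection ends up tightly packed, and dropping or keeping the old one is a separate, reversible decision.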
All models are immutable. In fact all lichess code is immutable. No increments, no setters, and no side effects (other than haskell-style IO monads). I find immutable models much easier to deal with regardless of the database backend used. Not only do they allow trivial parallelism, but they also just feel "right" and joyful to use.
Basically, when I fetch, say, a game, the DB document is mapped to an algebraic data type, a "case class" in scala.
Every change to this object results in a new object. Then I can just send the final object to the DB; there is some logic to compute the `$set` and `$unset` operations from the original to the final object.
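That diff logic can be sketched as follows; this is a simplified version for flat documents (the real lichess code is scala and handles nested fields).

```javascript
// Compute a mongodb update from two immutable document snapshots:
// changed or added fields go into $set, removed fields into $unset.
function diffUpdate(original, updated) {
  const set = {};
  const unset = {};
  for (const key of Object.keys(updated)) {
    if (original[key] !== updated[key]) set[key] = updated[key];
  }
  for (const key of Object.keys(original)) {
    if (!(key in updated)) unset[key] = true;
  }
  const update = {};
  if (Object.keys(set).length) update.$set = set;
  if (Object.keys(unset).length) update.$unset = unset;
  return update;
}
```

The resulting object is exactly what a driver would pass as the update argument, e.g. `db.game4.update({_id: id}, diffUpdate(before, after))`.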
First I define some query objects:
// app/game/Query.scala
val finished = "s" $in List(
Status.Mate.id,
Status.Resign.id,
Status.Outoftime.id,
Status.Timeout.id)
def user(u: User) = DBObject("uids" -> u.id)
def loss(u: User) = user(u) ++ finished ++ ("wid" $ne u.id)
These I apply to the collection wrapper object:
// app/game/GameRepo.scala
val games = gameRepo find Query.loss(user)
This yields a game list of type `IO[List[DbGame]]`.
Searching in mongodb just does not work, so the search engine is powered by elasticsearch. When a game is finished, its ID is stored in the `index_queue` mongodb collection, then a daemon periodically batch-inserts the queued IDs into the elasticsearch index.
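The queue-draining daemon boils down to something like this; it's a sketch, with the batch size and the elasticsearch call left as placeholders.

```javascript
// Drain the index_queue in fixed-size batches: each batch of game IDs
// would become one bulk request to the elasticsearch index.
function drainQueue(queue, batchSize, bulkIndex) {
  while (queue.length > 0) {
    const batch = queue.splice(0, batchSize);
    bulkIndex(batch); // e.g. one elasticsearch _bulk request
  }
}
```

Batching keeps elasticsearch traffic bounded even when many games finish at once, and a failed batch simply stays cheap to retry.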
I made a fancy realtime monitoring tool! Check it out at http://en.lichess.org/monitor. It tracks the mongodb memory usage, number of open connections, queries per second and global lock, all using the `serverStatus` command (app/monitor/MongoStatus.scala).
There are also munin graphs visible at http://en.lichess.org/monitor/munin/balrog/balrog/index.html#mongodb, but they seem a bit messed up by the recent mongodb 2.2 upgrade.