Text this: Advanced language technologies for digital libraries :