Skip to content

Instantly share code, notes, and snippets.

@kleem
kleem / README.md
Last active August 29, 2015 14:22 — forked from nitaku/README.md
Cassandra word cloud
@kleem
kleem / README.md
Last active August 29, 2015 14:02
OpeNER - Text annotation visualization
@kleem
kleem / README.md
Last active August 29, 2015 14:02
Ruby annotations

A different take on the previous example: linguistic annotations is represented by using Ruby annotations and their relative CSS properties. See this article by Richard Ishida from W3C for more information.

This implementation should be better than the previous one from a semantic web perspective, since ruby tags more or less describe the semantic of an annotation. It also has the advantage of having no CSS voodoo (with the exception of some -webkit- prefixed property). Unfortunately, browser support is still incomplete, so it may not work on your browser of choice (works on Chrome 31 for sure).

@kleem
kleem / README.md
Last active August 29, 2015 14:02
Linguistic annotations

An experiment on visualizing linguistic annotations of a (small) corpus.

The example uses a nonstandard, super-simple JSON format coded by hand (please forgive me for the errors I surely made from a linguistic standpoint).

This visualization focuses on three different aspects of the analysis: sentence splitting (a gray ■ introduces a new sentence), tokenization and lemmatization (each token has an underline and its lemma written under it) and part-of-speech tagging (the color of the underline and the lemma indicates whether the term is a noun, a verb, etc.).

The original text's spacing, punctuation and line breaking is preserved, as it can be seen by the last two lines.

Various CSS hacks with line heights, relative positioning and stuff are used to create this layout, so functionalities like text selection and similar are broken.