background preloader

Google NGram Experiments

Google NGram Experiments
With Google’s new tool Ngram Viewer, you can visualise the rise and fall of particular keywords across 5 million books and 500 years! See how big cocaine was in Victorian times. The spirit of inquiry over the ages. The spirit of inquiry over the ages II (NGram is case-sensitive). The Battle Of The Brains What happened around 1700??? Age-old debates (by Andy, James Rooney, Nick, Bidzubido, Jacqui,Gary,Stefan Lasiewski,Mark) Got any more?

Related:  digital humanities

How to Use the Ngram Viewer - Google Books Here's one of Google's more obscure tools. "Google Books Ngram Viewer" -- Say that ten times fast! What the heck is an Ngram anyway? An Ngram, also commonly called an N-gram is a statistical analysis of text or speech content to find n (a number) of some sort of item in the text. 50 Great Examples of Data Visualization Wrapping your brain around data online can be challenging, especially when dealing with huge volumes of information. And trying to find related content can also be difficult, depending on what data you’re looking for. But data visualizations can make all of that much easier, allowing you to see the concepts that you’re learning about in a more interesting, and often more useful manner. Google's Ngram Viewer Goes Wild - Ben Zimmer With the addition of wildcard search-term capabilities, Google's fabulous language-analysis tool gets even more powerful. It's been nearly three years since Google rolled out its Ngram Viewer, allowing armchair historians to plot the trajectories of words and phrases over time based on an enormous corpus of data extracted from the Google Books digitization project. Since then, there have been numerous studies seeking to glean some cultural significance from the graphs of falling and rising word usage. And the graphs themselves have inspired imitators: Recently, the engineering team behind Rap Genius introduced Ngram-style graphing of historical word frequency in rap lyrics, and, more bizarrely, New York Times wedding announcements. (You can even compare the hiphop and matrimonial datasets.) As the Ngram model extends its influence, Google continues to tinker, making improvements to the Ngram Viewer's already slick interface.

Web Trend Map 2007 Version 2.0 by Oliver Reichenstein We have done it before: the 200 most successful websites pinned down on the Tokyo Metro Map, ordered by category, proximity, success, popularity and perspective. Now we have done it again — and better. Back by popular demand: here is iA’s next Web Trend Map: Welcome to Utrecht Psalter A Psalter is a songbook from the Bible (Old Testament). Christians and Jews used it and still use it in their prayers and at their services. The Utrecht Psalter is world-famous for its spectacular illustrations. This masterpiece was produced around 830 in or near the French city of Reims and after many journeys arrived in Utrecht in 1716. Why is it a current topic now? The Utrecht Psalter has been nominated for UNESCO’s Memory of the World Register.

Ten Fatal Flaws in Data Analysis 1. Where’s the Beef? In a way, the worst flaw a data analysis can have is no analysis at all. Instead, you get data lists, sorts and queries, and maybe some simple descriptive statistics but nothing that addresses objectives, answers questions, or tells a story. If that’s all you want, that’s fine. But a data report is not a data analysis.

Wind Map An invisible, ancient source of energy surrounds us—energy that powered the first explorations of the world, and that may be a key to the future. This map shows you the delicate tracery of wind flowing over the US. The wind map is a personal art project, not associated with any company. We've done our best to make this as accurate as possible, but can't make any guarantees about the correctness of the data or our software.

Kit Guide These pages are a guide to making the basic equipment and clothing of an Anglo-Saxon warrior of the sixth century, together with a few pages on some more advanced projects. Clothing Anglo-Saxon clothes were generally made of wool although linen may also have been used. Colours should be those available from undyed wool (off-white, browns and greys) or from natural dyes. Dyes for strong colours would be expensive and would only be available to the rich. Stripes and checks (but not tartans) look good for trousers but it is best to stick to plain material for tunics. Data journalism pt5: Mashing data (comments wanted) This is a draft from a book chapter on data journalism (part 1 looks at finding data; part 2 at interrogating data; part 3 at visualisation, and 4 at visualisation tools). I’d really appreciate any additions or comments you can make – particularly around tips and tools. UPDATE: It has now been published in The Online Journalism Handbook.

A Day in the Life of The New York Times R&D LabsA snapshot of site traffic to on June 25, 2009. Have you ever wondered where the readers of The New York Times’s Web site come from, and what kind of devices they use to read our content? In a past life, not too long ago, when I worked in The Times’s research and development labs, we started a research visualization project to explore this very topic. I worked on these visualizations with Michael Young, Michael Kramer, and Noriaki Okada.

Digitised Manuscripts Content The manuscript contains the only known complete copy of the Catholicon Anglicum, a bilingual English-Latin dictionary compiled anonymously at some point during the fifteenth century.Only one other copy of the Catholicon Anglicum is known to survive: Add MS 15562, an imperfect copy missing many leaves, which dates to the mid-15th century.This copy of the Catholicon contains approximately 8,000 Middle English headwords. Each headword is written in red ink: the first letter is written as a littera notabilior, and is aligned against the left-hand edge of the ruled writing space. Lists of Latin equivalents then follow, written in brown ink.The headwords are arranged largely in alphabetical order, A-Ȝ. The compiler signified the different parts of speech to which these words belonged by writing guiding words in brown ink in the margin to the left of the headwords: for example, the indefinite or definite articles for nouns, and 'to' for verbs.

Data journalism pt4: visualising data – tools and publishing (comments wanted) This is a draft from a book chapter on data journalism (here are parts 1; two; and three, which looks the charts side of visualisation). I’d really appreciate any additions or comments you can make – particularly around tips and tools. UPDATE: It has now been published in The Online Journalism Handbook.