Kumo - Java Word Cloud

Kumo is available on GitHub. The goal of Kumo is to create a powerful and user-friendly Word Cloud library in Java. Kumo can directly generate an image file, or return a BufferedImage. Please feel free to jump in and help improve Kumo!

Current Features: Draw Rectangle, Circle or Image Overlay word clouds. Download from Maven Central.

Examples:
Generate a Word Cloud on top of an image.
Generate a circular Word Cloud.
Generate a rectangular Word Cloud.
Tokenize Chinese text into a circle.
Create a polarity word cloud to contrast two datasets.
Create a Layered Word Cloud from two images/two word sets.

Mirador

Mirador is a tool for visual exploration of complex datasets. It enables users to discover correlation patterns and derive new hypotheses from the data.

Download 1.3 (8 December 2014): Windows, Mac OS X.

Instructions: Download the file appropriate for your operating system.

About: Mirador is an open source project released under the GNU General Public License v2.

Further reading:
Ebola prognosis prediction: Computational methods for patient prognosis based on available clinical data (June 9th, 2015)
Ebola data release: De-identified clinical data from Ebola patients treated at the Kenema Government Hospital in Sierra Leone between May and June of 2014 (February 26th, 2015)
Awards from the Department of Health and Human Services: Mirador received the third place, innovation, and scientific excellence awards in the HHS VizRisk challenge (January 5th, 2015)
Winning entries in the Mirador Data Competition: Read about the winning correlations submitted by Mirador users (December 1st, 2014)

Word Cloud

Description: Word Clouds (also known as Tag Clouds) are a visualisation method that shows how frequently words appear in a given sample of text by making the size of each word proportional to its frequency. The words, sized by frequency, are then typically arranged in a cluster or cloud. Word Clouds can also be used to display words that have metadata assigned to them. Colour in Word Clouds is usually meaningless and primarily aesthetic, but it can be used to categorise words or to display another data variable. Typically, Word Clouds are used on websites or blogs to depict keyword or tag usage.

Although they are simple and easy to understand, Word Clouds have some major flaws:
Long words are emphasised over short words.
Words whose letters contain many ascenders and descenders may receive more attention.
They are not great for analytical accuracy, so they are used more for aesthetic reasons.

Functions: Analysing Text, Comparisons, Distribution / Frequency, Proportions.
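The frequency-to-size idea described above can be sketched in a few lines of plain Java. This is only an illustration of the general technique, not how Kumo or any other library implements it: the class name WordCloudSizes, the helper methods countWords and fontSize, the linear scaling, and the 10 to 40 point range are all assumptions made for this example.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper class for illustration only (not part of any library).
class WordCloudSizes {

    // Count how often each word occurs, ignoring case and punctuation.
    public static Map<String, Integer> countWords(String text) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        // Split on any run of characters that are neither letters nor digits.
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Linearly map a count in [minCount, maxCount] to a font size in [minSize, maxSize].
    public static int fontSize(int count, int minCount, int maxCount, int minSize, int maxSize) {
        if (maxCount == minCount) {
            return maxSize; // every word is equally frequent
        }
        double t = (double) (count - minCount) / (maxCount - minCount);
        return (int) Math.round(minSize + t * (maxSize - minSize));
    }

    public static void main(String[] args) {
        Map<String, Integer> counts = countWords("data cloud data word cloud data");
        int min = Collections.min(counts.values());
        int max = Collections.max(counts.values());
        // Print each word with the font size it would get in the cloud.
        counts.forEach((word, count) ->
                System.out.println(word + " -> " + fontSize(count, min, max, 10, 40) + "pt"));
    }
}
```

Note that this sizing only controls font height. A long word still covers more area than a short word with the same count, which is the first flaw listed above.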

Alpine Data Science Periodic Table

At the recent Strata Conference (Feb 11-13, 2014 in Santa Clara) companies used many creative giveaways to attract prospects to their booths. One of the most clever was a Periodic Table of Data Science from Alpine. It divides data science operators into 7 categories:
Load: Hc - copy to Hadoop, Ds - Dataset ...
Explore: Bc - Bar Chart, Bp - Box Plot ...
Transform: Ag - Aggregate, Co - Collapse ...
Sample: Rs - Random Sampling, Ss - Stratified Sampling ...
Model: Ar - Association Rules, Sr - SVM Regression ...
Predict: Ad - Adaboost Predictor, Np - Neural Network Predictor ...
Tools: Cm - Confusion Matrix, Gf - Goodness of Fit ...

An interactive version is available, and it is intended to be used in conjunction with Alpine Chorus, which can be obtained for free.

Hyperlink The Overview Project
