background preloader

Big data and data visualization

Facebook Twitter

This site publishes high-touch, time-intensive data visualizations (and has a business that sustains it) Over 7,000 artists played in the New York City area in 2013.

This site publishes high-touch, time-intensive data visualizations (and has a business that sustains it)

Only 21 of those later made it, really made it, headlining at a venue with an over 3,000-person capacity — among them, bigger names like Chance the Rapper, X Ambassadors, Sam Smith, and Sylvan Esso. I learned this sort of random but fascinating tidbit from a data visualization titled “The Unlikely Odds of Making it Big,” from the site The Pudding. The Pudding is the home to high-touch, painstakingly crafted data visualizations — what the site calls “visual essays” — that are distinct in their obsessive complexity over points of cultural curiosity.

Most pieces stand wholly apart from the U.S. news cycle; no anxiety-inducing interactives around budget, taxes, health care. Want to see everywhere jazz legend Miles Davis is mentioned across Wikipedia, and how he’s connected to other people, recordings, and places? “We’re all over the map. U.S. Bureau of Labor Statistics. Open Data Institute. NACION Data- Blogs Agroindustria fue el segundo ministerio que adhirió al Decreto 117/2016 que creó el Plan de Apertura de Datos con el objetivo de garantizar el derecho de acceso a la información pública.

NACION Data- Blogs

A la fecha, ya son 3 los ministerios adheridos: Energía, Agroindustria y Justicia. En la plataforma se puede encontrar información relevante sobre producción, comercio, inversión, precios y padrones de todas las actividades agroindustriales del país. Todos los datasets son producidos por la Secretaría de Mercados Agroindustriales a través de la Subsecretaría de Información y Estadística Pública, y se descargan en formato .csv.

Websays - What the web says. Research & Statistics - UIS Statistics. Index Map. Big Garden Birdwatch. Our World In Data. Data Publishing, Dissemination & Analytics. Canadian socioeconomic database from Statistics Canada. Website Evaluation 2017 Français Thank you for visiting Statistics Canada’s website.

Canadian socioeconomic database from Statistics Canada

You have been selected to participate in a brief evaluation to help us improve the website. The evaluation is designed to measure your web site experience, please complete the questionnaire at the end of your visit. Privacy Protection Statistics Canada is conducting this voluntary evaluation and will ensure that individual responses remain anonymous and protected pursuant to the Privacy Act. Use of cookies We are making temporary use of cookies during the evaluation period from January 9 to January 26, 2017 to ensure that you do not receive this invitation more than once. Statistics Canada: Canada's national statistical agency. Website Evaluation 2017 Please take a few minutes at the end of your visit today to anonymously tell us about your experience with the website.

Statistics Canada: Canada's national statistical agency

Choosing “Yes, after my visit” will open a new window that you can return to once you complete your visit to Gapminder: Unveiling the beauty of statistics for a fact based world view. The Data Visualisation Catalogue. Weather Forecast & Reports - Long Range & Local. - Live flight tracker!

Introducing Infogram's 'Data Visualization Workshop' Video Series. 5 Awesome Free Data Analysis Tools: Extract, Clean, and Share Your Data. 05.01.2016 by Marisa Krystian.

5 Awesome Free Data Analysis Tools: Extract, Clean, and Share Your Data

Inicio. Personal Finance News, Investing Advice, Business Forecasts. OECD Data. Open Data Institute. School of Data - Evidence is Power. Data Points. The Unofficial Google Data Science Blog. The Go Programming Language. Tres bases de datos gratuitas que cualquiera puede usar (Primera parte) Data governance tools. Smart Data Intelligence. Datos Abiertos. Datos Abiertos Colombia. Academic Torrents. UNdata. The World Bank. Untitled. Welcome to DataCite. Registry of Research Data Repositories. Euromonitor International. Ushahidi. Gallup Topic. Questionnaire design. Perhaps the most important part of the survey process is the creation of questions that accurately measure the opinions, experiences and behaviors of the public.

Questionnaire design

Accurate random sampling and high response rates will be wasted if the information gathered is built on a shaky foundation of ambiguous or biased questions. Creating good measures involves both writing good questions and organizing them to form the questionnaire. Questionnaire design is a multistage process that requires attention to many details at once. Designing the questionnaire is complicated because surveys can ask about topics in varying degrees of detail, questions can be asked in different ways, and questions asked earlier in a survey may influence how people respond to later questions.

Researchers also are often interested in measuring change over time and therefore must be attentive to how opinions or behaviors have been measured in prior surveys. O'Reilly Media - Technology Books, Tech Conferences, IT Courses, News. Institute of Education Sciences (IES) Home Page, a part of the U.S. Department of Education. NASDAQ Stock Market - Stock Quotes - Stock Exchange News. Making sense of data to improve education. Weather and climate change.

Forbes Welcome. Premise Data. General How many countries do you operate?

Premise Data

We’re currently in more than 30 countries across six continents. Centre for Research in Social Policy. Datos del Conflicto Armado en Colombia. En esta página CERAC pone a su disposición la Base de datos sobre Conflicto Armado Colombiano, y el glosario con la definición de cada una de las variables que la componen.

Datos del Conflicto Armado en Colombia

Contratación Mes de Junio 2016. UNITED STATES QuickFacts from the US Census Bureau. Interview with Rick Smolan on ‘The Human Face of Big Data’ Manu: Rick, can you tell us a bit about yourself?

Interview with Rick Smolan on ‘The Human Face of Big Data’

I saw in your TED talk that you used to be a photo journalist, so how did you get started on this journey? Rick Smolan: Yes, I was always very curious as a person so it’s interesting that I’d end up in a job where I get paid to be curious. As you saw in the TED talk, I went from being a journalist where I work for other people who set the agenda, to the fortunate position of being able to steer my own ship. And now, when I get curious about something I’m able to invite my heroes, my peers and some young journalists along. The journey to collaborate together is like crowd sourced journalism but the crowd is actually the journalists. Redshift Performance & Cost. At Airbnb, we look into all possible ways to improve our product and user experience.

Redshift Performance & Cost

Often times this involves lots of analytics behind the scene. Our data pipeline thus far has consisted of Hadoop, MySQL, R and Stata. We’ve used a wide variety of libraries for interfacing with our Hadoop cluster such as Hive, Pig, Cascading and Cascalog. However, we found that analysts aren’t as productive as they can be by using Hadoop, and standalone MySQL was no longer an option given the size of our dataset. SQL. I/ˈɛs kjuː ˈɛl/,[4] or i/ˈsiːkwəl/;[5] Structured Query Language[6][7][8][9]) is a special-purpose programming language designed for managing data held in a relational database management system (RDBMS), or for stream processing in a relational data stream management system (RDSMS). Originally based upon relational algebra and tuple relational calculus, SQL consists of a data definition language, data manipulation language, and Data Control Language.

The scope of SQL includes data insert, query, update and delete, schema creation and modification, and data access control. Although SQL is often described as, and to a great extent is, a declarative language (4GL), it also includes procedural elements. SQL was one of the first commercial languages for Edgar F. MapReduce Tutorial. This section provides a reasonable amount of detail on every user-facing aspect of the MapReduce framework. This should help users implement, configure and tune their jobs in a fine-grained manner. However, please note that the javadoc for each class/interface remains the most comprehensive documentation available; this is only meant to be a tutorial. HDFS Architecture Guide. Introduction The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware.

It has many similarities with existing distributed file systems. However, the differences from other distributed file systems are significant. HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware. Welcome to Apache™ Hadoop®! Welcome to Apache Pig! Airports of the Future. Big Data and Analytics in the Enterprise.

Learn Big Data Analytics: 51 Expert Tips. Research papers that changed the world of Big Data. If you are looking for some of the most influential research papers that revolutionised the way how we gather, aggregate, analyze and store increasing volumes of data in a short span of 10 years, you are in the right place! These papers were shortlisted, based on recommendations by big data enthusiasts and experts around the globe from various social media channels.

In case we’ve missed out any important paper, please let us know. Ten Reasons Why Data Scientist is The Top Job of the 21st Century. Photo: NASA I joined Amadeus’s team of data scientists coming from an Astrophysics background. Although I enjoyed my work, at some point I became interested in ways I could apply the skills I acquired working in Astrophysics to areas outside of academic research. I wanted to directly apply what I had always done (which included a lot of analysis of large amounts of data, programming, modeling, interpretation of results etc). It was at that time I became interested in Big Data and data science, which is meant to be a very sexy job. Mathematical and Statistical Frontiers. What is Big Data?