background preloader

Big data and data visualization

Facebook Twitter

Real-Time Big Data Integration Tools for MDM & ETL. Cognitive Class - Free Data Science and Cognitive Computing Courses. SIPRI. WVS Database. Open Data Institute. The Functional Art: An Introduction to Information Graphics and Visualization.


Economic Commission for Latin America and the Caribbean. Economic Commission for Latin America and the Caribbean. Global Health Observatory (GHO) data. Global Strategy for Women's, Children's and Adolescents' Health (2016-2030): Data portal The Every Woman Every Child Global Strategy indicator and monitoring framework includes 60 indicators from health and other sectors. 34 indicators are from the Sustainable Development Goals (SDGs) and 26 from related global monitoring initiatives.

Global Health Observatory (GHO) data

From these, 16 key indicators are highlighted to provide a snapshot of progress. The Global Strategy portal provides open access to the latest available data and estimates for the 60 indicators across 194 countries. This involves collaboration across WHO departments, H6 agencies (UNAIDS, UNFPA, UNICEF, UN Women, WHO and the World Bank), other UN organizations - including the UN Statistics Division and UNESCO, and global monitoring partnerships, including the Countdown to 2030 and academic institutions. – Access the portal. Our World in Data.

This site publishes high-touch, time-intensive data visualizations (and has a business that sustains it) Over 7,000 artists played in the New York City area in 2013.

This site publishes high-touch, time-intensive data visualizations (and has a business that sustains it)

Only 21 of those later made it, really made it, headlining at a venue with an over 3,000-person capacity — among them, bigger names like Chance the Rapper, X Ambassadors, Sam Smith, and Sylvan Esso. I learned this sort of random but fascinating tidbit from a data visualization titled “The Unlikely Odds of Making it Big,” from the site The Pudding.

The Pudding is the home to high-touch, painstakingly crafted data visualizations — what the site calls “visual essays” — that are distinct in their obsessive complexity over points of cultural curiosity. Most pieces stand wholly apart from the U.S. news cycle; no anxiety-inducing interactives around budget, taxes, health care. Want to see everywhere jazz legend Miles Davis is mentioned across Wikipedia, and how he’s connected to other people, recordings, and places? “We’re all over the map. U.S. Bureau of Labor Statistics. Open Data Institute. NACION Data- Blogs Agroindustria fue el segundo ministerio que adhirió al Decreto 117/2016 que creó el Plan de Apertura de Datos con el objetivo de garantizar el derecho de acceso a la información pública.

NACION Data- Blogs

A la fecha, ya son 3 los ministerios adheridos: Energía, Agroindustria y Justicia. En la plataforma se puede encontrar información relevante sobre producción, comercio, inversión, precios y padrones de todas las actividades agroindustriales del país. Todos los datasets son producidos por la Secretaría de Mercados Agroindustriales a través de la Subsecretaría de Información y Estadística Pública, y se descargan en formato .csv. En algunos casos se puede acceder a información histórica, de períodos de tiempo que inician en 1969 y están actualizados a hoy. Al elegir las variables a consultar, la plataforma ofrece dos opciones: descargar los datos o simplemente visualizarlos y descargar el gráfico como .jpg. Websays - What the web says. Research & Statistics -

UIS Statistics. Index Map. Big Garden Birdwatch. Our World In Data. Data Publishing, Dissemination & Analytics. Canadian socioeconomic database from Statistics Canada. Website Evaluation 2017 Français Thank you for visiting Statistics Canada’s website.

Canadian socioeconomic database from Statistics Canada

Statistics Canada: Canada's national statistical agency. Website Evaluation 2017 Please take a few minutes at the end of your visit today to anonymously tell us about your experience with the website.

Statistics Canada: Canada's national statistical agency

Choosing “Yes, after my visit” will open a new window that you can return to once you complete your visit to Gapminder: Unveiling the beauty of statistics for a fact based world view. The Data Visualisation Catalogue. Weather Forecast & Reports - Long Range & Local. - Live flight tracker! Introducing Infogram's 'Data Visualization Workshop' Video Series. 5 Awesome Free Data Analysis Tools: Extract, Clean, and Share Your Data. 05.01.2016 by Marisa Krystian Data analysis is the process of cleaning, inspecting, transforming, and modeling data in order to uncover useful information.

5 Awesome Free Data Analysis Tools: Extract, Clean, and Share Your Data

Inicio. Personal Finance News, Investing Advice, Business Forecasts. OECD Data. Open Data Institute. School of Data - Evidence is Power. Data Points. The Unofficial Google Data Science Blog. The Go Programming Language. Tres bases de datos gratuitas que cualquiera puede usar (Primera parte)

Data governance tools. Smart Data Intelligence. Datos Abiertos. Datos Abiertos Colombia. Academic Torrents. UNdata. The World Bank. Untitled. Welcome to DataCite. Registry of Research Data Repositories. Euromonitor International.

Ushahidi. Gallup Topic. Questionnaire design. Perhaps the most important part of the survey process is the creation of questions that accurately measure the opinions, experiences and behaviors of the public.

Questionnaire design

Accurate random sampling and high response rates will be wasted if the information gathered is built on a shaky foundation of ambiguous or biased questions. Creating good measures involves both writing good questions and organizing them to form the questionnaire. Questionnaire design is a multistage process that requires attention to many details at once. Designing the questionnaire is complicated because surveys can ask about topics in varying degrees of detail, questions can be asked in different ways, and questions asked earlier in a survey may influence how people respond to later questions. O'Reilly Media - Technology Books, Tech Conferences, IT Courses, News.

Institute of Education Sciences (IES) Home Page, a part of the U.S. Department of Education. NASDAQ Stock Market - Stock Quotes - Stock Exchange News. Making sense of data to improve education. Weather and climate change. Forbes Welcome. Premise Data. General How many countries do you operate?

Premise Data

We’re currently in more than 30 countries across six continents. Centre for Research in Social Policy. Datos del Conflicto Armado en Colombia. En esta página CERAC pone a su disposición la Base de datos sobre Conflicto Armado Colombiano, y el glosario con la definición de cada una de las variables que la componen.

Datos del Conflicto Armado en Colombia

Contratación Mes de Junio 2016. UNITED STATES QuickFacts from the US Census Bureau. Interview with Rick Smolan on ‘The Human Face of Big Data’ Manu: Rick, can you tell us a bit about yourself?

Interview with Rick Smolan on ‘The Human Face of Big Data’

I saw in your TED talk that you used to be a photo journalist, so how did you get started on this journey? Rick Smolan: Yes, I was always very curious as a person so it’s interesting that I’d end up in a job where I get paid to be curious. As you saw in the TED talk, I went from being a journalist where I work for other people who set the agenda, to the fortunate position of being able to steer my own ship. And now, when I get curious about something I’m able to invite my heroes, my peers and some young journalists along. The journey to collaborate together is like crowd sourced journalism but the crowd is actually the journalists.

Redshift Performance & Cost. At Airbnb, we look into all possible ways to improve our product and user experience. Often times this involves lots of analytics behind the scene. Our data pipeline thus far has consisted of Hadoop, MySQL, R and Stata. We’ve used a wide variety of libraries for interfacing with our Hadoop cluster such as Hive, Pig, Cascading and Cascalog. SQL. Language for management and use of relational databases SQL ( S-Q-L,[4] "sequel"; Structured Query Language)[5][6][7] is a domain-specific language used in programming and designed for managing data held in a relational database management system (RDBMS), or for stream processing in a relational data stream management system (RDSMS). It is particularly useful in handling structured data, i.e. data incorporating relations among entities and variables. SQL offers two main advantages over older read–write APIs such as ISAM or VSAM. Firstly, it introduced the concept of accessing many records with one single command.

Secondly, it eliminates the need to specify how to reach a record, e.g. with or without an index. SQL was one of the first commercial languages to utilize Edgar F. History[edit] SQL was initially developed at IBM by Donald D. Chamberlin and Boyce's first attempt of a relational database language was Square, but it was difficult to use due to subscript notation. Design[edit] MapReduce Tutorial.

This section provides a reasonable amount of detail on every user-facing aspect of the MapReduce framework. This should help users implement, configure and tune their jobs in a fine-grained manner. However, please note that the javadoc for each class/interface remains the most comprehensive documentation available; this is only meant to be a tutorial. Let us first take the Mapper and Reducer interfaces. Applications typically implement them to provide the map and reduce methods.

We will then discuss other core interfaces including JobConf, JobClient, Partitioner, OutputCollector, Reporter, InputFormat, OutputFormat, OutputCommitter and others. Finally, we will wrap up by discussing some useful features of the framework such as the DistributedCache, IsolationRunner etc. Payload Applications typically implement the Mapper and Reducer interfaces to provide the map and reduce methods. HDFS Architecture Guide.

Introduction The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has many similarities with existing distributed file systems. However, the differences from other distributed file systems are significant. Welcome to Apache™ Hadoop®! Welcome to Apache Pig! Airports of the Future. Big Data and Analytics in the Enterprise. Learn Big Data Analytics: 51 Expert Tips. Research papers that changed the world of Big Data. Ten Reasons Why Data Scientist is The Top Job of the 21st Century. Photo: NASA I joined Amadeus’s team of data scientists coming from an Astrophysics background. Although I enjoyed my work, at some point I became interested in ways I could apply the skills I acquired working in Astrophysics to areas outside of academic research.

I wanted to directly apply what I had always done (which included a lot of analysis of large amounts of data, programming, modeling, interpretation of results etc). It was at that time I became interested in Big Data and data science, which is meant to be a very sexy job. Here’s why (or why not!) Mathematical and Statistical Frontiers. What is Big Data? Big data describes a holistic information management strategy that includes and integrates many new types of data and data management alongside traditional data.

Read more White paper: Enterprise Architect's Guide to Big Data—Reference Architecture Overview Big data has also been defined by the four Vs: