background preloader

Data Sets

Data Sets

Related:  Data Showdown

The Journalist-Engineer A couple months ago, I published an article comparing historic and present-day popularity of older music. I used two huge datasets: 50,000 Billboard songs and 1,4M tracks on Spotify. If I were writing an academic paper, I’d do a ton of analysis, regression, and modeling to figure out why certain songs have become more popular over time. Or I could just make some sick visualizations… Open-source, distributed deep learning for the JVM Deeplearning4j is not the first open-source deep-learning project, but it is distinguished from its predecessors in both programming language and intent. DL4J is a JVM-based, industry-focused, commercially supported, distributed deep-learning framework intended to solve problems involving massive amounts of data in a reasonable amount of time. It integrates with Hadoop and Spark using an arbitrary number of GPUs or CPUs, and it has a number you can call if anything breaks. Get Started With Deeplearning4j Content

What is the Marital Status of Americans by Age? Visualization Data Notes A few months ago I created a visualization that allowed users to compare age distributions for various topics and another one that showed marital status by age range. Technical Analysis: Chart Patterns By Cory Janssen, Chad Langager and Casey Murphy A chart pattern is a distinct formation on a stock chart that creates a trading signal, or a sign of future price movements. Chartists use these patterns to identify current trends and trend reversals and to trigger buy and sell signals. In the first section of this tutorial, we talked about the three assumptions of technical analysis, the third of which was that in technical analysis, history repeats itself. The theory behind chart patters is based on this assumption. The idea is that certain patterns are seen many times, and that these patterns signal a certain high probability move in a stock.

how fast does miles teller play in whiplash EDIT 05 Sep. 2015: The concept of Beat Per Minutes (BPM) has been mis-understood as mentioned by reddit. What I was supposed to write was Strokes Per Minutes (SPM). Released in 2014, Whiplash focuses on a promising young drummer (Miles Teller) pursuing his dream of greatness. Implementing a Neural Network from Scratch in Python – An Introduction – WildML Get the code: To follow along, all the code is also available as an iPython notebook on Github. In this post we will implement a simple 3-layer neural network from scratch. We won’t derive all the math that’s required, but I will try to give an intuitive explanation of what we are doing. I will also point to resources for you read up on the details.

What You Like Falls on Party Lines Far and away, the Republican group is more country, while fans of Mrs. Clinton are more pop. At the top of the list for people who like Mrs. Welcome — Theano 0.8.2 documentation Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Theano features: tight integration with NumPy – Use numpy.ndarray in Theano-compiled functions.transparent use of a GPU – Perform data-intensive computations much faster than on a CPU.efficient symbolic differentiation – Theano does your derivatives for functions with one or many inputs.speed and stability optimizations – Get the right answer for log(1+x) even when x is really tiny.dynamic C code generation – Evaluate expressions faster.extensive unit-testing and self-verification – Detect and diagnose many types of errors. Theano has been powering large-scale computationally intensive scientific investigations since 2007.

The Ultimate Crowdsourced Map of Long Distance Relationships Around Valentine's Day this year, we got the idea of asking Atlas Obscura readers about one of the most fraught kind of relationships—the long distance kind, or LDRs. We assumed we'd get 50 or so responses, and maybe we'd pick a few stories to highlight. But nearly 600 of you filled out the survey. The results were incredible—and fill the interactive map above. People conducted relationships from the ends of the earth, spanning years and ostensibly filling whole hard drives with video chats and text messages. Speeding up your Neural Network with Theano and the GPU – WildML Get the code: The full code is available as an Jupyter/iPython Notebook on Github! In a previous blog post we build a simple Neural Network from scratch. Let’s build on top of this and speed up our code using the Theano library. With Theano we can make our code not only faster, but also more concise! What is Theano? Theano describes itself as a Python library that lets you to define, optimize, and evaluate mathematical expressions, especially ones with multi-dimensional arrays.

the data of long distance lovers August 25, 2015 · textmining · It is a truism to say that relationship is hard to maintain. It might be even more difficult when the Atlantic Ocean separates them. But thanks to internet and those big data companies that feed themselves with our personal informations, we can now have a real relationship even with 5,500 km between us.

Je vais donner accès aux groupes datasets et data tools pour garder le tout propre ;) by dishwasherz Jul 27

Un grand ensemble de datasets très variés. A garder de côté ! by simd3v Jul 27