
O'Reilly Media - Technology Books, Tech Conferences, IT Courses, News

What is O'Reilly Media? Technology Books, Tech Conferences, IT Courses & News O'Reilly spreads the knowledge of innovators through its technology books, online services, magazines, research, and tech conferences. Since 1978, O'Reilly has been a chronicler and catalyst of leading-edge development, homing in on the technology trends that really matter and galvanizing their adoption by amplifying "faint signals" from the alpha geeks who are creating the future. An active participant in the technology community, O'Reilly has a long history of advocacy, meme-making, and evangelism.

Related:  Big data and data visualization

Questionnaire design Perhaps the most important part of the survey process is the creation of questions that accurately measure the opinions, experiences and behaviors of the public. Accurate random sampling and high response rates will be wasted if the information gathered is built on a shaky foundation of ambiguous or biased questions. Creating good measures involves both writing good questions and organizing them to form the questionnaire. Questionnaire design is a multistage process that requires attention to many details at once. Designing the questionnaire is complicated because surveys can ask about topics in varying degrees of detail, questions can be asked in different ways, and questions asked earlier in a survey may influence how people respond to later questions.

Free Books A lot of people keep asking for a good list of programming books, so we are building this list to save you time and to spread the knowledge. Some of these books will definitely help us to evolve our coding skills and thought processes for developing better solutions. We will do our best to keep this list updated and hope you find it useful. Here we go.

EPUB ePub is an open format defined by the Open eBook Forum of the International Digital Publishing Forum (IDPF). It is based on XHTML and XML along with optional CSS style sheets. Its predecessor was the OEB standard. Specifications are found at the IDPF web site. The page covers ePub version 2.01. For version 3 see ePub 3.

Premise Data General How many countries do you operate in? We’re currently in more than 30 countries across six continents. How do you decide which countries you’ll start a network in next? These decisions are generally customer-driven, depending on their data needs. There are also some networks we spin up out of our own desire to create greater societal transparency.

O'Reilly Releases DocBook: The Definitive Guide Under the GNU License by Norman Walsh, Leonard Muellner 09/10/2001 This change in licensing allows DocBook users worldwide to update, translate, and reuse the official reference documentation for DocBook. The XML sources for the book have been checked into the DocBook project at SourceForge. The DocBook project is an open source project to develop and revise DocBook-based software and documentation. The Free Software Foundation's Bradley Kuhn said, "We believe that books on Free Software need to be Free (as in freedom). We are going to encourage people to buy this free manual from O'Reilly, and thus reward both the publisher and the authors for contributing to the community."

AZARDI: ePub Books and Resources Focus: Interactivity, fixed layout, education, training, learning. Our first AZARDI Fixed Layout book was released in 2011, before the IDPF Fixed Layout specification was written. It is now updated for the full IDPF Fixed Layout Specification.

Interview with Rick Smolan on ‘The Human Face of Big Data’ Manu: Rick, can you tell us a bit about yourself? I saw in your TED talk that you used to be a photojournalist, so how did you get started on this journey? Rick Smolan: Yes, I was always very curious as a person, so it’s interesting that I’d end up in a job where I get paid to be curious. As you saw in the TED talk, I went from being a journalist where I work for other people who set the agenda, to the fortunate position of being able to steer my own ship. And now, when I get curious about something, I’m able to invite my heroes, my peers and some young journalists along. The journey to collaborate together is like crowdsourced journalism, but the crowd is actually the journalists.

Redshift Performance & Cost At Airbnb, we look into all possible ways to improve our product and user experience. Oftentimes this involves lots of analytics behind the scenes. Our data pipeline thus far has consisted of Hadoop, MySQL, R and Stata. We’ve used a wide variety of libraries for interfacing with our Hadoop cluster, such as Hive, Pig, Cascading and Cascalog. However, we found that analysts aren’t as productive as they could be using Hadoop, and standalone MySQL was no longer an option given the size of our dataset.

SQL Language for management and use of relational databases SQL (S-Q-L,[4] "sequel"; Structured Query Language)[5][6][7] is a domain-specific language used in programming and designed for managing data held in a relational database management system (RDBMS), or for stream processing in a relational data stream management system (RDSMS). It is particularly useful in handling structured data, i.e. data incorporating relations among entities and variables. SQL offers two main advantages over older read–write APIs such as ISAM or VSAM. Firstly, it introduced the concept of accessing many records with one single command.
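
The "many records with one single command" point can be illustrated with a minimal sketch. It uses Python's standard `sqlite3` module and an in-memory database; the `products` table, its columns, the sample rows and the `raise_prices` helper are all hypothetical names invented for this illustration, not something the excerpt describes.

```python
import sqlite3


def raise_prices(conn, category, factor):
    """Multiply the price of every row in one category using a single UPDATE.

    Returns the number of rows the statement changed.
    """
    cur = conn.execute(
        "UPDATE products SET price = price * ? WHERE category = ?",
        (factor, category),
    )
    conn.commit()
    return cur.rowcount


# Hypothetical sample data, held in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (name TEXT, category TEXT, price REAL)")
conn.executemany(
    "INSERT INTO products VALUES (?, ?, ?)",
    [("atlas", "book", 10.0), ("novel", "book", 20.0), ("lamp", "home", 15.0)],
)

# One statement updates every matching record; no per-record loop is needed.
changed = raise_prices(conn, "book", 2.0)
rows = conn.execute("SELECT name, price FROM products ORDER BY name").fetchall()
print(changed, rows)
```

With a record-at-a-time API, the caller would have to read each record, test its category and write it back; here the selection and the update are both expressed declaratively in the one `UPDATE` statement.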

MapReduce Tutorial This section provides a reasonable amount of detail on every user-facing aspect of the MapReduce framework. This should help users implement, configure and tune their jobs in a fine-grained manner. However, please note that the javadoc for each class/interface remains the most comprehensive documentation available; this is only meant to be a tutorial.

HDFS Architecture Guide Introduction The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has many similarities with existing distributed file systems. However, the differences from other distributed file systems are significant.

Research papers that changed the world of Big Data If you are looking for some of the most influential research papers that revolutionised the way we gather, aggregate, analyze and store increasing volumes of data in a short span of 10 years, you are in the right place! These papers were shortlisted based on recommendations by big data enthusiasts and experts around the globe from various social media channels. In case we’ve missed any important paper, please let us know.

Ten Reasons Why Data Scientist is The Top Job of the 21st Century I joined Amadeus’s team of data scientists coming from an Astrophysics background. Although I enjoyed my work, at some point I became interested in ways I could apply the skills I acquired working in Astrophysics to areas outside of academic research. I wanted to directly apply what I had always done (which included a lot of analysis of large amounts of data, programming, modeling, interpretation of results etc). It was at that time I became interested in Big Data and data science, which is meant to be a very sexy job. Here’s why (or why not!).

What is Big Data? Big data describes a holistic information management strategy that includes and integrates many new types of data and data management alongside traditional data. White paper: Enterprise Architect's Guide to Big Data: Reference Architecture Overview. Big data has also been defined by the four Vs:

Related: Collection Ebooks, Knowledge Management