R, Ruby, and Finance. R: Retrieving information from google using the RCurl package « "R" you ready? R: Retrieving information from google using the RCurl package 01Jan09 Lately I read the article Automatic Meaning Discovery Using Google by Cilibras and VitanyiIt which introduces the normalized google distance (NGD) as a measure of semantic relatedness of two search terms.

As its basis for calculation the NGD uses simple google search result counts. Now I want to figure out how to impelement this calculation using R. The first step is to retrieve the needed information from the google website. I found a nice site written by Duncan Temple Lang that explains the extraction of HTML code from any internet site using the RCurl package. Library(RCurl) # now lets extract the HTML code from my blog using getURL() # from the RCurl package getURL(" # this looks pretty unstructured.

The above implementation surely is not technically mature (e.g. the extraction code). As a last step let’s wrap the above way to extract the google search results count into a function. Happy New Year! Mark Like this: R twotorials. Statistics, R, Graphics and Fun. RSS R Introduction Course. Module 8: Principle Components Analysis, Factor Analysis and Related Principle Components Analysis, Maximum Likelihood Factor Analysis & scale reliability Creating Composite Scores (i.e., indicator variables) with factor analysis.

Creating Composite Scores (i.e., indicator variables) with principle component analysis. Determining the most appropriate number of factors to extract using VSS Demonstration of factor rotations and what they do. Data generation and simulation of various Factor Analytic Models. Data generation and fitting of binary item factor models. An example of Bootstrapped Factor Analysis An example of Hierarchical Factor Analysis Simple 2 & 3 variable Correspondence Analysis Brief examples of Multidimensional Scaling Module 9: Structural Equation Modeling (SEM) and Related SEM example run in R using SEMData.sav (same as this: Stage 1 and Stage 2) An intuitive way to do latent variable modeling (CFA, SEM, & latent growth) Dr.

Example SEM with one hierarchical latent factor. Machine Learning in R. R-statistics blog. Learning R. Statistical Modeling, Causal Inference, and Social Science. RStudio in the cloud, for dummies.

You can have your own cloud computing version of R, complete with RStudio. Why should you? It's cool! Plus, there's a lot more power out there than you can easily get on your own hardware. And, it's R in a web page. Run it from your tablet. This entry is largely made possible by the work of Louis Alsett, who's completing his doctoral work at Trinity College, University of Dublin. Start-up1. 2. 3. A. 1) click "Create a New Security Group". 2) Give a name (like "RStudio") and 3) a description ("RStudio")-- both are required. G. Use4. 5. 6. 7. There's your R in the cloud! ManagementOur understanding of the "Free Usage Tier" is that you can leave this on all the time for a year without incurring any charges. However, there is also a "stopped" state. As long as an instance is running, you retain all aspects of your session-- it's just as if you had a computer that you left on running RStudio all the time.

Happy cloud computing! R-bloggers.