background preloader

Data Reduction Techniques

Facebook Twitter

Epub.wu.ac.at/1923/1/document.pdf. Principal Component Analysis. A simple example Consider 100 students with Physics and Statistics grades shown in the diagram below.

Principal Component Analysis

The data set is in marks.dat. If we want to compare among the students which grade should be a better discriminating factor? Physics or Statistics? Pca - Can principal component analysis be applied to datasets containing a mix of continuous and categorical variables. Hamming distance. Examples[edit] The Hamming distance between: "karolin" and "kathrin" is 3.

Hamming distance

"karolin" and "kerstin" is 3.1011101 and 1001001 is 2.2173896 and 2233796 is 3. Special properties[edit] For binary strings a and b the Hamming distance is equal to the number of ones (population count) in a XOR b.