Документация Engee

RDatasets.jl

Страница в процессе перевода.

The RDatasets package provides an easy way for Julia users to experiment with most of the standard data sets that are available in the core of R as well as datasets included with many of R’s most popular packages. This package is essentially a simplistic port of the Rdatasets repo created by Vincent Arelbundock, who conveniently gathered data sets from many of the standard R packages in one convenient location on GitHub at https://github.com/vincentarelbundock/Rdatasets

In order to load one of the data sets included in the RDatasets package, you will need to have the DataFrames package installed. This package is automatically installed as a dependency of the RDatasets package if you install RDatasets as follows:

Pkg.add("RDatasets")

After installing the RDatasets package, you can then load data sets using the dataset() function, which takes the name of a package and a data set as arguments:

using RDatasets
iris = dataset("datasets", "iris")
neuro = dataset("boot", "neuro")

Data Sets

The RDatasets.packages() function returns a table of represented R packages:

Package Title

COUNT

Functions, data and code for count data.

Ecdat

Data sets for econometrics

HSAUR

A Handbook of Statistical Analyses Using R (1st Edition)

HistData

Data sets from the history of statistics and data visualization

ISLR

Data for An Introduction to Statistical Learning with Applications in R

KMsurv

Data sets from Klein and Moeschberger (1997), Survival Analysis

MASS

Support Functions and Datasets for Venables and Ripley’s MASS

SASmixed

Data sets from "SAS System for Mixed Models"

Zelig

Everyone’s Statistical Software

adehabitatLT

Analysis of Animal Movements

boot

Bootstrap Functions (Originally by Angelo Canty for S)

car

Companion to Applied Regression

cluster

Cluster Analysis Extended Rousseeuw et al.

datasets

The R Datasets Package

gamair

Datasets used in the book Generalized Additive Models: An Introduction with R

gap

Genetic analysis package

ggplot2

An Implementation of the Grammar of Graphics

lattice

Lattice Graphics

lme4

Linear mixed-effects models using Eigen and S4

mgcv

Mixed GAM Computation Vehicle with GCV/AIC/REML smoothness estimation

mlmRev

Examples from Multilevel Modelling Software Review

nlreg

Higher Order Inference for Nonlinear Heteroscedastic Models

plm

Linear Models for Panel Data

plyr

Tools for splitting, applying and combining data

pscl

Political Science Computational Laboratory, Stanford University

psych

Procedures for Psychological, Psychometric, and Personality Research

quantreg

Quantile Regression

reshape2

Flexibly Reshape Data: A Reboot of the Reshape Package.

robustbase

Basic Robust Statistics

rpart

Recursive Partitioning and Regression Trees

sandwich

Robust Covariance Matrix Estimators

sem

Structural Equation Models

survival

Survival Analysis

vcd

Visualizing Categorical Data

The RDatasets.datasets() function returns a table describing the 700+ included datasets. Or pass in a package name (e.g. RDatasets.datasets("mlmRev")) for a targeted table:

Package Dataset Title Rows Columns

mlmRev

Chem97

Scores on A-level Chemistry in 1997

31022

8

mlmRev

Contraception

Contraceptive use in Bangladesh

1934

6

mlmRev

Early

Early childhood intervention study

309

4

mlmRev

Exam

Exam scores from inner London

4059

10

mlmRev

Gcsemv

GCSE exam score

1905

5

mlmRev

Hsb82

High School and Beyond - 1982

7185

8

mlmRev

Mmmec

Malignant melanoma deaths in Europe

354

6

mlmRev

Oxboys

Heights of Boys in Oxford

234

4

mlmRev

ScotsSec

Scottish secondary school scores

3435

6

mlmRev

bdf

Language Scores of 8-Graders in The Netherlands

2287

28

mlmRev

egsingle

US Sustaining Effects study

7230

12

mlmRev

guImmun

Immunization in Guatemala

2159

13

mlmRev

guPrenat

Prenatal care in Guatemala

2449

15

mlmRev

star

Student Teacher Achievement Ratio (STAR) project data

26796

18

Licensing and Intellectual Property

Following Vincent’s lead, we have assumed that all of the data sets in this repository can be made available under the GPL-3 license. If you know that one of the datasets released here should not be released publicly or if you know that a data set can only be released under a different license, please contact me so that I can remove the data set from this repository.