What's new in Stata 9?


Stata is an environment for manipulating and analyzing data using statistical and graphical methods. Stata is an integrated package — not a collection of separate modules. You can intersperse data management, statistical, and graphical commands.


Data management
data transformations, match-merge, by-group processing, append files, sort, ODBC, XML, outer-joins, row-column transposition, labeling, string functions
 
Basic statistics
summaries, cross-tabulations, correlations, t tests, equality of variance tests, tests of proportions, confidence intervals
 
Linear models
regression; bootstrap, robust Huber/White/sandwich variance estimates, instrumental variables, three-stage least squares, seemingly unrelated regression, constrained regression, quantile regression, GLS
 
Linear mixed, multilevel, and hierarchical models
two-, three-, and multi-way random-intercepts and random-coefficients models; crossed random effects; ML and REML estimation; BLUPs of effects and fitted values;
 
Generalized linear models
ten link functions, user-defined links, seven distributions, ML and IRLS estimation, nine variance estimators, seven residuals
 
Binary, count, and limited dependent variables
logistic, probit, tobit; Poisson and negative-binomial; conditional, multinomial, ordered, rank-ordered, and stereotype logistic; multinomial probit; zero-inflated and zero-truncated count models; selection models; marginal effects;
 
Panel data/cross-sectional time-series
random- and fixed-effects with robust standard errors, linear mixed models, random-effects probit, GEE, random- and fixed-effects Poisson, Arellano–Bond, and instrumental variables regression; AR(1) disturbances;
 
Nonparametric methods
Wilcoxon–Mann–Whitney, Wilcoxon signed ranks and Kruskal–Wallis tests; Spearman and Kendall correlations; Kolmogorov–Smirnov tests; exact binomial CIs
 
ANOVA/MANOVA
balanced and unbalanced designs, factorial, nested, and mixed designs, repeated measures
 
Multivariate methods
factor analysis; principal components; rotation; multidimensional scaling; Procrustean analysis; correspondence analysis; biplots; dendrograms; user-extensible analyses
 
Cluster analysis
hierarchical clustering; kmeans and kmedian nonhierarchical clustering; dendrograms; stopping rules; user-extensible analyses;
 
Resampling and simulation methods
bootstrapping, jackknife, Monte Carlo simulation, permutation tests
 
Model testing and post-estimation support
Wald tests, LR tests, linear and nonlinear combinations, linear and nonlinear predictions, tests of nonlinear restrictions, marginal effects, adjusted means, Hausman tests
 
Graphics
line charts, scatterplots, bar charts, pie charts, hi–lo charts, regression diagnostic graphs, survival plots, nonparametric smoothers, distribution Q–Q plots
 
Survey methods
sampling weights, multistage designs; stratification, poststratification; deff; means, proportions, ratios, totals; summary tables; bootstrap, jackknife, and linearization-based variance estimation; regression, instrumental variables, probit
 
Survival analysis
Kaplan–Meier and Nelson–Aalen estimators, Cox regression (frailty); parametric models (frailty); hazards; time-varying covariates; left and right censoring, Weibull, exponential, and Gompertz analysis;
 
Tools for epidemiologists
standardization of rates, case–control, cohort, matched case–control, Mantel–Haenszel, pharmacokinetics, ROC analysis, ICD-9-CM,
 
Time series
AARIMA, ARCH/GARCH, VAR, VECM, correlograms, periodograms, white-noise tests, unit root tests, Holt–Winters smoothers, Haver Analytics data, rolling and recursive estimation,
 
Maximum likelihood
user-specified functions; NR, DFP, BFGS, BHHH; OIM, OPG, robust, bootstrap, and jackknife matrices; Wald tests; survey data; numeric or analytic derivatives;
 
Transforms and normality tests
Box–Cox transforms, power transforms, Shapiro–Wilk and Shapiro–Francia tests
 
Other statistical methods
sample size and power, nonlinear regression, imputations, stepwise regression, statistical and mathematical functions
 
Matrix commands
multiplication, addition, matrix inversion, eigenvalues and eigenvectors, SVD, Kronecker products, cross-products, matrix expressions
 
Programming language
adding new commands, command scripting, if, while, command parsing, debugging, menu and dialog-box programming, markup and control language
 
Internet capabilities
ability to install new commands, web updating, web file sharing, latest Stata news
 
Sample session


© Copyright 2005 Stata Corporation.