Stata is a general-purpose statistical software package created in 1985 by StataCorp. Most of its users work in research, especially in the fields of economics, sociology, political science, biomedicine and epidemiology.Stata’s capabilities include data management, statistical analysis, graphics, simulations, regression analysis (linear and multiple), and custom programming.
The name Stata is a portmanteau of the words statistics and data. The correct English pronunciation of Stata “must remain a mystery”; any of “Stay-ta”, “Sta-ta” or “Stah-ta” are considered acceptable.
There are four major builds of each version of Stata:
– Stata/MP for multiprocessor computers (including dual-core and multicore processors)
– Stata/SE for large databases
– Stata/IC, which is the standard version
– Small Stata, which is a smaller, student version for educational purchase only
Stata has always emphasized a command-line interface, which facilitates replicable analyses. Starting with version 8.0, however, Stata has included a graphical user interface which uses menus and dialog boxes to give access to nearly all built-in commands. This generates code which is always displayed, easing the transition to the command line interface and more flexible scripting language. The dataset can be viewed or edited in spreadsheet format. From version 11 on, other commands can be executed while the data browser or editor is opened.
Data structure and storage
Stata can only open a single dataset at any one time. Stata holds the entire dataset in (random-access or virtual) memory, which limits its use with extremely large datasets. This is mitigated to some extent by efficient internal storage, as there are integer storage types which occupy only one or two bytes rather than four, and single-precision (4 bytes) rather than double-precision (8 bytes) is the default for floating-point numbers.
Stata is offered in many licenses. From student to professional use. A “small” version is available for small datasets at a lower price point.
The dataset is always rectangular in format, that is, all variables hold the same number of observations (in more mathematical terms, all vectors have the same length, although some entries may be missing values).
regression • censored outcomes • endogenous regressors • bootstrap, jackknife, and robust and cluster–robust variance • instrumental variables • three-stage least squares • constraints • quantile regression • GLS • more
random and fixed effects with robust standard errors • linear mixed models • random-effects probit • GEE • random- and fixed-effects Poisson • dynamic panel-data models • instrumental variables • panel unit-root tests • more
Multilevel mixed-effects models
continuous, binary, count, and survival outcomes • two-, three-, and higher-level models • generalized linear models • nonlinear mode
Turnitin+Grammarly Premium Account for 12 Months (2 in 1 Package)Rated 0 out of 5
Udemy Premium Account for One YearRated 0 out of 5
Lynda Premium Account 12 MonthsRated 0 out of 5
Scribd Premium Account 12 MonthsRated 0 out of 5