Stata: data analysis and statistical software

Stata is a complete, integrated statistical software package that provides everything you need for data analysis, data management, and graphics.

Fast, accurate, and easy to use

With both a point-and-click interface and a powerful, intuitive command syntax, Stata is fast, accurate, and easy to use.

All analyses can be reproduced and documented for publication and review. Version control ensures statistical programs will continue to produce the same results no matter when you wrote them. See certification results and FDA document compliance for accuracy details.

Broad suite of statistical features

Stata puts hundreds of statistical tools at your fingertips:

Standard methods

  • Basic tabulations and summaries
  • Case-control analysis
  • ARIMA
  • ANOVA and MANOVA
  • Linear regression
  • Time-series smoothers
  • Generalized linear models (GLM)
  • Cluster analysis
  • Contrasts and comparisons
  • Power analysis
  • Sample selection
  • Choice modeling

Advanced techniques

  • Multilevel models
  • Survival models with frailty
  • Dynamic panel data (DPD) regressions
  • SEM (Structural equation modeling)
  • Binary count and censored outcomes
  • ARCH
  • Multiple imputation
  • Survey data
  • Treatment effects
  • Exact statistics
  • Bayesian analysis
  • Latent class analysis (LCA) for Finite mixture models (FMM)

Programming with Mata

Mata is a full-blown programming language that compiles what you type into bytecode, optimizes it, and executes it fast.Though you don't need to program to use Stata, it is comforting to know that a fast and complete matrix programming language is an integral part of Stata. Mata is both an interactive environment for manipulating matrices and a full development environment that can produce compiled and optimized code. It includes special features for processing panel data, performs operations on real or complex matrices, provides complete support for object-oriented programming, and is fully integrated with every aspect of Stata.

Trusted

We don't just write statistical methods, we validate them. The results you see from a Stata estimator rest on comparisons with other estimators, Monte-Carlo simulations of consistency and coverage, and extensive testing by our statisticians. Every Stata we ship has passed a certification suite that includes 2.3 million lines of testing code that produces 4.3 million lines of output. We certify every number and piece of text from those 4.3 million lines of code. Technical supportStata technical support is free to registered users. And, this is a case of getting much more than you pay for.

Technical support

We have a dedicated staff of expert Stata programmers and Statisticians to answer your technical questions. From tricky data management solutions to getting your graph looking just right. From explaining a robust standard error to specifying your multilevel model. We have your answers.

Extensible

Stata is so programmable that developers and users add new features every day to respond to the growing demands of today's researchers. With Stata's Internet capabilities, new features and official updates can be installed over the Internet with a single click.

Complete data-management facilities

Stata's data-management features give you complete control of all types of data.

You can combine and reshape datasets, manage variables, and collect statistics across groups or replicates. You can work with byte, integer, long, float, double, and string variables (including BLOBs and strings up to 2 billion characters). Stata also has advanced tools for managing specialised data such as survival/duration data, time-series data, panel/longitudinal data, categorical data, multiple-imputation data, and survey data.

Data Editor

Publication-quality graphics

You can point and click to create a custom graph, or you can write scripts to produce hundreds or thousands of graphs in a reproducible manner. Export graphs to EPS or TIF for publication, to PNG or SVG for the web, or to PDF for viewing. With the integrated Graph Editor you click to change anything about your graph or to add titles, notes, lines, arrows, and text.

  • Regression fit graphs
  • Distributional plots
  • Time-series graphs,
  • Survival plots
  • Contour plots

Real documentation

When it comes time to perform your analyses or understand the methods you are using, Stata does not leave you high and dry or ordering books to learn every detail.

Each of our data management features is fully explained, and documented, and shown in practice on real examples. Each estimator is fully documented and includes several examples on real data, with real discussions of how to interpret the results. The examples give you the data so you can work along in Stata and even extend the analyses. We give you Quick Starts for every feature showing some of the most common uses. Want even more detail, our Methods and Formulas sections provide the specifics of what is being computed and our References point you to even more information.

Stata is a big package and so has lots of documentation – over 14,000 pages in 27 volumes. But don't worry, type help my topic and Stata will search its keywords, indices, and even community-contributed packages to bring you everything you need to know about your topic. Everything is available right within Stata.

Cross-platform compatible

Stata will run on Windows, Mac and Linux/Unix computers; however, licenses are not platform specific.

That means if you have a Mac laptop and a Windows desktop, you don't need two separate licenses to run Stata. You can install your Stata license on any of the supported platforms. Stata datasets, programs, and other data can be shared across platforms without translation. You can also quickly and easily import datasets from other statistical packages, spreadsheets, and databases.

Resources & Community

Stata Journal

The Stata Journal is a quarterly publication containing articles about statistics, data analysis, teaching methods, and effective use of Stata's language. The Journal publishes reviewed papers together with shorter notes and comments, regular columns, book reviews, and other material of interest to researchers applying statistics in a variety of disciplines.

Stata Journal

Stata Press

Stata Press® publishes books, manuals, and journals about Stata and general statistics topics for professional researchers of all disciplines. Stata Press® publications are available to purchase in our Bookshop

Visit Bookshop

Stata News

The Stata News is a free publication containing articles on using Stata, announcements of new releases and updates, training schedules, new books, Conferences and Users Group meetings, new products, and other announcements of interest to Stata users.

Sign up to Stata News

Stata Blog

The offical Stata Blog, Not Elsewhere Classified (NEC), will keep you up to date about all things related to Stata, including product announcements, service announcements such as on-site and public training, and timely tips and comments related to the use of Stata. Individually signed, the articles in NEC are written by the same people who develop, support, and sell Stata. NEC is informal but useful, and even entertaining.

Latest Stata blog

Training

There are a multitude of training options available to become proficient at Stata quickly. Stata provides hands-on public training courses, one-hour feature webinars, customized on-site training courses, and online training through NetCourses and video tutorials.

All courses

Stata Conference and UGM's

Whether you are a beginner or an expert, you will find something just for you at the Users Group meetings (UGM's), which are held each year in various locations around the world. These meetings showcase in-depth presentations from StataCorp experts and experienced Stata users that focus on helping you use Stata more effectively.

Upcoming meetings

Stata 15 is a complete, integrated statistical package that provides everything you need for data analysis, data management, and graphics. Stata is not sold in modules, which means you get everything you need in one package. And, you can choose a perpetual licence, with nothing more to buy ever. Annual licences are also available.

Whether you're a student or a seasoned research professional, a range of Stata packages are available and designed to suit all needs.

All of the following flavours of Stata have the same complete set of commands and features and manuals included as PDF documentation within Stata.

Stata/MP

Stata/MP is the fastest and largest version of Stata. Most computers purchased since mid 2006 can take advantage of the advanced multiprocessing of Stata/MP. This includes the Intel Core™ 2 Duo, i3, i5, i7, and the AMD X2 dual-core chips. On dual-core chips, Stata/MP runs 40% faster overall and 72% faster where it matters - on the time-consuming estimation commands. With more than two cores or processors, Stata/MP is even faster.

Stata/MP provides the most extensive support for multiprocessor computers and multicore computers of any statistics and data-management package.

The exciting thing about Stata/MP is that it runs faster—much faster. Stata/MP lets you analyse data in one-half to two-thirds of the time compared with Stata/SE on inexpensive dual-core desktops and laptops and in one-quarter to one-half the time on quad-core desktops. Stata/MP runs even faster on multiprocessor servers. Stata/MP supports up to 64 processors/cores.

In a perfect world, software would run twice as fast on two cores, four times as fast on four cores, eight times as fast on eight cores, and so on. Across all commands, Stata/MP runs 1.7 times faster on two cores, 2.4 times faster on four cores, and 3.2 times faster on eight cores. These values are median speed improvements. Half the commands run even faster.

On the other side of the distribution, a few commands do not run faster, often because they are inherently sequential, such as time-series commands.

Stata worked hard to make sure that the performance gains for commands that take longer to run would be greater. Across all estimation commands, Stata/MP runs 1.9 times faster on two cores, 3.1 times faster on four cores, and 4.1 times faster on computers with eight cores.

Stata/MP is 100% compatible other versions of with Stata. Analyses do not have to be reformulated or modified in any way to obtain Stata/MP’s speed improvements.

Stata/MP is available for the following operating systems:

  • Windows (32- and 64-bit processors);
  • macOS (64-bit Intel processors);
  • Linux (32- and 64-bit processors);
  • Solaris (64-bit SPARC and x86-64).

To run Stata/MP, you can use a desktop computer with a dual-core or quad-core processor, or you can use a server with multiple processors. Whether a computer has separate processors or one processor with multiple cores makes no difference. More processors or cores makes Stata/MP run faster.

For more advice on purchasing/upgrading to Stata/MP or for hardware queries, please contact our sales team.

Return to menu

Stata/SE

Stata/SE and Stata/IC differ only in the dataset size that each can analyse. Stata/SE and Stata/MP can fit models with more independent variables than Stata/IC (up to 10,998). Stata/SE can analyse up to 2 billion observations.

Return to menu

Stata/IC

Stata/IC allows datasets with as many as 2,048 variables. The maximum number of observations is 2.14 billion. Stata/IC can have at most 798 independent variables in a model.

Return to menu

Comparison of features

Product Features

Stata/IC

Stata/SE

Stata/MP

Maximum number of variables

Info

2,048

32,767

120,000

Maximum number of observations

Info

2.14 billion

2.14 billion

Up to 20 billion

Maximum number of independent variables

Info

798

10,998

10,998

Multicore support

Time to run logistic regression with 5 million obs and 10 covariates Info

1-core

10.0 sec

1-core

10.0 sec

2 core

5.0 sec

4 core

2.6 sec

4+

even faster

Complete suite of statistical features

Info

Yes

Yes

Yes

Yes

Yes

Publication-quality graphics

Info

Yes

Yes

Yes

Yes

Yes

Matrix programming language

Yes

Yes

Yes

Yes

Yes

Complete PDF documentation

Info

Yes

Yes

Yes

Yes

Yes

Exceptional technical support

Yes

Yes

Yes

Yes

Yes

Includes within-release updates

Yes

Yes

Yes

Yes

Yes

64-bit version available

Yes

Yes

Yes

Yes

Yes

Windows, Mac, or Unix

Info

Yes

Yes

Yes

Yes

Yes

Memory requirements

1 GB

2 GB

4 GB

Disk space requirements

1 GB

1 GB

1 GB

For Business License terms and options please click here.

For Educational License terms and options please click here.

Return to menu

All versions of Stata run on dual-core, multi-core and multi-processor computers.

Stata for Windows

  • Windows 10 *
  • Windows 8 *
  • Windows 7 *
  • Windows Vista *
  • Windows Server 2016, 2012, 2008, 2003 *

* 64-bit and 32-bit Windows varieties for x86-64 and x86 processors made by Intel® and AMD.

Stata for Mac

  • Stata for macOS requires 64-bit Intel® processors (Core™2 Duo or better) running macOS 10.9 or newer

Stata for Unix

  • Linux: Any 64-bit (x86-64 or compatible) or 32-bit (x86 or compatible) running Linux.
  • For xstata, you need to have GTK 2.24 installed

Hardware requirements

  • Minimum of 1 GB of RAM for Stata/IC, 2GB for Stata/SE and 4GB for Stata/MP
  • Minimum of 1 GB of disk space for all versions
  • Stata for Unix requires a video card that can display thousands of colours or more (16-bit or 24-bit colour)

Find out all about Stata’s expansive range of statistical features using the table of contents below. Each section links to further details and examples to help users get the best out of their software.

Linear models Time series Data management Longitudinal data/panel data
Survival analysis Graphics Multilevel mixed-effects models Bayesian analysis
Documentation Graphical user interface Binary, count, and limited outcomes Power and sample size
Basic statistics Extended regression models (ERMs) Treatment effects Other statistical methods
Generalized linear models (GLMs) SEM (structural equation modeling) Functions Finite mixture models (FMMs)
Internet capabilities Spatial autoregressive models Latent class analysis User-written commands
Programming features ANOVA/MANOVA Multiple imputation Matrix programming—Mata
Exact statistics Survey methods Embedded statistical computations Linearized DSGE models
Cluster analysis Installation Qualification Tests, predictions, and effects IRT (item response theory)
Accessibility Contrasts, pairwise comparisons, and margins Multivariate methods Nonparametric methods
GMM and nonlinear regression Epidemiology Simple maximum likelihood Programmable maximum likelihood
Resampling and simulation methods

http://www.stata-uk.com/software/stata/

Start Again

Please select an educational licence type:

Faculty Students

Please select a upgrades licence type:

Business Educational

Please select a upgrades licence type:

Single User / Volume Single Users Network (Concurrent Use)

I currently own a Stata license for:

Students may purchase Stata/MPStata/SE and Stata/IC at a discounted price. For more information about available licence types, click here.

Post your comment

Timberlake Consultants