Stata: data analysis and statistical software

Stata is a complete, integrated statistical software package that provides everything you need for data science.

Master your data

Stata's data management features give you complete control.

  • Manage multiple datasets simultaneously
  • Import, export
  • ODBC, SQL
  • Sort, match, merge, join, append, create
  • Built-in spreadsheet
  • Unicode
  • Process text or binary data
  • Access data locally or on the web
  • Collect statistics across groups
  • BLOBs—strings that can hold entire documents
  • Billions of rows
  • Hundreds of thousands of variables
  • Survival data, panel data, multilevel data, survey data, discrete choice data, multiple-imputation data, categorical data, time-series data

And much more, to support all your data science needs.

Explore all of Stata's data management features »

Broad suite of statistical features

Publication-quality graphics

Stata makes it easy to generate publication-quality, distinctly styled graphs.

You can point and click to create a custom graph. Or you can write scripts to produce hundreds or thousands of graphs in a reproducible manner. Export graphs to EPS or TIFF for publication, to PNG or SVG for the web, or to PDF for viewing. With the integrated Graph Editor, you click to change anything about your graph or to add titles, notes, lines, arrows, and text.

Truly reproducible reporting

All the tools you need to automate reporting your results.

  • Dynamic Markdown documents
  • Create Word documents
  • Create PDF documents
  • Create Excel files
  • Schemes for graphics
  • Word, HTML, PDF, SVG, PNG

Truly reproducible research

Lots of folks talk about reproducible research.
Stata has been dedicated to it for over 30 years.

We constantly add new features; we have even fundamentally changed language elements. No matter. Stata is the only statistical package with integrated versioning. If you wrote a script to perform an analysis in 1985, that same script will still run and still produce the same results today. Any dataset you created in 1985, you can read today. And the same will be true in 2050. Stata will be able to run anything you do today.

We take reproducibility seriously.

Real documentation

When it comes time to perform your analyses or understand the methods you are using, Stata does not leave you high and dry or ordering books to learn every detail.

Each of our data management features is fully explained and documented and shown in practice on real examples. Each estimator is fully documented and includes several examples on real data, with real discussions of how to interpret the results. The examples give you the data so you can work along in Stata and even extend the analyses. We give you a Quick start for every feature, showing some of the most common uses. Want even more detail? Our Methods and formulas sections provide the specifics of what is being computed, and our References point you to even more information.

Stata is a big package and so has lots of documentation – over 15,000 pages in 31 volumes. But don't worry, type help my topic, and Stata will search its keywords, indexes, and even community-contributed packages to bring you everything you need to know about your topic. Everything is available right within Stata.

Access the documentation online »

Trusted

We don't just program statistical methods, we validate them.

The results you see from a Stata estimator rest on comparisons with other estimators, Monte Carlo simulations of consistency and coverage, and extensive testing by our statisticians. Every Stata we ship has passed a certification suite that includes 3.2 million lines of testing code that produces 4.9 million lines of output. We certify every number and piece of text from those 4.3 million lines of output.

Easy to use

All of Stata's features can be accessed through menus, dialogs, control panels, a Data Editor, a Variables Manager, a Graph Editor, and even an SEM Diagram Builder. You can point and click your way through any analysis.

If you don't want to write commands and scripts, you don't have to.

Even when you are pointing and clicking, you can record all your results and later include them in reports. You can even save the commands created by your actions and reproduce your complete analysis later.

Easy to grow with

Stata's commands for performing tasks are intuitive and easy to learn. Even better, everything you learn about performing a task can be applied to other tasks. For example, you simply add if gender=="female" to any command to limit your analysis to females in your sample. You simply add vce(robust) to any estimator to obtain standard errors and hypothesis tests that are robust to many common assumptions.

The consistency goes even deeper. What you learn about data management commands often applies to estimation commands, and vice-versa. There is also a full suite of postestimation commands to perform hypothesis tests, form linear and nonlinear combinations, make predictions, form contrasts, and even perform marginal analysis with interaction plots. These commands work the same way after virtually every estimator.

Sequencing commands to read and clean data, then to perform statistical tests and estimation, and finally to report results is at the heart of reproducible research. Stata makes this process accessible to all researchers.

Find out how »

Easy to automate

Everyone has tasks that they do all the time—create a particular kind of variable, produce a particular table, perform a sequence of statistical steps, compute an RMSE, etc. The possibilities are endless. Stata has thousands of built-in procedures, but you may have tasks that are relatively unique or that you want done in a specific way.

If you have written a script to perform your task on a given dataset, it is easy to transform that script into something that can be used on all your datasets, on any set of variables, and on any set of observations.

See how easy automation is in Stata »

Easy to extend

Some of the things you automate may be so useful that you want to share them with colleagues or even make them available to all Stata users. That's also easy. With just a little code, you can turn an automation script into a Stata command. A command that supports standard features that Stata's official commands support. A command that can be used in the same way official commands are used.

Take a look »

Advanced programming

Stata also includes an advanced programming language—Mata.

Mata has the structures, pointers, and classes that you expect in your programming language and adds direct support for matrix programming.

Though you don't need to program to use Stata, it is comforting to know that a fast and complete programming language is an integral part of Stata. Mata is both an interactive environment for manipulating matrices and a full development environment that can produce compiled and optimized code. It includes special features for processing panel data, performs operations on real or complex matrices, provides complete support for object-oriented programming, and is fully integrated with every aspect of Stata. Stata also has comprehensive Python integration, allowing you to harness all the power of Python directly from your Stata code.

Learn more about Mata »

Stata even let's you incorporate C, C++, and Java plugins in your Stata programs via a native API for each language.

Get the most out of your multicore computer.
No other statistical software comes close.
Enjoy the new features of Stata at top speed.

Learn more »

Community-contributed features

Stata is so programmable that developers and users add new features every day to respond to the growing demands of today's researchers.

With Stata's Internet capabilities, new features and official updates can be installed over the Internet with a single click.

World-class technical support

Stata technical support is free to registered users, which means you get much more than you pay for.

We have a dedicated staff of expert Stata programmers and statisticians to answer your technical questions. From tricky data management solutions to getting your graph looking just right and from explaining a robust standard error to specifying your multilevel model, we have your answers.

Cross-platform compatible

Stata will run on Windows, Mac, and Linux/Unix computers; however, our licenses are not platform specific.

That means if you have a Mac laptop and a Windows desktop, you don't need two separate licenses to run Stata. You can install your Stata license on any of the supported platforms. Stata datasets, programs, and other data can be shared across platforms without translation. You can also quickly and easily import datasets from other statistical packages, spreadsheets, and databases.

View compatible operating systems »

Widely used

Used by researchers for more than 30 years, Stata provides everything you need for data science—data manipulation, visualization, statistics, and reproducible reporting.

Select your discipline and see how Stata can work for you.

Behavioral sciences

Education

Institutional research

Public health

Biostatistics

Epidemiology

Medicine

Public policy

Data science

Finance, business, and marketing

Political science

Sociology

Economics

Can't find your discipline? See who else is using Stata »

Stata is distributed in more than 180 countries. Our authorized distributors offer services such as basic technical support and training, and carry Stata inventory so delivery is fast.

Comprehensive resources

Video tutorials

Stata's YouTube channel is the perfect resource for new users to Stata, users wanting to learn a new feature in Stata, and professors looking for aids in teaching with Stata. We have over 250 videos on our YouTube channel that have been viewed over 6 million times by Stata users wanting to learn how to label variables, merge datasets, create scatterplots, fit regression models, work with time-series or panel data, fit multilevel models, analyze survival data, perform Bayesian analylsis, and use many other features of Stata. View the complete list of videos.

Visit our YouTube channel »

Stata Blog

We write the official Stata Blog, Not Elsewhere Classified (NEC), to share things we think you will find instructive, informative, or just plain entertaining. We have written about how to interpret statistical results; export results into Word, Excel, and LaTeX; perform Monte Carlo simulations; program your own estimators; and more. We also post service and product announcements. Individually signed, the articles in NEC are written by the same people who develop and support Stata.

Read our latest blog post »

Free Stata webinars

Stata webinars offer something for everyone. Those new to Stata will get a head start when they join our Ready. Set. Go Stata webinar. Both new and experienced users will want to join our Tips and Tricks webinar and our one-hour feature webinars; each one provides an in-depth look at one of Stata's statistical, graphical, data management, or reporting features.

View the current webinar offerings »

Training

A multitude of training options are available to become proficient at Stata quickly. Stata provides hands-on classroom and web-based training courses, customized on-site training courses, and online training through NetCourses, webinars, and video tutorials.

View available trainings »

Stata Press

Stata Press® publishes books, manuals, and journals about Stata and general statistics topics for professional researchers of all disciplines. Stata Press® publications, along with books recommended by StataCorp, can be found in the Stata Bookstore.

Visit the Stata Bookstore »

Stata News

The Stata News is a free publication with columns such as the popular In the Spotlight, where Stata developers give insight into specific Stata features, and the User's corner, where we share unique, helpful, and fun contributions from the user community. The News also contains announcements such as new releases and updates, training schedules, new books, Conferences, and Users Group meetings.

Sign up to receive the Stata News »

Stata Journal

The Stata Journal is a quarterly publication containing articles about statistics, data analysis, teaching methods, and effective use of Stata's language. The Journal publishes reviewed papers together with shorter notes and comments, regular columns, book reviews, and other materials of interest to researchers applying statistics in a variety of disciplines.

Subscribe to the Stata Journal »

Vibrant community

Stata Conference and Users Group meetings

Whether you are a beginner or an expert, you will find something just for you at the Users Group meetings, which are held each year in various locations around the world. These meetings showcase in-depth presentations from StataCorp experts and experienced Stata users that focus on helping you use Stata more effectively.

View upcoming meetings »

Statalist

A great resource for users is Statalist, a forum where more than 35,000 Stata users exchange roughly 5,000 postings and responses each month. Statalist is run and moderated by Stata users and maintained by StataCorp.

Join the forum »

User comments

Our users love to share how great Stata is, so we'd like to show you! When we receive nice comments about Stata, we post them here. If you think Stata is great too, send us an email with your comment, and we may share it with the Stata community.

See what users are saying »

Affordable

Stata is not sold in modules, which means you get everything in one package!

Stata offers several purchase options to fit your budget. You can choose an annual license to ensure you always have access to the latest features or you can choose a perpetual license. Contact a sales representative or browse our products to find out more about our affordable options. You can also download a product brochure.

Learn which Stata is right for you »

Stata is a complete, integrated software package that provides all your data science needs—data manipulation, visualization, statistics, and reproducible reporting. Stata is not sold in modules, which means you get everything you need in one package. And, you can choose a perpetual licence, with nothing more to buy ever. Annual licences are also available.

Whether you're a student or a seasoned research professional, a range of Stata packages are available and designed to suit all needs.

All of the following flavours of Stata have the same complete set of commands and features and manuals included as PDF documentation within Stata.

Stata/MP

Stata/MP is the fastest and largest version of Stata. Virtually any current computer can take advantage of the advanced multiprocessing of Stata/MP. This includes the Intel i3, i5, i7, i9, Xeon, and Celeron, and AMD multi-core chips. On dual-core chips, Stata/MP runs 40% faster overall and 72% faster where it matters, on the time-consuming estimation commands. With more than two cores or processors, Stata/MP is even faster.

Stata/MP is faster-much faster. Stata/MP lets you analyze data in one-half to two-thirds of the time compared to Stata/SE on inexpensive dual-core laptops and in one-quarter to one-half the time on quad-core desktops and laptops.

Stata/MP runs even faster on multiprocessor servers. Stata/MP supports up to 64 processors/cores.

Speed is often most crucial when performing computationally intense estimation procedures. A few of Stata’s estimation procedures, including linear regression, are nearly perfectly parallelized, meaning they run twice as fast on two cores, four times as fast on four cores, eight times as fast on eight cores, and so on. Some estimation commands can be parallelized more than others. Taken at the median, estimation commands runs 1.9 times faster on two cores, 3.1 times faster on four cores, and 4.3 times faster on eight cores.

Stata/MP is 100% compatible other versions of with Stata. Analyses do not have to be reformulated or modified in any way to obtain Stata/MP’s speed improvements.

Stata/MP is available for the following operating systems:

  • Windows (64-bit processors);
  • macOS (64-bit Intel processors);
  • Linux (64-bit processors);

To run Stata/MP, you can use a desktop computer with a dual-core or quad-core processor, or you can use a server with multiple processors. Whether a computer has separate processors or one processor with multiple cores makes no difference. More processors or cores makes Stata/MP run faster.

For more advice on purchasing/upgrading to Stata/MP or for hardware queries, please contact our sales team.

Return to menu

Stata/SE

Stata/SE and Stata/IC differ only in the dataset size that each can analyse. Stata/SE and Stata/MP can fit models with more independent variables than Stata/IC (up to 10,998). Stata/SE can analyse up to 2 billion observations.

Return to menu

Stata/IC

Stata/IC allows datasets with as many as 2,048 variables. The maximum number of observations is 2.14 billion. Stata/IC can have at most 798 independent variables in a model.

Return to menu

Comparison of features

Product Features

Stata/IC

Stata/SE

Stata/MP

Maximum number of variables

Info

2,048

32,767

120,000

Maximum number of observations

Info

2.14 billion

2.14 billion

Up to 20 billion

Maximum number of independent variables

Info

798

10,998

10,998

Multicore support

Time to run logistic regression with 5 million obs and 10 covariates Info

1-core

10.0 sec

1-core

10.0 sec

2 core

5.0 sec

4 core

2.6 sec

4+

even faster

Complete suite of statistical features

Info

Yes

Yes

Yes

Yes

Yes

Publication-quality graphics

Info

Yes

Yes

Yes

Yes

Yes

Matrix programming language

Yes

Yes

Yes

Yes

Yes

Complete PDF documentation

Info

Yes

Yes

Yes

Yes

Yes

Exceptional technical support

Yes

Yes

Yes

Yes

Yes

Includes within-release updates

Yes

Yes

Yes

Yes

Yes

64-bit version available

Yes

Yes

Yes

Yes

Yes

Windows, Mac, or Unix

Info

Yes

Yes

Yes

Yes

Yes

Memory requirements

1 GB

2 GB

4 GB

Disk space requirements

1 GB

1 GB

1 GB

For Business License terms and options please click here.

For Educational License terms and options please click here.

Return to menu

All versions of Stata run on dual-core, multi-core and multi-processor computers.

Stata for Windows

  • Windows 10 *
  • Windows 8 *
  • Windows 7 *
  • Windows Vista *
  • Windows Server 2016, 2012, 2008, 2003 *

* 64-bit Windows varieties for x86-64 and x86 processors made by Intel® and AMD.

Stata for Mac

  • Stata for macOS requires 64-bit Intel® processors (Core™2 Duo or better) running macOS 10.9 or newer

Stata for Unix

  • Linux: Any 64-bit (x86-64 or compatible) running Linux.
  • For xstata, you need to have GTK 2.24 installed

Please note: The 32-bit and Solaris download options are still available, but by request only.

Hardware requirements

  • Minimum of 1 GB of RAM for Stata/IC, 2GB for Stata/SE and 4GB for Stata/MP
  • Minimum of 1 GB of disk space for all versions
  • Stata for Unix requires a video card that can display thousands of colours or more (16-bit or 24-bit colour)

Find out all about Stata’s expansive range of statistical features using the table of contents below. Each section links to further details and examples to help users get the best out of their software.

Linear models Time series Data management Longitudinal data/panel data
Survival analysis Graphics Multilevel mixed-effects models Bayesian analysis
Documentation Graphical user interface Binary, count, and limited outcomes Power and sample size
Basic data science Extended regression models (ERMs) Treatment effects/Causal inference Other statistical methods
Generalized linear models (GLMs) SEM (structural equation modeling) Functions Finite mixture models (FMMs)
Internet capabilities Spatial autoregressive models Latent class analysis community-contributed commands
Programming features ANOVA/MANOVA Multiple imputation Mata – Stata’s serious programming language
Exact data science Survey methods Embedded statistical computations Linearized DSGE models
Cluster analysis Installation Qualification Tests, predictions, and effects IRT (item response theory)
Accessibility Contrasts, pairwise comparisons, and margins Multivariate methods Nonparametric methods
GMM and nonlinear regression Epidemiology Simple maximum likelihood Programmable maximum likelihood
Resampling and simulation methods

https://www.stata-uk.com/software/stata/

Start Again

Please select an educational licence type:

Faculty Students

Please select an upgrade licence type:

Business Educational

Please select an upgrade licence type:

Single User / Volume Single Users Network (Concurrent Use)

I currently own a Stata license for:

Students may purchase Stata/MPStata/SE and Stata/IC at a discounted price. For more information about available licence types, click here.

Post your comment

Timberlake Consultants