Sometimes you need mock or test data to test a program, run an analysis, or populate a database. At other times you need sample data to illustrate or explain a concept, but you do not want to use your company's data or other sensitive data.
There are a few websites where you can generate mock or sample data to test your programs or algorithms.
My favourite ones, all of which are FREE, are:
- Yandataellan: You can generate and save your file in any of these formats:
  - CSV – comma, semicolon, pipe, space, other
  - JSON – simple, complex
  - Excel – 1997, 2007
  - SQL – MySQL, Oracle, SQL Server, Postgres, SQLite
  - HTML – table, ul, dl
  - XML

  Of the tools listed here, this is the only one I have seen so far that lets you generate up to 10,000 rows/records for free. It also offers a wider variety of data types than the others.
- Mockaroo: You can generate and save your file in any of these formats:
  - CSV
  - JSON
  - Tab-delimited
  - SQL
  - Custom
  - Excel
  - XML
  - DBUnit XML

  Mockaroo lets you generate up to 1,000 rows of sample data for free.
- GenerateData: With GenerateData, you can generate and save your file in any of these formats:
  - CSV
  - Excel
  - HTML
  - JSON
  - LDIF
  - Programming languages such as JavaScript, PHP, Perl, Ruby
  - SQL
  - XML

  GenerateData lets you generate up to 100 rows of data for free.
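Once you have downloaded a file, using it in a test is straightforward. Here is a minimal sketch in Python, assuming you exported CSV with a header row. The function name `load_mock_rows`, the column names and the inline sample are all made up for illustration; they stand in for whatever your generated file actually contains.

```python
import csv
import io


def load_mock_rows(stream):
    """Read a CSV export into a list of dicts keyed by the header row."""
    reader = csv.DictReader(stream)
    return [dict(row) for row in reader]


# Made-up sample standing in for a downloaded file; the columns are hypothetical.
sample_csv = io.StringIO(
    "id,first_name,email\n"
    "1,Ada,ada@example.com\n"
    "2,Grace,grace@example.com\n"
)

rows = load_mock_rows(sample_csv)

# Basic sanity checks before handing the rows to the code you want to test.
assert len(rows) == 2
assert all(row["email"] for row in rows)
```

For a real file you would pass an open file object instead, for example `with open("mock_data.csv", newline="") as f: rows = load_mock_rows(f)`, where `mock_data.csv` is a placeholder name.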
You should find these sites helpful, so you need not worry about getting test data for your data science, analytics and database projects.
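For database projects, the generated rows can go straight into a scratch database. The sketch below is one possible approach, assuming a CSV export with a header row and using Python's built-in `sqlite3` module with an in-memory database. The helper `populate_table`, the table name `mock_people` and the sample rows are hypothetical, and every column is stored as text to keep the example short.

```python
import csv
import io
import sqlite3


def populate_table(conn, stream, table="mock_people"):
    """Create a table from the CSV header and insert every row as text.

    The table and column names are interpolated into the SQL, so only use
    this with files and names you trust.
    """
    reader = csv.reader(stream)
    header = next(reader)
    columns = ", ".join(f'"{name}" TEXT' for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE "{table}" ({columns})')
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', reader)
    conn.commit()
    return conn.execute(f'SELECT COUNT(*) FROM "{table}"').fetchone()[0]


# Made-up sample standing in for a downloaded file; the columns are hypothetical.
sample = io.StringIO(
    "id,first_name,city\n"
    "1,Ada,London\n"
    "2,Grace,New York\n"
)

conn = sqlite3.connect(":memory:")
inserted = populate_table(conn, sample)
```

If you export SQL from one of the sites instead, you can skip this step and run the generated statements directly against your database.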
Happy generating!
Update: You can also check Quandl for large datasets to use with Python.