I made over 100 databases during my life and posted them on my website for everyone, from students practicing data analysis, to multi-national companies in real estate and automobile industry.
Most projects I made from personal interest to distribute freely or sell for professional use. Some projects I made at request for a specific customer and published on website to allow other customers to purchase if interested. I offer web scraping services, making custom databases according your requirements.
Not included in below list are the databases made as one-time project for single customers outside of my fields of interest, few being under non-disclosure agreement.
|Project URL||Type||Size (KB)||Pages / rows||Method||Made in||Update frequency||Price|
|Solar System (Word version, DELETED)||DOC||manual||2000 ?||abandoned||free|
|Solar System (Excel version)||XLS||45||35 rows||manual||2014||abandoned||free|
|Solar System Articles||DOC||1188||249 pages||copy-paste||2019||no||$20|
|Solar System Database||XLS||217||224 rows, 42 columns||copy-paste||2019||at new discoveries||$22|
|World countries & facts
(Word, based on old atlas, DELETED)
|World countries & facts
(Excel, based Encarta 2002)
|World countries & facts
(based on The World Factbook)
|XLSX||2208||268 rows||scraping||2017||by request (low sales)||$36|
|World cities population (original DELETED)||DOC||727||130 pages||manual||2003-2005||deleted||free|
|World cities population (Word simple)||DOC||477||50 pages||manual||2016||by request (low sales)||$10|
|World cities population (Word detailed)||DOC||781||150 pages||manual||2016||by request (low sales)||$20|
|World cities population (Excel detailed)||XLS||1075||7500+ rows||manual||2016||by request (low sales)||$40|
|World tallest buildings database||XLS||5425||15000+ rows||scraping||2015||no (no sales)||$150|
|World tallest buildings database||XLS||91627||160000+ rows||scraping||2016||no (no sales)||$800|
|World tallest buildings database||XLS||24112||30000+ rows, 117 columns||scraping||2019||by request (low sales)||$300|
|Airports & Airfields Database||XLS||25648||55000+ rows, 20 columns||scraping||Aug 2019||by request (low sales)||$110|
Singapore real estate
|Database of HDB Blocks||XLS||6000+||14000+ rows||manual||2009||at new announcements||$150-1200|
|Database of HDB Resale Flat Prices 2009-2013||XLS||24307||160000+ rows||copy-paste||2009||no|
|Database of HDB Resale Flat Prices 1990-2019||XLSX||60000+||700000+ rows||copy-paste||2017||by request (low sales)|
|Database of condos SingaporeExpats||XLS||2981||3100+ rows||scraping||2015||by request (low sales)||about $150|
|Database of condos PropertyGuru||XLS||2334||3200+ rows||scraping||2016||by request (low sales)||about $300|
|Database of all buildings||XLSX||17275||140000+ rows||scraping||2017||by request (low sales)||about $700|
|Managing Agent Database||XLS||1185||4400+ rows||scraping||2017||by request (low sales)||about $200|
|HDB Median Resale Prices||XLSX||291||1500+ rows||copy-paste||2013||every 3 months||$10|
|List of BTO & DBSS projects||XLS||103||300+ rows||manual||2009||at new announcements||free|
|List of BTO prices||XLS||183||700+ rows||manual||2015||at new announcements||about $30|
|List of SERS sites||XLS||49||80 rows||manual||2009||at new announcements||free|
|List of HUDC estates||XLS||64||24 rows||manual||2009||at new announcements||free|
|List of Executive Condominiums||XLS||132||70+ rows||manual||2009||at new announcements||free|
|List of MRT and LRT stations||XLS||84||227 + 43 rows||copy-paste||Nov 2019||at new announcements||$27|
Hong Kong real estate
|Hong Kong Public & Private Housing Estates
(Gohome and Centadata mixed up)
|XLS||592||1000+ rows||manual||2011||abandoned||about $200|
|Hong Kong Housing Database Estates
|XLS||806||3364 rows, 10 columns||scraping||Sep 2016||by request (low sales)||about $70|
|Hong Kong Housing Database Buildings
|XLS||4093||24370 rows, 14 columns||scraping||May 2018||by request (low sales)||about $500|
|Hong Kong Housing Authority||XLS||420||476 rows, 36 columns||scraping||May 2018||by request (low sales)||about $50|
|Car Models List (Worldwide)||DOC||403||59 pages||manual||2003||abandoned 2015 (free product
don’t deserve updates)
|Car Models Timeline (Worldwide)||XLS||584||manual||2003||abandoned 2015 (free product
don’t deserve updates)
|Car Models List (Worldwide)||XLS||1545||4500+ rows||manual||2012||every 3-6 months (high sales)||$30-55|
|Car Nameplates List (Worldwide)||XLS||725||3300+ rows||manual||2016||every year (low sales)||about €30|
|Car Models Encyclopedia (Europe)||DOC||1680||360 pages||manual||2005||abandoned 2013 (low sales)||about €50|
|Car Models Database (Europe)||XLS||1346||3500+ rows||manual||2005||every 3-6 months (high sales)||$30-110|
|Car Models & Engines Database (Europe)||XLS||13532||20000+ rows||manual||2003||every 3-6 months (high sales)||$100-600|
|American Year-Make-Model||XLS||1803||16000+ rows, 5 columns||manual||2013||every year (high sales)||$10-80|
|American Year-Make-Model-Trim-Specs||XLS||104707||55000+ rows, 86 columns||scraping||2014||every 2 months (high sales)||$50-450|
|Second American Car Database||XLS||113716||55000+ rows, 230 columns||scraping||2017||every 3 months (high sales)||$50-550|
|German Car Database||XLSX||250178||120000+ rows, 206 columns||scraping||2015||every 3 months (high sales)||$60-600|
|United Kingdom Car Database||XLSX||30065||80000+ row, 52 columns||scraping||Jan 2019||every 3 months (high sales)||$70-400|
|India Car Database||XLS||13782||5000+ rows 196 columns||scraping||2015||every month (high sales)||$40-160|
|India Car Database||XLS||2675||1200+ rows, 98 columns||scraping||2016||by request (low sales)||$30-60|
|Middle East GCC Car Database||XLS||7607||12000+ rows, 24 columns||scraping||2016||every 3 months (high sales)||$20-140|
|UAE Car Database||XLS||7757||8000+ rows, 25 columns||scraping||Sep 2019||every 3 months (high sales)||$80-160|
|UAE Car Valuation||XLS||2883||26000+ rows, 8 columns||scraping||Apr 2018||every 3 months (high sales)||$240|
|Japan Car Database||XLSX||27878||90000+ rows, 55 columns||scraping||Apr 2019||every 3-6 months (low sales)||$180-450|
|China Car Database||XLS||30000 rows, 159 columns||scraping||Jun 2019||by request (low sales)||$150-300|
|Singapore Car Database||XLS||8312||4000+ rows, 78 columns||scraping||Nov 2018||by request (low sales)||$35-180|
|Malaysia Car Database||XLS||4839||2300+ rows, 125 columns||scraping||Sep 2018||by request (low sales)||$45-115|
|Indonesia Car Database||XLS||1774||700+ rows, 164 columns||scraping||May 2018||by request (low sales)||$35-70|
|South Africa Car Database||XLS||3286||1800+ rows, 111 columns||scraping||Jul 2019||by request (low sales)||$45-90|
|Australia Car Database||XLS||1041||14000+ rows||scraping||2017||every 3 months (high sales)||$10-70|
|Australia Car Database||XLSX||51223||90000+ rows||scraping||2017||every 3 months (high sales)||$40-500|
|Motorcycles Database||XLS||24569||30000+ rows, 73 columns||scraping||2016||every 3-6 months (high sales)||$30-300|
|AutoKatalog Statistics||XLS||212||manual||2012||2014 is the last AutoKatalog||free|
|Automobile Production by country||XLS||130||copy-paste||2012||every year in April||free|
|Automobile Sales Figures by make and model||XLS||833||scraping||Feb 2017||every year in February||$10-100|
|Wheels and Tires Size (Worldwide)||XLSX||34113||400000+ rows, 17 columns||scraping||Mar 2018||by request (low sales)||$300|
|Chip tuning / ECU remap database (Europe)||XLS||4408||7000+ rows, 23 columns||scraping||Feb 2018||by request (low sales)||$120|
|Light bulbs database (American)||XLS||21333||37000+ rows, 53 columns||scraping||Mar 2017||by request (low sales)||$300|
Computers & technology
|Excel colors with RGB values||XLSX||25||manual||2018||no||free|
|Calendar for any year 1900-9999||XLS||224||manual||2018||no||free|
|Screen size calculator||XLS||50.5||manual||May 2014||abandoned 2016||free|
|Screen resolution statistics||XLS||326||copy-paste||May 2014||abandoned 2016||free|
|Screen resolution statistics by country||XLS||2380||copy-paste||May 2014||abandoned 2016||free|
|Mobile phones database||XLS||8968||9000+ rows, 85 columns||scraping||Aug 2016||every month (high sales)||about $200|
|Digital cameras database||XLS||2805||3700+ rows, 47 columns||scraping||Dec 2017||by request (low sales)||about $60|
|Age of Empires||DOC||208||manual||2001?||no||free|
|Age of Empires||XLS||106||manual||2007?||no||free|
|Age of Empires Expansion||XLS||112||manual||2007?||no||free|
|Beetle Crazy Cup||XLS||71||manual||2002?||no||free|
|Driver Education (questions)||DOC||264||manual||2000, 2006||no||free|
|GTA Vice City||XLS||117||manual||2005||no||free|
|GTA San Andreas||XLS||134||manual||2007||no||free|
|Midtown Madness 2||XLS||35||manual||2005||no||free|
|Need For Speed: Hot Pursuit 2||XLS||58||manual||2003||no||free|
|Need For Speed: Porsche Unleashed||XLS||217||manual||2003-2006||no||free|
|Need For Speed: Underground||XLS||161||manual||2006||no||free|
|The Sims 2/3 list of houses made by me||XLS||24||manual||2013||no||free|
|The Sims 1 neighborhoods and lots||XLS||44||81 lots||manual||Oct 2018||no||free|
|The Sims 2 neighborhoods and lots||XLS||135||350 lots||manual||Mar 2019||no||$14|
|The Sims 3 worlds and lots||XLS||398||1845 lots||manual||2013||no||$23|
|The Sims 4 worlds and lots||XLS||manual||future||no|
|The Sims 1 career tracks||XLS||132||211 jobs||manual||2017||no||$5|
|The Sims 2 career tracks||XLS||212||325 jobs||manual||2012||no||$5|
|The Sims 3 career tracks||XLS||313||482 jobs||manual||Jan 2020||no||$10|
|The Sims 3 career tracks||XLS||284||371 jobs (to be continued)||manual||Apr 2020||no||$10|
|The Sims 1 list of items||XLS||421||1640 items||manual||Dec 2019||no||$32|
|The Sims 2 list of items||XLS||1039||4556 items||manual||2012||no||$44|
|The Sims 3 list of items||XLS||manual||future||no|
|Music Database||XLS||2200+||6000+ rows||manual||2005||when I download new songs||free|
|FIFA Word Cup matches||XLS||copy-paste||future|
|Stadiums Database||XLS||5360||3125 rows||scraping||2018||by request (low sales)||about $60|
FAQ: why real estate and car database cost money?
These databases are targeting commercial use, each year few hundreds companies, programmers and web developers, are paying me BIG $$ to ensure getting a complete vehicle database, frequently updated and with accurate technical specs, several customers offered me $1000+ to make a customized vehicle list. Hope you can do the same.
If you are a company, stay away from websites offering FREE databases, they are most likely stolen and posted online by other people than original author, or made by careless students who needed only a bunch of data for their studies, such databases does not cover all cars, have many errors, are outdated, no support or updates are provided. Using databases downloaded freely may destroy your company reputation.
Some databases I posted for free download (example: List of Singapore MRT and LRT stations) because data is simply copy-pasted from Wikipedia or other websites, anyone can do the same in few minutes, thus I do not think that anyone will pay me. After copy-pasting data, what I did was to add visual borders and colors.
Most gaming-related databases are free because they have NO USE for professionals, the only people who may find them useful are gamers who are unlikely to pay money.
How I came in data providing industry
I am born in a family of engineers (read more on About me), I use AutoCAD since 1998 and in 2008 I started designing buildings, many people encouraged me to pursue a career in architecture, but this turned to be a bad experience due to difficulties in convincing people to pay my services, many idiots who want house plans for free, and in the rare cases when I was paid, it was one-time payment per project. I had little contact with people in IT and nobody ever told me that these databases can be a gold mine.
Since childhood I love writing books, doing research, making databases and statistics. I started using computers in 1997, my dad taught me to use Word, but around 2003 I started using Excel more than Word. Analyzing data, making tables and charts about everything encountered in my life! For example, in racing games I measure the speed of each car and write the numbers in an Excel spreadsheet then make a chart.
The hobby for cars started in 1999, since 2003 I use Excel to create an all-cars database, independently from the internet world (I connected to internet in 2005), manually entering data from the very reliable AutoKatalog car magazines from Germany (as seen in above video), making an original compilation that you cannot find anywhere else online (except on websites that purchased from me).
In 2011 I published car database works for first time on my website, intending to share with other car hobbyists, but had the surprise to be visited by programmers, web designers and mobile app developers, working for various companies (car insurance, car parts shops, car shipping services, etc), realizing that I can make a business from this!
Had to do extensive transformation to make my hobby-made databases suitable for professional use. For example: some cells were merged horizontally and vertically, and Make | Model was originally on a single column, which pose no problem in reading with human eyes, but a computer program cannot read it correctly. As soon 1 customer bring this in attention, I eliminated merged cells and separated Make | Model to 2 columns.
Hong Kong Housing Database was also made in early 2011 from personal interest of analyzing public housing estates, few months later, an insurance company asked me if I can expand it to private housing estates. It became my FIRST large project made for a customer.
Seeing the success with car database, I decided to transform other hobby databases, such as HDB Database and World Cities Database, in a format suitable for machines, and publish on my website for sale to professionals.
A new era started in 2015: web scraping. “Scraping” usually means coding a bot that visit a list of given pages, copy specific data from each page and put it in an Excel / CSV file automatically, at rate of few pages per second. This allow me to create much larger databases with little effort, writing scraping code takes 10 min – 1 hour, and running it in background takes few hours or days to generate CSV with 1,000-100,000+ rows.
Previously I avoided to copy data from other websites, thinking that it means cheating, copyright issue, illegal way to collect data and creating non-original content, but most people have no issue with this, they do not care about buying someone’s original compilation and ask me specifically to create scripts that extract data from various websites and sell them CSV. Now I want to apologize to all owners of websites I scraped data from, but if I was rejecting these customers motivating that scraping is illegal, customers would go to other data mining companies or freelancers and obtained same data anyway.
Keeping all databases regularly updated takes is a huge workload. Scraping more websites, if they take too much time, will create additional workload and will delay everyone’s updates. As 2018 I decided to STOP updating databases having less than 5 sales per year so I can focus on the ~20 best-selling databases that produce 80% of my income.
So… unless you come with a GREAT IDEA of database that can be sold to multiple customers, I have the right to NOT do your web scraping project if it takes more than ~2 hours of manual work and more than ~50 hours of running scraper in background.
My hobby for writing started before having a computer, I was writing with pencil on notebooks.
We had a computer since 1995, and in late 1996 my dad taught me to use Microsoft Word, he even set me rules how to write a nice Word document: Arial font, body text with 12-14pt and justified, titles 16-20pt centered, bold and underlined. However, since my writings were not really a book, but a list of… something, following my dad rules created excessive bold and centered text. I personally added a system of numbering chapters, and since some of my works (example: list of rivers) had 5-level hierarchy, titles contained 22.214.171.124.5. title name, making document unaesthetic.
My dad told me to finish and print the work because “what is not finished have ZERO value“. I never understand why he wanted to print. Some projects like Car Database should NOT be printed, it cannot be “finished” because need constant updates with new launched cars. My parents promised me that will help me publishing a book, that turned sarcasm. They were against becoming a public figure (note: they may have wanted just to give me a solitary occupation to stop me disturbing them, instead of letting me to have a social life).
Exploring my computer, especially Readme.txt from various software and DOS games, made me fascinated in 1998 to use Notepad because of Fixedsys font, I made some writing where titles were enhanced with lines of —- or ==== of full width of screen. For a period in 2001-2004 I even wanted to impose monospace fonts for all my works.
Since 2003 I broke away from dad rules and started using Excel more than Word. New works in Word were optimized for on-screen display instead of printing, often using non-standard page sizes, to make exactly 1 page for each subject. I made body text with 10pt, titles 20pt and 15pt with full-width colored background, usually red and blue. New works in Excel had column headers with blue background and white text, but I gradually changed to coloring background of each group of columns in rainbow colors.
I created my first website in 2009, designing it similar with my Word documents: body text justified and titles centered with different background color. Some people criticized my website, saying that is looks being made by an expert in typography rather than by a web designer.
In 2015 I changed my Word files, removing full-width colored backgrounds of titles, putting instead full-width horizontal lines (somewhat similar with TXT files from MS-DOS era). This will be better for printing (even if I assume that nobody will print my works).
Example of styling in my books: 2012, 2013, 2015 editions of Car Models Encyclopedia
I also done some kind of competition between my projects, using an Excel size to keep track of file size, number of pages, rows and columns of each project. A race to create biggest Excel spreadsheet or biggest Word document, in terms of pages and file size, under certain standards (body text 10pt font, no duplicate stuff, no large open spaces, NO bullshit but useful content, etc).
Today, sales of Excel databases is my primary source of income. Word documents and books turned hard to sell, but I continue to write, just that I write blog articles instead of Word documents.
All databases made by me wear a signature: the very colorful Excel spreadsheets with borders in correct place (since 2013 I realized that majority of customers import data into MySQL instead of visualization in Excel, so the visual enhancements are useless), but they prove how much CARE I spend developing databases.