Index of databases made by me

I have made over 100 databases during my life and posted them on my website for everyone, from students practicing data analysis to multinational companies in the real estate and automobile industries.

Most projects I made out of personal interest, to distribute freely or to sell for professional use. Some projects I made at the request of a specific customer and then published on my website so that other customers can purchase them if interested. I also offer web scraping services, making custom databases according to your requirements.

Not included in the list below are databases made as one-time projects for single customers outside my fields of interest, a few of them being under non-disclosure agreements.

Project URL Type Size (KB) Pages / rows Method Made in Update frequency Price
World geography
Solar System (Word version, DELETED) DOC manual 2000 ? abandoned free
Solar System (Excel version) XLS 45 35 rows manual 2014 abandoned free
Solar System Articles DOC 1188 249 pages copy-paste 2019 no $20
Solar System Database XLS 217 224 rows, 42 columns copy-paste 2019 at new discoveries $22
World countries & facts (Word, based on old atlas, DELETED) DOC manual 1998-2000 abandoned free
World countries & facts (Excel, based on Encarta 2002) XLS 81 ~200 rows manual 2004 abandoned free
World countries & facts (based on The World Factbook) XLSX 2208 268 rows scraping 2017 by request (low sales) $36
World cities population (original DELETED) DOC 727 130 pages manual 2003-2005 deleted free
World cities population (Word simple) DOC 477 50 pages manual 2016 by request (low sales) $10
World cities population (Word detailed) DOC 781 150 pages manual 2016 by request (low sales) $20
World cities population (Excel detailed) XLS 1075 7500+ rows manual 2016 by request (low sales) $40
World tallest buildings database XLS 5425 15000+ rows scraping 2015 no (no sales) $150
World tallest buildings database XLS 91627 160000+ rows scraping 2016 no (no sales) $800
World tallest buildings database XLS 24112 30000+ rows, 117 columns scraping 2019 by request (low sales) $300
Airports & Airfields Database XLS 25648 55000+ rows, 20 columns scraping Aug 2019 by request (low sales) $110
Singapore real estate
Database of HDB Blocks XLS 6000+ 14000+ rows manual 2009 at new announcements $150-1200
Database of HDB Resale Flat Prices 2009-2013 XLS 24307 160000+ rows copy-paste 2009 no
Database of HDB Resale Flat Prices 1990-2019 XLSX 60000+ 700000+ rows copy-paste 2017 by request (low sales)
Database of condos SingaporeExpats XLS 2981 3100+ rows scraping 2015 by request (low sales) about $150
Database of condos PropertyGuru XLS 2334 3200+ rows scraping 2016 by request (low sales) about $300
Database of all buildings XLSX 17275 140000+ rows scraping 2017 by request (low sales) about $700
Managing Agent Database XLS 1185 4400+ rows scraping 2017 by request (low sales) about $200
HDB Median Resale Prices XLSX 291 1500+ rows copy-paste 2013 every 3 months $10
List of BTO & DBSS projects XLS 103 300+ rows manual 2009 at new announcements free
List of BTO prices XLS 183 700+ rows manual 2015 at new announcements about $30
List of SERS sites XLS 49 80 rows manual 2009 at new announcements free
List of HUDC estates XLS 64 24 rows manual 2009 at new announcements free
List of Executive Condominiums XLS 132 70+ rows manual 2009 at new announcements free
List of MRT and LRT stations XLS 84 227 + 43 rows copy-paste Nov 2019 at new announcements $27
Hong Kong real estate
Hong Kong Public & Private Housing Estates (Gohome and Centadata mixed up) XLS 592 1000+ rows manual 2011 abandoned about $200
Hong Kong Housing Database Estates (Centadata) XLS 806 3364 rows, 10 columns scraping Sep 2016 by request (low sales) about $70
Hong Kong Housing Database Buildings (Centadata) XLS 4093 24370 rows, 14 columns scraping May 2018 by request (low sales) about $500
Hong Kong Housing Authority XLS 420 476 rows, 36 columns scraping May 2018 by request (low sales) about $50
Romania geography
Geografia României DOC 215 23 pages manual 1999 2006 free
Drumuri naţionale DOC 70.5 6 pages manual 1999 2006 free
Căi ferate DOC 72.5 6 pages manual 1999 2006 free
Căi ferate detaliat DOC 347 51 pages manual 2000 2006 $20
Împărţirea administrativă interbelică DOC 166 11 pages manual 2006 2006 free
Împărţirea administrativă în regiuni şi raioane DOC 80 7 pages manual 1999 2006 free
Împărţirea administrativă în judeţe DOC 103 10 pages manual 1998 2006 free
Populaţia oraşelor şi judeţelor XLS 213 320 cities + 2861 communes manual 2004 2011 census $20
Building database – Ploieşti city XLSX ~1900 manual Jan 2018 constantly variable
Building database – expansion to other cities XLSX 5000+ and growing manual Aug 2019 constantly variable
Automobile research
Car Models List (Worldwide) DOC 403 59 pages manual 2003 abandoned 2015 (free products don’t deserve updates) free
Car Models Timeline (Worldwide) XLS 584 manual 2003 abandoned 2015 (free products don’t deserve updates) free
Car Models List (Worldwide) XLS 1545 4500+ rows manual 2012 every 3-6 months (high sales) $30-55
Car Nameplates List (Worldwide) XLS 725 3300+ rows manual 2016 every year (low sales) about €30
Car Models Encyclopedia (Europe) DOC 1680 360 pages manual 2005 abandoned 2013 (low sales) about €50
Car Models Database (Europe) XLS 1346 3500+ rows manual 2005 every 3-6 months (high sales) $30-110
Car Models & Engines Database (Europe) XLS 13532 20000+ rows manual 2003 every 3-6 months (high sales) $100-600
American Year-Make-Model XLS 1803 16000+ rows, 5 columns manual 2013 every year (high sales) $10-80
American Year-Make-Model-Trim-Specs XLS 104707 55000+ rows, 86 columns scraping 2014 every 2 months (high sales) $50-450
Second American Car Database XLS 113716 55000+ rows, 230 columns scraping 2017 every 3 months (high sales) $50-550
German Car Database XLSX 250178 120000+ rows, 206 columns scraping 2015 every 3 months (high sales) $60-600
United Kingdom Car Database XLSX 30065 80000+ rows, 52 columns scraping Jan 2019 every 3 months (high sales) $70-400
India Car Database XLS 13782 5000+ rows, 196 columns scraping 2015 every month (high sales) $40-160
India Car Database XLS 2675 1200+ rows, 98 columns scraping 2016 by request (low sales) $30-60
Middle East GCC Car Database XLS 7607 12000+ rows, 24 columns scraping 2016 every 3 months (high sales) $20-140
UAE Car Database XLS 7757 8000+ rows, 25 columns scraping Sep 2019 every 3 months (high sales) $80-160
UAE Car Valuation XLS 2883 26000+ rows, 8 columns scraping Apr 2018 every 3 months (high sales) $240
Japan Car Database XLSX 27878 90000+ rows, 55 columns scraping Apr 2019 every 3-6 months (low sales) $180-450
China Car Database XLS 30000 rows, 159 columns scraping Jun 2019 by request (low sales) $150-300
Singapore Car Database XLS 8312 4000+ rows, 78 columns scraping Nov 2018 by request (low sales) $35-180
Malaysia Car Database XLS 4839 2300+ rows, 125 columns scraping Sep 2018 by request (low sales) $45-115
Indonesia Car Database XLS 1774 700+ rows, 164 columns scraping May 2018 by request (low sales) $35-70
South Africa Car Database XLS 3286 1800+ rows, 111 columns scraping Jul 2019 by request (low sales) $45-90
Australia Car Database XLS 1041 14000+ rows scraping 2017 every 3 months (high sales) $10-70
Australia Car Database XLSX 51223 90000+ rows scraping 2017 every 3 months (high sales) $40-500
Motorcycles Database XLS 24569 30000+ rows, 73 columns scraping 2016 every 3-6 months (high sales) $30-300
AutoKatalog Statistics XLS 212 manual 2012 2014 is the last AutoKatalog free
Automobile Production by country XLS 130 copy-paste 2012 every year in April free
Automobile Sales Figures by make and model XLS 833 scraping Feb 2017 every year in February $10-100
Wheels and Tires Size (Worldwide) XLSX 34113 400000+ rows, 17 columns scraping Mar 2018 by request (low sales) $300
Chip tuning / ECU remap database (Europe) XLS 4408 7000+ rows, 23 columns scraping Feb 2018 by request (low sales) $120
Light bulbs database (American) XLS 21333 37000+ rows, 53 columns scraping Mar 2017 by request (low sales) $300
Computers & technology
Excel colors with RGB values XLSX 25 manual 2018 no free
Calendar for any year 1900-9999 XLS 224 manual 2018 no free
Screen size calculator XLS 50.5 manual May 2014 abandoned 2016 free
Screen resolution statistics XLS 326 copy-paste May 2014 abandoned 2016 free
Screen resolution statistics by country XLS 2380 copy-paste May 2014 abandoned 2016 free
Mobile phones database XLS 8968 9000+ rows, 85 columns scraping Aug 2016 every month (high sales) about $200
Digital cameras database XLS 2805 3700+ rows, 47 columns scraping Dec 2017 by request (low sales) about $60
Gaming stuff
Age of Empires DOC 208 manual 2001? no free
Age of Empires XLS 106 manual 2007? no free
Age of Empires Expansion XLS 112 manual 2007? no free
Beetle Crazy Cup XLS 71 manual 2002? no free
Driver Education (questions) DOC 264 manual 2000, 2006 no free
GTA Vice City XLS 117 manual 2005 no free
GTA San Andreas XLS 134 manual 2007 no free
Midtown Madness XLS 40 manual 2003 no free
Midtown Madness 2 XLS 35 manual 2005 no free
Need For Speed: Hot Pursuit 2 XLS 58 manual 2003 no free
Need For Speed: Porsche Unleashed XLS 217 manual 2003-2006 no free
Need For Speed: Underground XLS 161 manual 2006 no free
Quake 3 XLS manual 2000-2007 no free
The Sims 2/3 list of houses made by me XLS 24 manual 2013 no free
The Sims 1 neighborhoods and lots XLS 44 81 lots manual Oct 2018 no free
The Sims 2 neighborhoods and lots XLS 135 350 lots manual Mar 2019 no $14
The Sims 3 worlds and lots XLS 398 1845 lots manual 2013 no $23
The Sims 4 worlds and lots XLS manual future no
The Sims 1 career tracks XLS 132 211 jobs manual 2017 no $5
The Sims 2 career tracks XLS 212 325 jobs manual 2012 no $5
The Sims 3 career tracks XLS 313 482 jobs manual Jan 2020 no $10
The Sims 3 career tracks XLS 284 371 jobs (to be continued) manual Apr 2020 no $10
The Sims 1 list of items XLS 421 1640 items manual Dec 2019 no $32
The Sims 2 list of items XLS 1039 4556 items manual 2012 no $44
The Sims 3 list of items XLS manual future no
Miscellaneous works
Music Database XLS 2200+ 6000+ rows manual 2005 when I download new songs free
FIFA World Cup matches XLS copy-paste future
Stadiums Database XLS 5360 3125 rows scraping 2018 by request (low sales) about $60

FAQ: why do real estate and car databases cost money?

These databases target commercial use. Each year a few hundred companies, programmers and web developers pay me BIG $$ to be sure of getting a complete vehicle database, frequently updated and with accurate technical specs. Several customers have offered me $1000+ to make a customized vehicle list. I hope you can do the same.

If you are a company, stay away from websites offering FREE databases. They are most likely stolen and posted online by people other than the original author, or made by careless students who only needed a bunch of data for their studies. Such databases do not cover all cars, have many errors, are outdated, and come with no support or updates. Using freely downloaded databases may destroy your company's reputation.

Some databases I posted for free download (example: List of Singapore MRT and LRT stations) because the data is simply copy-pasted from Wikipedia or other websites. Anyone can do the same in a few minutes, so I do not think that anyone will pay me for it. After copy-pasting the data, all I did was add visual borders and colors.

Most gaming-related databases are free because they have NO USE for professionals. The only people who may find them useful are gamers, who are unlikely to pay money.

How I came into the data providing industry

I was born into a family of engineers (read more on About me). I have used AutoCAD since 1998, and in 2008 I started designing buildings. Many people encouraged me to pursue a career in architecture, but this turned out to be a bad experience: it was difficult to convince people to pay for my services, many idiots wanted house plans for free, and in the rare cases when I was paid, it was a one-time payment per project. I had little contact with people in IT, and nobody ever told me that these databases could be a gold mine.

Since childhood I have loved writing books, doing research, and making databases and statistics. I started using computers in 1997, when my dad taught me to use Word, but around 2003 I started using Excel more than Word, analyzing data and making tables and charts about everything I encountered in my life! For example, in racing games I measure the speed of each car, write the numbers in an Excel spreadsheet, then make a chart.

My hobby for cars started in 1999. Since 2003 I have used Excel to create an all-cars database, independently of the internet world (I connected to the internet in 2005), manually entering data from the very reliable AutoKatalog car magazines from Germany (as seen in the video above) and making an original compilation that you cannot find anywhere else online (except on websites that purchased it from me).

In 2011 I published my car database works on my website for the first time, intending to share them with other car hobbyists. To my surprise, I was visited by programmers, web designers and mobile app developers working for various companies (car insurance, car parts shops, car shipping services, etc.), and I realized that I could make a business from this!

I had to do an extensive transformation to make my hobby-made databases suitable for professional use. For example, some cells were merged horizontally and vertically, and Make | Model was originally in a single column. This poses no problem when reading with human eyes, but a computer program cannot read it correctly. As soon as one customer brought this to my attention, I eliminated the merged cells and separated Make | Model into 2 columns.
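To illustrate that kind of clean-up, here is a minimal Python sketch. It is not the original workflow (the databases were edited in Excel); the function name `flatten_rows`, the column names and the sample rows are made up for the example. It assumes that a vertically merged block arrives as one filled cell followed by blank cells, which is how merged cells typically look after export to CSV.

```python
def flatten_rows(rows, merged_columns=("Make | Model",),
                 combined="Make | Model", sep="|"):
    """Fill blanks left by merged cells, then split the combined column.

    rows: list of dicts, one per spreadsheet row (hypothetical layout).
    merged_columns: columns where a blank means "same as the cell above".
    """
    last_seen = {}
    flat = []
    for row in rows:
        filled = dict(row)
        for col in merged_columns:
            value = (row.get(col) or "").strip()
            if value:
                last_seen[col] = value
            # A blank cell under a merged block inherits the value above it.
            filled[col] = last_seen.get(col, "")
        # "Dacia | Logan" -> make "Dacia", model "Logan"
        make, _, model = filled.pop(combined).partition(sep)
        flat.append({"Make": make.strip(), "Model": model.strip(), **filled})
    return flat


# Made-up sample: the second row sat under a vertically merged cell.
sample = [
    {"Make | Model": "Dacia | Logan", "Year": "2004"},
    {"Make | Model": "", "Year": "2008"},
    {"Make | Model": "Dacia | Sandero", "Year": "2008"},
]
for record in flatten_rows(sample):
    print(record)
```

After this step every row carries its own Make and Model, so the file can be imported into a database without any knowledge of the original cell layout.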

The Hong Kong Housing Database was also made in early 2011, out of personal interest in analyzing public housing estates. A few months later, an insurance company asked me if I could expand it to private housing estates. It became my FIRST large project made for a customer.

Seeing the success of the car database, I decided to transform other hobby databases, such as the HDB Database and World Cities Database, into a format suitable for machines, and to publish them on my website for sale to professionals.

A new era started in 2015: web scraping. “Scraping” usually means coding a bot that visits a list of given pages, copies specific data from each page and puts it in an Excel / CSV file automatically, at a rate of a few pages per second. This allows me to create much larger databases with little effort: writing the scraping code takes 10 min – 1 hour, and running it in the background takes a few hours or days to generate a CSV with 1,000-100,000+ rows.
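The loop described above (visit each page, copy specific fields, append a row to a CSV) can be sketched with the Python standard library alone. This is only an illustration under assumptions, not the code used for these databases: the names `FieldParser` and `scrape`, the choice of the `h1` and `h2` tags as the "specific data", and the sample pages are all hypothetical. A real scraper would target whatever elements hold the data on the site in question.

```python
import csv
import io
import time
import urllib.request
from html.parser import HTMLParser


class FieldParser(HTMLParser):
    """Collects the text inside the first occurrence of each wanted tag."""

    def __init__(self, wanted_tags):
        super().__init__()
        self.wanted = set(wanted_tags)
        self.current = None
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        if tag in self.wanted and tag not in self.fields:
            self.current = tag
            self.fields[tag] = ""

    def handle_data(self, data):
        if self.current:
            self.fields[self.current] += data

    def handle_endtag(self, tag):
        if tag == self.current:
            self.current = None


def fetch(url):
    """Download one page as text (a real network call)."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def scrape(urls, out_file, fetch_page=fetch, tags=("h1", "h2"), delay=0.5):
    """Visit each URL, extract the wanted tags, write one CSV row per page."""
    writer = csv.writer(out_file)
    writer.writerow(["url", *tags])
    for url in urls:
        parser = FieldParser(tags)
        parser.feed(fetch_page(url))
        writer.writerow([url] + [parser.fields.get(t, "").strip() for t in tags])
        time.sleep(delay)  # pause between requests to limit the page rate


# Offline demo with made-up pages instead of real downloads.
fake_pages = {
    "page-1": "<html><h1>Dacia Logan</h1><h2>1.4 MPI</h2></html>",
    "page-2": "<html><h1>Dacia Sandero</h1></html>",
}
demo = io.StringIO()
scrape(list(fake_pages), demo, fetch_page=fake_pages.__getitem__, delay=0)
print(demo.getvalue())
```

Passing `fetch_page` as a parameter keeps the extraction logic testable without touching the network; for a real run, `scrape(urls, open("out.csv", "w", newline=""))` would use the default `fetch`.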

Previously I avoided copying data from other websites, thinking that it meant cheating, copyright issues, an illegal way to collect data, and creating non-original content. But most people have no issue with this: they do not care about buying someone’s original compilation, and they ask me specifically to create scripts that extract data from various websites and to sell them the CSV. I want to apologize to all owners of the websites I scraped data from, but if I had rejected these customers on the grounds that scraping is illegal, they would have gone to other data mining companies or freelancers and obtained the same data anyway.

Keeping all databases regularly updated is a huge workload. Scraping more websites, if they take too much time, will create additional workload and delay everyone’s updates. As of 2018 I decided to STOP updating databases that have fewer than 5 sales per year, so I can focus on the ~20 best-selling databases that produce 80% of my income.

So… unless you come with a GREAT IDEA for a database that can be sold to multiple customers, I have the right to NOT do your web scraping project if it takes more than ~2 hours of manual work and more than ~50 hours of running the scraper in the background.

Writing style

My hobby for writing started before I had a computer: I was writing with pencil in notebooks.

We had a computer since 1995, and in late 1996 my dad taught me to use Microsoft Word. He even set me rules for how to write a nice Word document: Arial font, body text at 12-14pt and justified, titles at 16-20pt, centered, bold and underlined. However, since my writings were not really books but lists of… something, following my dad’s rules created excessive bold and centered text. I personally added a system of numbering chapters, and since some of my works (example: list of rivers) had a 5-level hierarchy, titles looked like “1.2.3.4.5. title name”, making the document unaesthetic.

My dad told me to finish and print the work because “what is not finished has ZERO value”. I never understood why he wanted to print. Some projects, like the Car Database, should NOT be printed: they cannot be “finished” because they need constant updates with newly launched cars. My parents promised that they would help me publish a book, but that turned out to be sarcasm. They were against me becoming a public figure (note: they may have just wanted to give me a solitary occupation to stop me from disturbing them, instead of letting me have a social life).

Exploring my computer, especially the Readme.txt files from various software and DOS games, I became fascinated in 1998 with using Notepad because of the Fixedsys font. I made some writings where titles were enhanced with lines of ---- or ==== across the full width of the screen. For a period in 2001-2004 I even wanted to impose monospace fonts on all my works.

In 2003 I broke away from my dad’s rules and started using Excel more than Word. New works in Word were optimized for on-screen display instead of printing, often using non-standard page sizes to fit exactly 1 page for each subject. I made body text 10pt and titles 20pt and 15pt with a full-width colored background, usually red and blue. New works in Excel had column headers with a blue background and white text, but I gradually changed to coloring the background of each group of columns in rainbow colors.

I created my first website in 2009, designing it similarly to my Word documents: justified body text and centered titles with a different background color. Some people criticized my website, saying that it looks as if it was made by an expert in typography rather than by a web designer.

In 2015 I changed my Word files, removing the full-width colored backgrounds of titles and putting full-width horizontal lines instead (somewhat similar to TXT files from the MS-DOS era). This is better for printing (even if I assume that nobody will print my works).

Example of styling in my books: 2012, 2013, 2015 editions of Car Models Encyclopedia

I also ran a kind of competition between my projects, using an Excel spreadsheet to keep track of the file size and the number of pages, rows and columns of each project. It was a race to create the biggest Excel spreadsheet or the biggest Word document, in terms of pages and file size, under certain standards (body text in 10pt font, no duplicate stuff, no large open spaces, NO bullshit but useful content, etc.).

Today, sales of Excel databases are my primary source of income. Word documents and books turned out to be hard to sell, but I continue to write; I just write blog articles instead of Word documents.

Excel style

All databases made by me bear a signature: very colorful Excel spreadsheets with borders in the correct places. In 2013 I realized that the majority of customers import the data into MySQL instead of viewing it in Excel, so the visual enhancements are useless to them, but they prove how much CARE I put into developing the databases.

Screenshots: The BIG Car Database; Car Models Timeline; Year Make Model Specifications car database for American market; Mobile phones specifications database; Solar System Database (planets and satellites facts and figures); HDB Database (block number, street address, postal code, lease commencement date, number of units breakdown by flat type, upgrading programmes)

4 thoughts on “Index of databases made by me”

  1. Teoalida offers a fantastic package. It's hard to find a great database of vehicles and motorcycles for use on websites. We have used these databases on our global classifieds website
    Crackerclassifieds.com; because we are global, we needed a complete selection for our users.
    This database selection helps users to place their ads and also search relevant makes and models with ease.
    The support we received from Teoalida when our developer needed certain criteria was fantastic.
    We highly recommend the use of these databases, should you have the need.
    Great service, thank you so much.
