
About me & history of Car Database

Teoalida’s Car Database was born from a major hobby for automobiles and a pleasure in working with Excel and data analysis. It is one of MANY hobbies started during childhood and one of the FEW that turned into a business during adulthood.

This is mostly a one-man business. I (Teoalida) do the data entry, web scraping, database updates and website design, and I also handle customer support via chat every day. I employ other people only to code certain scrapers that are too complex for me.

After years of dedication I became the biggest data provider in the automobile industry, thanks to numerous car databases offered for various regions of the world, complete, accurate and frequently updated, plus other databases about geography, real estate and electronic products.

I was born into a family of engineers (read more on About me). I have used AutoCAD since 1998, and in 2008 I started designing buildings. Many people encouraged me to pursue a career in architecture, but this turned out to be a bad experience: it was difficult to convince people to pay for my services, many idiots wanted house plans for free, and in the rare cases when I was paid, it was a one-time payment per project. I had little contact with people in IT, and nobody ever told me that these databases could be a gold mine.

My hobby for cars started in 1999. Since 2003 I have used Excel to create an all-cars database, independently from the internet world (I connected to the internet in 2005), manually entering data from the very reliable AutoKatalog car magazines from Germany (as seen in the video above), making an original compilation that you cannot find anywhere else online (except on websites that purchased it from me).

In 2011 I published my work for the first time, intending to share it with other car hobbyists, but I was surprised to be visited by programmers, web designers and mobile app developers working for various companies (car insurance, car parts shops, car shipping services, etc.). I realized that I could make a business from this!

Due to poor website design and offering too much for free, plus a data structure optimized for on-screen reading rather than for programming, I did not make any sale until May 2012. I had to make many changes to make it appealing to this unexpected audience, both in data structure and in website presentation, including moving to WordPress in November 2012 to have an eCommerce plugin where people can make payments and receive databases automatically. This helped me to have more sales in each month of 2013 than in the whole of 2012. I also started updating databases regularly.

Most of these programmers have little experience with cars, and some make mistakes such as buying the wrong database (for example, buying an American car database when they do business in Europe) or buying from other data providers just because their database is cheaper or has more rows, ending up with bad-quality databases with many missing models or incorrect data.

A new era started in 2015: web scraping. Previously I avoided copying data from other websites, thinking that it meant cheating, copyright issues, an illegal way to collect data and creating non-original content. But most people have no issue with this: they do not care about buying someone’s original compilation, and they ask me specifically to create scripts that extract data from various websites and to sell them the CSV. I want to apologize to all owners of websites I scraped data from, but if I had rejected these customers on the grounds that scraping is illegal, they would have gone to other data mining companies or freelancers and obtained the same data anyway.

In 2015 I sold approximately 100 databases, enough to quit my job in AutoCAD and architectural design and dedicate my life to the data providing industry.

Over time I realized that web scraping is not as dangerous as I thought. From 2015 to 2018 I created 50+ additional databases, raising my income 5 times, including the best-selling car databases for America, India, Middle East and Australia, a motorcycles database, real estate databases, mobile phones, etc. During 2018 over 500 people paid me for databases.

Having extensive experience in the automobile industry and a few other fields, I carefully choose which websites to scrape data from, to maximize accuracy and data completeness rather than the largest number of car model variations, as other people may do.

All databases made by me wear a signature: very colorful Excel spreadsheets with borders in the correct place. In 2013 I realized that the majority of customers import data into MySQL instead of viewing it in Excel, so the visual enhancements are useless to them, but they prove how much CARE I put into developing databases.

Just look at these metrics comparing me with other car database providers
(you can check yourself on smallseotools.com)

Teoalida car database vs other data providers

Full story

My love for cars began in 1999: while walking to school I started counting car makes and the number of cars for each foreign make. My list contained about 50 makes. I omitted all American car makes as well as sports car manufacturers like Ferrari or Aston Martin, since these were never seen on Romanian roads.

I was curious how many car makes and models exist, so I started collecting information. I lacked sources of information: in the first years my parents rarely bought me car magazines, and car magazines show newly launched models rather than a worldwide list of car models including historical models, like an encyclopedia. A friend gave me Auto Catalog 1997 (a car magazine released once per year, showing about 120 makes and technical data for about 2000 worldwide model versions).

I also have an obsession with making tables and statistics about everything. Since I had a computer, knowledge of Microsoft Word and Excel, car magazines, and free time, I began writing my own automobile encyclopedia, to organize my knowledge about cars and fulfill my hobby of data analysis.

I started by writing in Notepad the list of car models (one TXT for each brand) sorted by manufacturer and car classes, with years of production (no technical data), to sort out the cars and understand which is the successor of which model. Later the TXT were combined in a single file Car Models List.DOC, additionally I made Car Models Timeline.XLS. Also I wrote a Car Body List for selected brands, because some successive models have different production years for each body style.

Early automobile encyclopedia made in Word

My dad taught me to use Microsoft Word and even set me rules for how to write a nice book: body text with 12pt font, titles with 16-20pt font, centered, bold, underlined, etc. Writing about cars was my idea (I was also writing a geography encyclopedia, as well as many other things).

My dad told me to finish my works and print them because “what is not finished has ZERO value”. I never understood why he wanted to print them… an automobile database is a thing that needs to be updated constantly and cannot be “finished” to be printed.

My parents encouraged my hobby, printed a few pages for me, and promised that some day they would help me publish a book, but this never happened. It is possible that they lied to me, intending to keep me BUSY at the computer to prevent me from disturbing their busy lives (instead of allowing me to make friends and be busy with a social life, as I wanted).

My access to information was limited, I was collecting data from all sources that came in my hand, sometimes copying articles word by word: AutoCatalog from Auto Motor si Sport and AutoSpecial from ADAC (copying technical data and model advantages/disadvantages), a Catalog of used cars (copying model history), monthly car magazines, etc. I wanted to include car photos too, but I had photos just with few cars in computer, and no digital photo camera to take photos.

The resulting encyclopedia was a MESS: no standard format across various car models, and impossible to make complete, because with limited access to information I did not know how many other engine versions of the same model existed. My car encyclopedia can be compared with Wikipedia (which also does not have a standard format across various car models), but mine had more tables than text, was printer-friendly and, most importantly, was 1-person work!

I wrote in this way about 80 car models on 200 pages, during 2 years (2-4 pages per model).

Download SAMPLE: Car encyclopedia made in childhood, what is your opinion? Probably a total MESS!

Since 2003 I broke away from what my dad taught me, for future works I used 10pt font, titles with full-width colored background (similar with the current theme of website), optimized for on-screen display rather than printing. Also started working more in Excel than in Word.

In the same period, 2001-2003, I also made some tables with formulas in Excel… like a case study for selected car models. I also made a database dedicated to sports cars, but today I do not see them as important enough to be worth maintaining.

I started in 2005 a new Word file, currently sold as Car Models Encyclopedia.

AutoKatalog, one of most reliable car publications in the world, is published yearly in Germany since 1957 and in other ~20 European countries during history, including Romania starting from 1996 under name AutoCatalog.

Started making automobile database in Excel in 2003

Probably the most important project was the Excel table with all car model variants, displaying only 7 technical specs fields and based only on AutoCatalog and AutoSpecial, for faster data entry and a clean look. It is currently sold as Car Models & Engines Database. It started in 2003 as an experiment with just 100 model versions, but in the school vacation of 2004 I decided to expand it to all cars. The first final version, launched in January 2005, had 3600 car models produced from the mid-1990s to present, but was still incomplete in the case of non-European brands or SUVs, as I was not interested in them.

I constantly updated it with new models (thanks to yearly releases of AutoCatalog) and also added older models (with fewer technical details) thanks to the internet connection and Wikipedia. It reached 6000 model versions from 50 brands in January 2007 and 7800 models from 57 brands in the 2009 edition, being a complete database for the European market.

Another project was Car Models Plus in both Word and Excel, started in 2005. Similar with earlier Car Body List, one entry per body style, plus some data about car dimensions and engine range.

The Word file, currently sold as Car Models Encyclopedia, had one paragraph per body style: a mini-encyclopedia compared with what I did in 2001-2003, with just a FEW details about each car, written in a standard format. It was later turned into one paragraph per model covering all car body styles.

It had an Excel equivalent, currently sold as Car Models Database, an Excel table with one row per body style, to compare cars over time.

Considering them less important, I did not produce any final complete version until 2010, when they reached about 500 car models produced in Europe. I added foreign imports in Europe in 2012, and at the end of the year the Car Models Encyclopedia reached 1200+ models on 160 pages, 4-10 models per page. Then I realized that nobody was buying Word documents, so I put future updates of Car Models Encyclopedia ON HOLD, concentrating on Car Models Database in Excel only.

I connected to the Internet in 2005

I gained access to Wikipedia, and this allowed me to improve the car databases, especially by extending the historical coverage of Car Models List & Timeline back towards World War II.

For a short moment I resumed working at the encyclopedia in Word, by adding older cars, this time getting inspired from Wikipedia, translating articles into Romanian, often writing with my own words and not direct translation. Shortly I abandoned that encyclopedia along with 80% of projects started in childhood, considering them useless as better products are available on internet. But I kept maintaining the car databases in Excel, as they had NO equivalent on internet.

Local friends praised my work (although some blamed me that I waste my life doing useless things) and one said that people are willing to pay money for such things and asked me why I do not post the car encyclopedia on internet?

Publishing on website in 2011 and doing first sales in 2012

In 2009 I created my personal website http://teoalida.webs.com/ to publish my works, primarily from my other hobby, AutoCAD design (2D and 3D models of buildings and cars).

In November 2010 I did a FULL update for 6 of the car database files and added them to my personal-portfolio website, under URL http://teoalida.webs.com/cardatabase.htm. I translated all Romanian into English (took just 10 min to find/replace all car terms in Car Models Encyclopedia, but rewriting introduction for every file took few hours).

I had never sold anything on the internet (for money), but at the same time I did not want to give Car Models & Engines Database away for free, so I offered only a sample, being undecided whether to sell and how to sell. I was waiting for some feedback.

A failed start:

  • I had huge knowledge about cars but zero knowledge about marketing.
  • I did not know what kind of audience my website had, and what database formats customers would be looking for.
  • I believed that my products had the same target as car magazines: buyers of new and used cars, plus hobbyists studying the automobile industry like me. Surprisingly, I was contacted by companies looking for a vehicle database for commercial use.
  • I did not get any feedback for about 1 year, and I became worried that I had made a very BIG USELESS database and that it was better to abandon the hobby.

The first feedback was like this: “not what I needed though keep up good work”… but what did they need?
Only after discovering software to chat with website visitors in January 2012 did I start getting significant feedback. I realized that the database was GOOD, but the presentation was BAD. While the rest of my website is aimed at architecture students and house builders (web amateurs), the Car Database page was visited by highly skilled people, company managers and web designers, who probably did not like my cheap-looking website hosted for free under a subdomain.

Mistakes done until April 2012:

  • Presentation like a BOOK, with author info on the first page. The Excel spreadsheets were saved with the Legend & About me sheet open, so people had to do extra clicks to see the Database sheet and realize that there IS a vehicle database; they may have been lazy and thought that there were just biographies.
  • No prices were displayed, and no BUY NOW or FOR SALE buttons, so no wonder people did not understand whether I was selling or what the hell this website was…
  • No preview screenshots, just a text-only page. The bounce rate was 75%; after adding a screenshot it dropped to 40%.
  • Offering too much for free: 5 files were offered complete and 1 file, “The BIG Car Database”, was offered as a one-brand sample, while I waited for someone to contact me and negotiate a price… 2-3 people contacted me, but I refused the deal once they said they were buying “for their websites”. I wasn’t sure what they were doing; I thought they would publish my Excel file on their website, so I refused to sell them the complete database.
  • Most customers weren’t looking for that big database with engines, but for car external dimensions, which was offered FREE but contained only cars produced in Europe, as it was made for researching car evolution in Europe, instead of being a complete list of all cars sold (imported) in the European market. Other customers were looking for a simple make/model list in XLS or CSV for web development, which was offered FREE but in the wrong format: DOC optimized for reading/printing.

So, in April 2012 I updated the European car dimensions database with worldwide cars and put it on SALE for a small price; after several days I made the first sale, to an insurance company. From that moment I started the constant updating…

During 2012, almost every buyer praised the database but criticized website presentation and gave me suggestions.

A successful business, decisive changes done in November 2012:

  • I moved to paid hosting and domain www.teoalida.com.
  • WordPress platform and Simple eCommerce plugin that allow people to pay and download digital products automatically.
  • Car Models List .DOC was converted to XLS and put up for sale instead of being given away free, becoming the best-seller (I was surprised; I did not know that people are willing to pay even for a DATAbase without (technical) DATA).
  • I bought a collection of German AutoKatalog 1991-2013 from eBay (I paid for them with my own money made from several sales during six months of 2012)… I updated the database with new models and also backward with classic cars, making the most comprehensive database of cars produced in the last 20 years…
    I also made new databases, custom-made for individual customers, and the number of customers increased about 4 times.

Website redesigned in March 2013 further raised sales:

  • In late 2012 I googled for the first time and found that there are several other websites selling car databases, and I got inspired to design my website in a similar manner.
  • Installed a separate WordPress for the Car Database section (URL remained the same), displaying on the first page the products in a grid layout with prices indicated, with “Read more” buttons going to individual pages for each database, plus a FAQ section. This replaced the previous single page with many rows of text and links to sample files followed by “go in Store to buy full databases” (you can still see the old website design at http://teoalida.webs.com/cardatabase.htm).

Over time I was facing (and I am still facing) the problem of insufficient feedback… LAZY people who, if they do not like a detail in the database, go away instead of contacting me and requesting a change (probably many people think that I am just selling databases “as is” instead of being the developer of the database and thus ready to make changes).

For example: Make | Model was originally in a single column, which poses no problem for a reader, but a programmer may not like it. As soon as 1 customer brought this to my attention, I separated Make | Model into 2 columns in December 2012. These changes helped me to have more sales per month in 2013 than in the entire year of 2012! While the first customers purchased it for reading in Excel, in 2013 about half of customers were web designers buying it for importing as CSV or SQL. By the way, when I moved to WordPress in 2012 I heard about MySQL for the first time and realized that not all websites are made of static HTML, as I had believed.
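For programmers curious what such a split involves, here is a minimal Python sketch. It is not the method actually used for this database (that work was done by hand in Excel); the function name and the short list of makes are assumptions made up for illustration. The point it shows is that multi-word makes mean you cannot simply split at the first space.

```python
# Hypothetical list of makes; a real list would cover every brand in the database.
KNOWN_MAKES = ["Alfa Romeo", "Aston Martin", "Audi", "BMW", "Land Rover"]


def split_make_model(full_name, makes=KNOWN_MAKES):
    """Split a combined name such as 'Alfa Romeo 156' into (make, model)."""
    # Try longer makes first so a multi-word make wins over a shorter prefix.
    for make in sorted(makes, key=len, reverse=True):
        if full_name == make or full_name.startswith(make + " "):
            return make, full_name[len(make):].strip()
    # Unknown make: fall back to treating the first word as the make.
    make, _, model = full_name.partition(" ")
    return make, model
```

With Make and Model in separate columns, a web developer can fill a Make drop-down first and then filter the Model drop-down by the selected make.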

Expansion beyond European market – 2013

Car Models List included US automobile manufacturers, but did not include foreign cars imported into America; for Japanese manufacturers it included only models sold in Europe.

In January 2013, after a customer asked me for a database of cars sold in the United States, with each year on a separate row, I did extensive research on Wikipedia and expanded my Car Models List, making it complete for the American market, with the exact Model Years each car was sold in the US.
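The "each year on a separate row" layout can be sketched in Python as follows. This is only an illustration of the idea, assuming the source list stores one row per model with a first and last model year; the function name and the placeholder values are hypothetical, not data from the database.

```python
def expand_model_years(models):
    """Turn (make, model, first_year, last_year) rows into one (year, make, model) row per year."""
    rows = []
    for make, model, first_year, last_year in models:
        # range() excludes its end value, so add 1 to include the last model year.
        for year in range(first_year, last_year + 1):
            rows.append((year, make, model))
    return rows


# Placeholder values, for illustration only.
sample = [("MakeA", "ModelX", 2010, 2012)]
year_rows = expand_model_years(sample)
```

This Year-Make-Model shape is what a website needs for three chained drop-down lists.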

The next project was to create a database of technical specs. I did a small experiment sourcing data from the Edmunds website and entering it manually, but it was overkill to do this for all 40,000+ car models. Salvation came from a customer who bought the Singapore HDB Resale Flat Prices Database: he was a programmer, so I asked him to create a scraping script for the Edmunds website for FREE, in exchange for getting the database he purchased for FREE (I refunded his purchase). However, Edmunds is redesigned often, making his script obsolete.

Additional improvements that increased sale volume:

  • Changed plugin from Simple eCommerce to Easy Digital Downloads (May 2015) and added multiple price options to purchase smaller chunks, such as 2000-present at a lower price, for people who do not need older cars (1945-present).
  • Removed product prices from the home page and rearranged products (2015). Instead of putting the simplest and cheapest databases at the top, I put the most detailed databases of Europe, America and India on the 1st row. People had to click “more details” or the image in order to see prices. While the number of sales did not grow, monthly earnings grew because people purchased more expensive databases.
  • Wrote the list of makes and the list of columns included in each database (2015-2016), for people who are too lazy to check this inside the sample files (and also as keywords for Google).
  • Added the option to pay with credit card via 2checkout.com besides PayPal (September 2016).
  • Removed editing protection from sample Excel files (2016-2017). When you open a protected file in Microsoft Excel, you cannot edit it unless you go to Tools > Unprotect sheet and enter a password. But some non-Microsoft office software asks the user to enter a password as soon as the file is opened, or to click “open read-only”, which confuses visitors so they don’t open it at all.

A new era: web scraping services – 2015

A new era started in August 2015 when I discovered www.import.io, a data scraping software which (at that time) was free (accounts made after April 2016 are limited to 500 pages unless you buy a paid plan). Although the software was free and theoretically you can scrape simple websites yourself, most people don’t have the knowledge, or the time to learn how to use it, so I am here to help you.

The first database made via web scraping was India Car Database; it took about 1 week to figure out how to create it using import.io. The funny thing: one of the 3 Indians who contacted me during that week, and whom I informed about what I was doing, thought that I created the Indian database because of his request, so he asked me to make additional databases of bikes and other vehicles too. For the moment I refused, because I did not want to waste time on projects outside the car field.

In November 2015 I recreated American Year-Make-Model-Trim-Specs in a way that allows me to update it myself, using import.io to extract car specs from each URL, although this requires manually compiling the list of URLs of all cars. I also created other databases: Singapore Condo Database (source: SingaporeExpats.com) and Skyscrapers Database (source: Emporis.com).

Moreover, in November 2015 I teamed up with a programming student from Pakistan to create scraping software for websites that are too complex for import.io.

Once I mastered scraping skills in 2016, I decided to offer freelance web scraping services. This allows me to serve people asking for specific data that I do not have, by quickly creating a new custom database with data scraped from a website chosen by mutual agreement, even if just 1 person demands that particular data. Once created, I publish the database on my website so that future customers can purchase it ready-made (unless the customer asked for a non-disclosure agreement).

Import.io turned into a paid service in April 2016 and suspended my free account in September 2016. I was ready to buy a paid plan, although it was quite expensive. This gave the Pakistani programmer the idea to create our own universal scraping software in VB.NET for my own use, which turned out to be faster and better than import.io, able to scrape multiple websites at the same time without slowing each other down, allowing me to create even more databases and update them more often.

 

Other automobile data providers on internet do not even specify source of data, never mind of author info.
I am not sure if this About Me story will make you more willing to purchase a car database from me, or not, but…

…what is your opinion? Leave comments!

The BIGGEST Vehicle Database

This database is an original compilation “Made by Teoalida”, and it is the biggest of all hand-made databases (NO web scraping). You will not find any equivalent data on internet (except on websites that purchased database from me). However the recent databases made via web scraping have even more rows: Germany and United Kingdom databases.

Download SAMPLE:
Car models, engines, transmission, drivetrain, tires, dimensions, weight, performance, etc (Excel)
Alternate formats: CSV and SQL

Buy FULL database + FREE updates for one year:

Price too high? Need less data? You can buy other packages (having SAME number of rows) such as Car Models, Engines (9 columns, 60% discount), Car Models, Engines, Dimensions, Performance (27 columns, 40% discount), or tell me what fields do you need, I will sell part of data at negotiable price lower than the price of complete database.

Do you need even more details, rows for each trim / equipment level? see German car database

Last release: 6 May 2019, including 564 versions launched in 2018 and 190 versions launched in 2019. Full change log.

Coverage

Cars sold in Europe. The database, being based on AutoKatalog (a German car magazine), has 100% coverage for cars sold in Germany, which is the biggest car market in Europe. 99% of cars sold in any European country are also sold in Germany. I also added unique cars sold only in the UK, communist-era cars sold only in the Eastern Bloc, as well as special versions with 2.0 liter engines made to comply with the Italian tax system.

If you are outside Europe, you can check list of car models included in Excel file and report missing models (sold in your country but not in Europe) so I can add their data if available, but I cannot guarantee that I can add all.

Description & history

I started this automobile database in 2003, from personal interest. In late 2003 I standardized the table at 11 columns of technical specs and from December 2003 to April 2004 I added about 2800 model versions.

I sourced data from AutoCatalog 1997, 1999, 2001, 2002, 2003, 2004 (from Auto Motor & Sport magazine), and AutoSpecial 2000, 2001, 2002 (from Burda magazine). In October 2004 I also bought AutoCatalog 2005 and started to update my database. So the database initially included most cars produced from the mid-1990s to present. I launched my first FINAL edition in January 2005; it had 3600 model versions, of which 3000 were from Europe.

AutoKatalog provides 40+ columns of data, but personally I was not interested in so much data, and my 1024×768 screen limited what I could do. The 11-column format was inspired by these auto magazines, which display 7 columns (cylinders, displacement, power, torque, 0-100 km/h, top speed, fuel consumption) and a separate table for car dimensions. The additional 4 columns (body, doors, engine placement, drivetrain) were added from my own research interest.

I updated my database constantly, after buying AutoCatalog 2006, 2007, 2008 and 2009 each year. The 2008 edition had 6000 model versions, of which 4800 were from Europe. Sadly, AutoCatalog has not been published in Romania since 2009, leaving the database on hiatus. I did a small update in 2010 and also added a few pre-1990 cars with data sourced from Wikipedia and other websites; for BMW, Lamborghini, Mercedes and Porsche I was able to find sufficient data to extend my database back to WW2, reaching 7800 models from 57 brands.

In 2011 I published previews of my car databases on my personal website, and in 2012 I realized that I could make a business by selling them to various companies and web developers.

In November 2012 I found on eBay a collection of German AutoKatalog for 1991-2013 and revived the car database from hiatus. I had to add the last 4 years of cars and reached over 10700 models. Adding more early-1990s models and improving the models initially sourced from Wikipedia raised the database to 12000 models (as of January 2013) from 78 brands, covering 1990-present.

After people started contacting me, I had to make a few changes to make the automobile database suitable for the unexpected audience. For example, I added 2 new columns, Make and Model, before the full model name (necessary for web developers making drop-down lists), added fuel type (suggested by a customer who wanted to filter cars by fuel), added number of seats, etc.

In February 2013 I bought more German AutoKatalog issues from eBay, for 1983-1990, planning to include in my database all cars launched after 1983 or produced until at least 1990. I kept buying AutoKatalogs until 2014, completing the collection for 1970-2014 (45 books). In 2014 the database had over 14000 models from 80+ brands, with starting years varying from 1970 to 1990 depending on the car make.

The vehicle database proved insufficient for certain customers who were demanding more data per car. Car tire dimensions in particular were demanded by numerous customers (tire shops), while several customers wanted to combine Car Models & Engines Database with Car Models Database (to have both engines and car dimensions). I expanded the vehicle database from 14 to 22 columns, quickly adding the car dimensions from Car Models Database, and started a slow process of adding car tire sizes. But new customers demanded additional data such as transmission, so I had to go through every car again.

In May 2014 I bought a 24″ 1920×1200 monitor to replace the 17″ crap, and this encouraged me to create bigger databases. However, I was questioning whether it would be worth spending 500+ hours to include 40+ columns of data from the AutoKatalog books. I posted a message on the site inviting people to contact me and make a price offer if they were interested in expanding the database to 45+ columns. By the end of 2014, a single person had shown interest.

In 2014 I halted adding classic cars and started expanding the automobile database to 45+ columns in my spare time. When I launched the expanded database for sale in October 2015, with 99% completion for 1990-present, I was surprised to see the high number of people willing to pay ~500 euro for such a great database. But most people do not need so much data, and the 7-column database (60% cheaper) remains the best-selling one. The 1990-present package is the best-selling; for this reason, expanding the 1970-1990 cars to 45 columns was not a priority.

Of course certain customers demand new specialty data that is not available in AutoKatalog and sometimes nowhere. I cannot satisfy everyone.

AutoKatalog magazine was surprisingly discontinued in 2014, so starting from 2015 I have been sourcing new car data from ADAC Datenbank. ADAC has a separate row for each transmission, trim and equipment level, and thus too many rows for each engine. Every few months I scrape the ADAC website, then use my personal knowledge to select one row per unique engine and copy-paste the data into my database, to match the format of the former AutoKatalog. This keeps my car database an original product and not a copy of any existing website, thus you can use it for your website without worrying about a copyright claim from another website.
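The "one row per unique engine" step is done manually, using judgement about which trim best represents each engine. As a rough automated approximation only, the Python sketch below keeps the first row seen for each engine. The field names (`model`, `engine`, `power_kw`, `trim`) are assumptions for illustration and are not the actual ADAC columns.

```python
def one_row_per_engine(rows, key_fields=("model", "engine", "power_kw")):
    """Keep the first row for each distinct engine, dropping extra trim rows."""
    seen = set()
    kept = []
    for row in rows:
        # Rows that agree on all key fields are treated as the same engine.
        key = tuple(row[field] for field in key_fields)
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


# Placeholder rows, for illustration only.
trim_rows = [
    {"model": "ModelX", "engine": "1.6 petrol", "power_kw": 85, "trim": "Base"},
    {"model": "ModelX", "engine": "1.6 petrol", "power_kw": 85, "trim": "Comfort"},
    {"model": "ModelX", "engine": "2.0 diesel", "power_kw": 110, "trim": "Base"},
]
engine_rows = one_row_per_engine(trim_rows)
```

Keeping the first row is the simplest rule; a manual selection may instead prefer, for example, the base trim or the manual transmission, which is why this is only an approximation.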

In 2016 I resumed backward expansion and reached 99% completion for 1986-present. In summer 2017 I spent another 200 hours adding 1200 additional cars from the 1975-1986 period, and together with updates with new cars, the database exceeded 20000 model versions. In the next phase I will add 1970-1975 cars too. The cars from 1970-1986 added to the database prior to 2014 remain in the 22-column format, waiting to be expanded to 45+ columns in my spare time.

The ultimate database will have 99% completion for 1970-present. That is almost 50 years of history of automobile!

Years accuracy notes: AutoKatalog DOES NOT provide years of production for each engine version, so the years shown in the databases are not sourced but rather generated based on which yearly issues of AutoKatalog each car appears in. Model production years are sourced from Wikipedia. AutoKatalog is published each year in September; it sometimes shows car models that will be launched 1-2 months later, but not always. For example: the Audi A4 was launched in November 1994, after the publication of AutoKatalog 1995 in September 1994, and was included for the first time in AutoKatalog 1996, so my database shows 1994-2001 for the main model and 1995-XXXX for engine versions. At that time, without being aware of web scraping technology, this was the BEST I could do. I hope that this is not an issue for you.

Some customers gave me suggestions or additional requests:

  • Add a unique ID that facilitates integrating updates into an existing database (for people who integrate my database with their own data and so cannot simply delete the old database and upload the new one).
  • Correct inaccurate years and add month of production.
  • Add engine codes.
  • Add car images.
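
To show why customers ask for a unique ID, here is a minimal sketch of merging an update into a customer's existing copy. It assumes a hypothetical `id` field on every row (the manually made database does not have one yet), and the sample rows and the `my_price` column are purely illustrative:

```python
def merge_update(existing, update):
    """Merge updated rows into existing rows, matching on a hypothetical 'id'.

    Values from the update overwrite the old ones, while extra columns that
    the customer added to their own copy are preserved.
    """
    merged = {row["id"]: dict(row) for row in existing}
    for row in update:
        merged.setdefault(row["id"], {}).update(row)
    return sorted(merged.values(), key=lambda r: r["id"])


# Illustrative data only: 'my_price' stands for a customer's own column.
existing = [
    {"id": 1, "make": "Audi", "model": "A4", "years": "1995-XXXX", "my_price": 100},
]
update = [
    {"id": 1, "make": "Audi", "model": "A4", "years": "1994-2001"},
    {"id": 2, "make": "Audi", "model": "A6", "years": "1994-1997"},
]
rows = merge_update(existing, update)
```

Without such an ID, the same merge would have to match on make, model and version text, which breaks whenever a name is corrected between releases.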

The German Database and UK Database, made via web scraping, already have unique identifiers, production periods with month/year and car images, but not engine codes. Recompiling these things into the format of Car Models & Engines Database would take additional manual work, and the number of sales may not increase.

I started web scraping in 2015, and by 2017 I realized that most people do not care about buying someone’s original manual work and prefer buying databases scraped from other websites. Some people ask me specifically to scrape data from a certain website, despite potential copyright issues.

For this reason I decided to create additional databases for other continents, to no longer invest additional effort in expanding this manually made European database, and to reduce its update frequency. Each update takes about 20-30 hours of manual work to add new cars, not counting corrections for old cars. For comparison, the UK Database takes about 24 hours of running scraping software in the background and less than 1 hour of manual work, and all old records are updated, giving me a higher revenue per hour of work than manually updating Car Models & Engines Database. Despite this, Car Models & Engines Database remained the second best-selling database in 2018, after American Year-Make-Model-Trim-Specs, probably because the German and UK ones have too many rows for each engine (a separate row for each trim / equipment level), which most people do not need.

A possible solution is to strip down the German database and leave only 1 row per engine. Your suggestions are welcome!

Source of data

AutoKatalog

Primarily AutoKatalog, the German car magazine published every year in September since 1957, one of the most reputable car publications in the world, and ADAC. Each year AutoKatalog shows the cars currently in production, with no years of production indicated. The years in my database are generated from the issues the car appears in; for example, if a car appears in AK 2000, 2001 and 2002, I indicate the years 1999-2002.
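
The rule for generating years can be written down as a short sketch. The function name is hypothetical; the only assumption is the one stated above, that issue N is published in September of year N-1:

```python
def years_from_issues(issues):
    """Derive a production-year range from the AutoKatalog issues a car appears in.

    Issue N is published in September of year N-1, so the start year is taken
    as (first issue - 1) and the end year as the last issue.
    """
    if not issues:
        raise ValueError("car does not appear in any issue")
    first, last = min(issues), max(issues)
    return f"{first - 1}-{last}"


print(years_from_issues([2000, 2001, 2002]))  # 1999-2002
```

This is why a car launched shortly after an issue went to print can show a start year one year later than its real launch.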

The original database (2003-2014): initially it included only the data I was personally interested in, engine and performance data. During 2005-2006 I decided when to make a new row by checking performance data, which sometimes varies from year to year even though the engine is the same, thus causing some duplicates in engine data.

The current database (2014-present): I made separate rows for each yearly issue of AutoKatalog, but to save time, in the first stage I filled in only the rows for the first year the car appears in AutoKatalog (or the second year, if the first year looks like pre-launch data). There are 2 columns of production years, which I filled in for just one of the rows among the yearly issues; I then filter the database by deleting rows with empty production years, producing a final database with only one row per engine-body-drivetrain combination, with no duplicates, ignoring the variations in performance data.
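
The filtering step is done in Excel, but the idea can be sketched in a few lines. The column names `year_from` and `year_to` are assumptions standing in for the 2 production-year columns, and the rows are illustrative:

```python
def keep_filled_rows(rows):
    """Drop rows where either production-year column is empty."""
    return [r for r in rows if r["year_from"] and r["year_to"]]


# One row per yearly issue; only one of them carries the production years.
rows = [
    {"issue": 2000, "version": "1.8", "year_from": "1999", "year_to": "2002"},
    {"issue": 2001, "version": "1.8", "year_from": "", "year_to": ""},
    {"issue": 2002, "version": "1.8", "year_from": "", "year_to": ""},
]
final = keep_filled_rows(rows)
```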

Data accuracy notes

AutoKatalog itself does have errors, but with my long experience in cars and data analysis, I have spotted numerous errors and entered correct data. However, some data may still be wrong because I do not know what the correct values would be.

The most important data, such as engine and dimensions, are 99.99% accurate. The following data are less accurate: performance data, where there are often variations such as +/- 0.1 litres in fuel consumption between yearly issues of AutoKatalog; suspension type, where it sometimes indicates double wishbone in one year and multilink in another, and I cannot know which is correct; and tire size (especially the speed index, which differs between yearly issues of AutoKatalog).

Do note that nobody is perfect and 100% accuracy is impossible, but I believe that my hand-made car database is the most accurate vehicle database that has ever existed on the internet, judging by the comments from my customers who previously bought car databases from other sources, which appear to be crawled using automatic software, include junk data, and are never checked for errors by a human.

If you want to cross-check my accuracy, I advise you to check against German websites. Expect small differences for certain cars when checking against websites from other countries, due to the bad sources of data that most websites use.

List of car makes included

Mainstream makes: AC Cars, Alfa Romeo, Alpine, ARO, Asia Motors, Aston Martin, Audi, Austin / Rover, Autobianchi, Bentley, BMW, British Leyland (Austin, MG, Mini, Morris, Riley, Rover, Triumph, Vanden Plas, Wolseley), Buick, Cadillac, Chevrolet USA, Chrysler USA, Chrysler Europe / Simca / Talbot (France) / Sunbeam / Hillman / Talbot (UK), Citroën, Dacia, Daewoo / Chevrolet Korea, Daihatsu, De Tomaso, DeLorean, Dodge, Ferrari, Fiat, Ford Europe, FSO, GAZ (Volga), Honda, Hummer, Hyundai, IFA (Trabant, Wartburg), Infiniti, Innocenti, ISO, Isuzu, Iveco, Izh, Jaguar, Jeep, Kia, Lada, Lamborghini, Lancia, Land Rover, Lexus, Lotus, Maserati, Maybach, Mazda, Mercedes, Mini, Mitsubishi, Morgan, Moskwitch, Nissan, Oltcit, Opel / Vauxhall, Otosan, Peugeot, Pontiac, Porsche, Proton, Reliant, Renault, Rolls-Royce, Saab, Samsung, Santana, Seat, Skoda, Smart, SsangYong, Subaru, Suzuki, Tatra, Tesla, Tofas, Toyota, TVR, UAZ, Volkswagen, Volvo, Zastava, ZAZ.

Bonus minor makes: Artega, Bitter, BMW Alpina, Bristol, Bugatti, Caterham, Cizeta, Donkervoort, Felber, Fisker, Ginetta, GTA Spano, Gumpert, Isdera, Jensen, Koenigsegg, LuAZ, Lynx, McLaren, Messerschmitt, Monica, Monteverdi, Pagani, Panther, Saleen, Venturi, Wiesmann, ZIL (producing less than 100 cars per year).

List as of September 2017. Additional minor makes may be added in the future.

Data fields included

The database contains over 60 columns, listed here together with the period for which each includes data. Completion is nearly 99% for every column for the period covered.

Naming: Make, Model, Version, Years, Source of data, Sold in, Class.

Body data: Body type, No. of doors, No. of seats, Engine place, Drivetrain.

Engine data: Cylinders (all), Displacement (all), Power kW (all), Power PS (all), Power rpm (all), Torque Nm (all), Torque rpm (all), Bore × Stroke (pre-2013), Compression ratio (pre-2013), Valves per cylinder (all), Crankshaft (pre-2013), Carburetor (pre-1986), Fuel injection (1986-present), Supercharger (1986-present), Catalytic (1986-1993), Oil capacity (pre-1993), No. of gears manual (all), No. of gears automatic (all), Final ratio (pre-1993).

Drivetrain data: Suspension front (all), Suspension rear (all), Assisted steering (all), Turning circle (2013-present), Brakes front (all), Brakes rear (all), ABS (1986-2013), ESP (2000-2013), Tire size (all), Tire size rear (if different than front).

Dimensions and weight data: Wheel base (all), Track front (pre-2013), Track rear (pre-2013), Length (all), Width (all), Height (all), Curb weight (all), Gross weight (all), Load (2013-present), Stützlast (2013-present), Roof load (2013-present), Cargo space (1988-present), Tow weight (1988-present), Gas tank (all).

Performance data: 0-100 kmph (all), Max speed (all), Fuel efficiency overall (all), Fuel efficiency city (2013-present), Fuel efficiency highway (2013-present), Engine type (all), Fuel type (all), CO2 g/km (2007-present), CO2 efficiency class (2013-present), Pollution class (2013-present), Base price in Germany (2013-present).

The BIG Car Database

Should I add 1945-1980 cars considering the lack of detailed data, or focus on new cars?

Add older cars regardless of how little and inaccurate the available data is
Don’t add older cars unless some accurate detailed data is available (minimum engine size, horsepower, and production years for each engine), to not damage the overall level of 99% data completion for the main columns
We do not need cars older than 20-30 years. Rather use your free time to add more details about 2000s cars
Other

Poll – abbreviations

The reason I use abbreviations is that I sourced data from books, which also use abbreviations due to limited page space, and because when I started this European database in 2003 I had a 1024×768 screen, which is why I made it in only 12 columns. Since 2014 I have had a 24″ 1920×1200 screen, and this allowed me to expand the database to 45 columns. My screen is still not big enough, as the database would require a 27″ 2560×1440 screen to display completely, so I choose to hide columns when adding new data. However, after I enter all data from AutoKatalog books going back to 1970, there will be no more reason to have a database that fits the screen width, as the updates with new cars are done via copy-pasting and not manual typing. The databases I made for other continents, America and India, source data from websites which do not have the limitations of a printed book, so they use full words, and I wonder if I should do the same for the European database.

Should I use abbreviations or full names? (in columns body type, drivetrain, injection type, turbocharger, suspension, brakes, fuel type)

Abbreviations
Full names
Other

Poll – tires

How should I indicate tire size for sports cars with different front and rear tires?

one column saying “245/35-305/30 R 20” (this is original AutoKatalog format)
one column saying “front 245/35 R 20 / rear 305/30 R 20”
two columns “tires” and “rear tires (if different than front)”, leave second column empty for 90%+ of cars which have same tires front and back
Other
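
For programmers who receive the original one-column format, converting it to the two-column option is mechanical. The sketch below is only an illustration and assumes the combined value always looks like the example above, with a single shared rim size; real data may contain other variants (different rim sizes front and rear, speed indices) that this pattern does not cover:

```python
import re

# Matches the combined AutoKatalog-style form, e.g. "245/35-305/30 R 20".
_COMBINED = re.compile(r"^(\d+/\d+)-(\d+/\d+)\s*(R\s*\d+)$")


def split_tires(value):
    """Return (front, rear); rear is '' when there is no separate rear size."""
    value = value.strip()
    match = _COMBINED.match(value)
    if match is None:
        return value, ""
    front, rear, rim = match.groups()
    return f"{front} {rim}", f"{rear} {rim}"


print(split_tires("245/35-305/30 R 20"))  # ('245/35 R 20', '305/30 R 20')
print(split_tires("205/55 R 16"))         # ('205/55 R 16', '')
```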

Poll – hierarchy

Data structure is not optimal, causing loss of customers? Don’t go away, YOU can suggest a new data structure!

Other

Car Models Database with body styles & dimensions

Last release: 6 May 2019, including 34 models launched in 2018 and 17 models launched in 2019, see changelog.

Download SAMPLES (one make):
Car Models Database.XLS for general use
Car Models Database.CSV for programming
Car Models Database.SQL for web developers

Buy FULL database + FREE updates for one year:

Description & history

Started in 2005, originally as a research analysis to compare cars and study their evolution in terms of dimensions, weight and engines. For example, Golf 1 (3705 mm length, 1100-1800 cm³, 50-112 PS) vs Golf 5 (4204 mm length, 1400-3200 cm³, 75-250 PS). However, since I started selling it in 2012, its typical customers have been car shipping companies that price transport by car dimensions, and car cover manufacturers.

At least one customer bought this Car Body Specs Database (SURPRISE!) not for body specs, but just for the list of car body types. Previously I added car models only if car dimensions data was available, but in 2013 I chose as the ultimate goal to add EVERY car manufactured after World War II, even if no dimensions are available; over time I may find the dimensions and complete the table.

Car Models Database contains a separate row for each car body style, showing basic data:

Car dimensions (length, width, height, wheelbase, cube) with 99% completion for 1980-present and 75% for 1945-1980.
Car weight range and cargo space with 99% completion for 1980-present.
Car engine range (displacement, horsepower and top speed), for 1980-present (except for cars not sold in Europe).

Coverage

Cars sold in Europe: European domestic models 1945-present, Asian imports 1970/1980-present, American imports 1990-present. For more details see List of car manufacturers and years included.

Bonus: some Asian cars not imported into Europe, for example Malaysian cars (all 15 Proton models, not just the two imported into continental Europe; Perodua’s 10 models, none imported into continental Europe but a few imported into the United Kingdom). Add-on with cars produced in India, Brazil and Argentina (added during 2013).

This is NOT a database of the Latin America / India market. I guarantee completion for the European market ONLY. Customers in India and Latin America can buy the European database + the add-on of vehicles produced locally, delete the vehicles not imported into their country themselves, and ask me to add any additional foreign models available in their country but not in the European database!

Source of data: AutoKatalog for 1980s-present (99% data completion and accuracy guaranteed), and Wikipedia for older automobiles (expect some mess and missing data in pre-1980s). All data written manually in Excel, making an original product “Made by Teoalida”.

Sorting order: alphabetic by make, followed by class from mini to luxury cars, followed by model from old to new, followed by body style. You are advised NOT to change the sorting order; for example, do not sort models alphabetically.
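
If the rows do get shuffled during an import, the order can in principle be rebuilt with a composite sort key. This is only a sketch: the class labels, the field names and the alphabetical ordering of body styles are assumptions, not the actual values used in the file:

```python
# Hypothetical class labels, ordered from mini to luxury.
CLASS_ORDER = ["mini", "small", "compact", "mid-size", "luxury"]


def sort_key(row):
    """Make (A-Z), then class (mini to luxury), then model age, then body."""
    return (
        row["make"].lower(),
        CLASS_ORDER.index(row["class"]),
        row["year_from"],
        row["body"],
    )


# Illustrative rows only.
rows = [
    {"make": "Volkswagen", "class": "compact", "model": "Golf 5", "year_from": 2003, "body": "hatchback"},
    {"make": "Audi", "class": "luxury", "model": "A8", "year_from": 1994, "body": "sedan"},
    {"make": "Volkswagen", "class": "compact", "model": "Golf 1", "year_from": 1974, "body": "hatchback"},
    {"make": "Audi", "class": "compact", "model": "A3", "year_from": 1996, "body": "hatchback"},
]
ordered = [r["model"] for r in sorted(rows, key=sort_key)]
print(ordered)  # ['A3', 'A8', 'Golf 1', 'Golf 5']
```

A plain alphabetical sort on model names would lose exactly this chronological grouping within each class.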

This vehicle database focuses on models only, with a breakdown by body type (sedan, wagon, coupe, convertible, etc). There is NO list of car engines. For a breakdown by individual engines, see Car Models & Engines Database.

List of car makes included

Mainstream makes: AC Cars, Alfa Romeo, Alpine, Alvis, ARO, Asia Motors, Aston Martin, Audi, Austin / Rover, Austin-Healey, Autobianchi, Auto-Union, Bentley, BMW, Bond, British Leyland (Austin, MG, Mini, Morris, Riley, Rover, Triumph, Vanden Plas, Wolseley), Buick, Cadillac, Chevrolet USA, Chrysler USA, Chrysler Europe / Simca / Talbot (France) / Sunbeam / Hillman / Talbot (UK), Citroën, Dacia, Daewoo / Chevrolet Korea, DAF, Daihatsu, De Tomaso, DeLorean, Dodge, Ferrari, Fiat, Ford Europe, FSO, GAZ (Volga), Honda, Hummer, Hyundai, IFA (Trabant, Wartburg, Barkas), Infiniti, Innocenti, ISO, Isuzu, Iveco, Izh, Jaguar, Jeep, Kia, Lada, Lamborghini, Lancia, Land Rover, Lexus, Lotus, Maserati, Matra, Maybach, Mazda, Mercedes, Mini, Mitsubishi, Morgan, Moskwitch, Nissan, NSU, Oltcit, Opel, Otosan, Perodua, Peugeot, Pontiac, Porsche, Proton, Reliant, Renault, Rolls-Royce, Saab, Samsung, Santana, Seat, Skoda, Smart, SsangYong, Standard-Triumph, Subaru, Suzuki, Tatra, Tesla, Tofas, Toyota, TVR, UAZ, Vauxhall, Volkswagen, Volvo, Zastava, ZAZ.

Bonus minor makes: Artega, Bitter, Bristol, Bugatti, Caterham, Cizeta, Donkervoort, Felber, Fisker, Ginetta, GTA Spano, Gumpert, Isdera, Jensen, Koenigsegg, LuAZ, Lynx, McLaren, Messerschmitt, Monica, Monteverdi, Pagani, Panther, Saleen, Shelby SuperCars, Venturi, Wiesmann, ZIL (producing less than 100 cars per year).

List as of September 2017. Italic makes to be added in next releases. Additional minor makes may be added in the future.

Discontinued: Timeline database

Download SAMPLE: Car Models Database Timeline version.XLS

Same rows, but sorted by class and year instead of alphabetic by make, useful for my personal research but difficult to maintain and check for missing models.

This was the only database format from 2005 until December 2012, when I re-sorted the database alphabetically and sold both versions in the same package, but the timeline version lagged in updates. I updated the Timeline version and offered both versions for sale separately at the main yearly update of November 2013, but due to customers’ preference for the alphabetic version, I stopped updating the Timeline version in May 2014 and removed it from sale at the end of 2014.

The Timeline version’s purpose is to visualize car evolution over the years, and in 1 year NOBODY bought it for this purpose.

Car Models Database Timeline

This database will undergo a complete redesign in the future

In March 2016 I was thinking of offering a major update for Car Models Database once Car Models & Engines Database is fully expanded back to 1970: I will recheck every car and add correct dimensions, weight and engine range from the expanded bigger database. Until then, I will offer minor updates with new cars added and with dimensions only.

During 2017 I expanded Car Models & Engines back to 1975 using AutoKatalog books, leaving 1970-1975 to be added at a later stage, but in the meantime I created additional car databases for other continents that sold so well that I lost interest in this original European database. www.ADAC.de, which once covered 1990-present only, was redesigned in February 2017 with car images added, and during 2017 it was expanded backwards to 1950; then in March 2018 I added commercial vehicles to Car Models & Engines Database, sourcing them from ADAC.

In 2019 I realized that AutoKatalog has NO advantage over ADAC, so it is better to recreate Car Models Database sourcing data from ADAC instead. This way, the future database will have production years as month/year instead of just year, and will include car images and a unique ID number for each row.

This is a possible SAMPLE: Car Models Database new proposed format.xls. Feel free to suggest changes.

Some things are yet to be decided:

  • Cars that had a mid-life facelift that changed external dimensions only slightly do not have a separate row; instead they have “Length X mm after year Y” in the Notes column. Would it be better to make a separate row for every slight variation of dimensions, or would this create unnecessarily many rows?
  • Make & Model & Platform controversy: from when I started this database in 2003 until I published it on the website and started selling in 2012, there was a single column “Car name”; in 2012 I made columns Make and Model, and since 2014 there are 4 columns: Make, Model-Platform, Model, Platform. May I know how you are using these 4 columns?
  • I am questioning whether this is the best format or whether I can improve it even further; for example, I am thinking of removing (facelift) from model names and adding production years instead to differentiate pre- and post-facelift models.
  • If you have any better proposal, let me know!

Car Nameplates List

Download SAMPLE: Car Nameplates List.xls.

Buy FULL database + FREE updates for 1 year:

While Car Models List has a separate row for each model generation (for example Golf 1 to Golf 7) with production years indicated for each, and a single row if a car is sold in multiple countries under different names, the Car Nameplates List has a single row for each nameplate regardless of how many models wore it (sometimes a nameplate defines totally different models, for example Ford Fusion is a mid-size sedan in North America and also a mini MPV in Europe), and a separate row for each alternative name sold in various countries. Sister cars such as Fiat Bravo / Brava and VW Golf / Jetta also get separate rows.

Car Nameplates List is sold in 3 versions:

  • XLS original order from Car Models List (from mini cars to luxury cars, followed by sports cars, MPVs and ending with SUVs), used by me to add missing models.
  • XLS alphabetic order (you could sort yourself too).
  • CSV alphabetic with 2 files, one with one row per Make + Make ID, one with one row per model with Make ID, Make, Model, etc.
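
As a rough illustration of how the 2-file CSV version can feed a pair of drop-down boxes, the sketch below joins the two tables on Make ID. The inline strings stand in for the real files, and the exact column headers and ID values are assumptions:

```python
import csv
import io
from collections import defaultdict

# Stand-ins for the two CSV files; headers and IDs are assumed, not official.
MAKES_CSV = "Make ID,Make\n1,Fiat\n2,Ford\n"
MODELS_CSV = "Make ID,Make,Model\n1,Fiat,Bravo\n1,Fiat,Brava\n2,Ford,Fusion\n"


def load_dropdowns(makes_file, models_file):
    """Return {make name: [model names]} by joining both tables on Make ID."""
    makes = {row["Make ID"]: row["Make"] for row in csv.DictReader(makes_file)}
    models = defaultdict(list)
    for row in csv.DictReader(models_file):
        models[makes[row["Make ID"]]].append(row["Model"])
    return dict(models)


dropdowns = load_dropdowns(io.StringIO(MAKES_CSV), io.StringIO(MODELS_CSV))
print(dropdowns)  # {'Fiat': ['Bravo', 'Brava'], 'Ford': ['Fusion']}
```

With real files, the `io.StringIO` objects would be replaced by files opened with `open(path, newline="")`.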

History

The idea of Car Nameplates List came up in March 2013, when a customer showed me a used car classifieds website that had a drop-down list of models. At that moment I was NOT aware that my Excel database could be used for this kind of web building. Funny: that drop-down list had many errors and duplicate models such as Mercedes SLS, SLS-Class, SLS AMG, 3 rows for what is actually a single model. I offered to create a Car Nameplates List without such stupidities myself, but he refused and purchased Car Models List instead.

I still made a SAMPLE of Car Nameplates List with 2 makes and put it among possible future projects, waiting for an interested customer, saying that I could build the full list in 3 days for 20 euro. For 2 years NOBODY showed any interest, while Car Models List continued to get sales.

In 2015 I posted Car Nameplates List in the product grid of the main page. The first interested customer came in November 2015 but did not leave an email address where I could let him know that the project was complete. A second customer came in January 2016. So I decided to start the project, and I spent 5 days recompiling Car Models List into Car Nameplates List. I published it for sale in February 2016: 2900 nameplates, 524 KB file.

One customer suggested adding a numerical ID for each make, making a second table with Make and Make ID, and selling both as SQL files for making drop-down boxes on his website. DONE!

The plan was to start regular updates once it reached 10 sales, but by the end of 2016 only 5 people had purchased Car Nameplates List, while Car Models List got about 40 sales. 4 people purchased in 2017. The project looked like a failure. Sales increased to 11 in 2018, so I decided to revive the project, but I did not know how long it would take, and the priority was to update the more expensive databases with higher sales volume.

I did the first update in January 2019. It took me 10 hours, working side-by-side on screen with Car Models List, adding 430 nameplates and making a total of 3330 nameplates from 183 makes, 725 KB. I added what was launched in the last 3 years, but also lots of nameplates from outside Europe and North America that were added to Car Models List during the last 3 years of worldwide expansion.

Another update in May 2019, 3658 nameplates from 229 makes, 801 KB. I added over 40 minor manufacturers in Car Models List and Car Nameplates List.

Because in the recent updates I added many minor brands / exotic cars that are seen more in museums and collections than driving on roads, at the May 2019 update I also added a column that allows you to easily delete the exotic brands, if you want a database with only cars commonly seen on roads.

Example of cars included and marked as minor brands: Cizeta-Moroder (1990s supercar with only 20 units produced), Messerschmitt (cheap 1950s bubble car, 15,000 units produced, but most of them have been scrapped and are no longer driving), Peel (1960s, 2 models, fewer than 100 units produced, but famous for the Peel P50, which is listed in the Guinness Book of Records as the world’s smallest car).

Car Engine Database

Download SAMPLE:
Car Models, Engines .XLS

Buy FULL database + FREE updates for one year:

A vehicle database intended for auto parts shops and other businesses, and for web designers making drop-down boxes that need to go further than make and model. It is simplified from The BIG Car Database by removing duplicate rows for the same engine caused by facelifts that affected only the exterior of cars.

Last release: 6 May 2019, including 564 versions launched in 2018 and 190 versions launched in 2019. Full change log.

Coverage

Cars sold in Europe. Being based on AutoKatalog (the German car magazine), the database has 100% coverage for cars sold in Germany, which is the biggest car market in Europe. 99% of cars sold in any European country are also sold in Germany. I also added unique cars sold only in the UK, communist cars sold only in the Eastern bloc, as well as special cars with 2.0 liter engines made to comply with the Italian tax system.

If you are outside Europe, you can check the list of car models included in the Excel file and report missing models (sold in your country but not in Europe) so I can add their data if available, but I cannot guarantee that I can add them all.

Data fields included

The database contains the following columns, each shown with the period for which it includes data. Completion is nearly 99% for every column in the period covered. For more columns, see The BIGGEST Car Database.

Naming: Make, Model, Version, Years, Source of data, Sold in, Class.

Body data: Body type, No. of doors, No. of seats, Engine place, Drivetrain.

Engine data: Cylinders (all), Displacement (all), Power kW (all), Power PS (all), Torque Nm (all), Catalytic (1986-1993).

Drivetrain data: Tire size (all), Tire size rear (if different than front).

Performance data: Engine type (all).

How to use the database

Whether you are creating a car parts website, a car insurance website or just an informational website, you are advised to make 3 drop-down boxes: Make, Model, Version (example: www.onlinecarparts.co.uk). DO NOT make a drop-down for Year, as Europeans do not need this; that system is good only for America. Use columns D, E, F, H, with column E used to group the list of model platforms (example BMW 3-Series – E36, E46, E90-93).

This database will undergo major redesign in the future… YOUR opinion is needed!

Several customers (especially the ones in the ECU chiptuning field) said that this database is too detailed and contains duplicate rows for the same engine, because the engine is offered with multiple bodies and drivetrains and they do not need these details. I am planning to make a NEW Car Engine Database by stripping down data from the German car database to reduce the number of rows until it meets your requirements.

This is a possible SAMPLE: Car Models Engines Database new proposed format.xls. Feel free to suggest changes.

If same engine is available on multiple body styles…

BMW 320i is enough, I do not need 4 rows for saloon, touring, coupe, cabrio
I need separate row for saloon, estate, coupe, cabrio, but 3/5-door hatchbacks can share same row
Even 3-door and 5-door hatchbacks should be 2 separate rows
Other

If an engine is offered with 2-wheel drive and 4-wheel drive…

A car engine database should omit multiple drivetrain options, 1 row per engine is enough
Front wheel drive, rear wheel drive and all wheel drive should be each on separate row
Other

Should production years be indicated for each engine?

I do not need production years for each BMW 320i, 328i, 330d, etc; it is enough to indicate production years for 3-Series E46, E90, etc
Besides production years for the main model, each engine version should have production years indicated
Other

How to indicate engine size?

litres
cm³
Other

Car Tire Database

Download SAMPLE:
Car Models, Engines, Tires .XLS

Buy FULL database + FREE updates for one year:

Last release: 6 May 2019, including 564 versions launched in 2018 and 190 versions launched in 2019. Full change log.

Coverage

Cars sold in Europe. Being based on AutoKatalog (the German car magazine), the database has 100% coverage for cars sold in Germany, which is the biggest car market in Europe. 99% of cars sold in any European country are also sold in Germany. I also added unique cars sold only in the UK, communist cars sold only in the Eastern bloc, as well as special cars with 2.0 liter engines made to comply with the Italian tax system.

If you are outside Europe, you can check the list of car models included in the Excel file and report missing models (sold in your country but not in Europe) so I can add their data if available, but I cannot guarantee that I can add them all.

Data fields included

The database contains the following columns, each shown with the period for which it includes data. Completion is nearly 99% for every column in the period covered. For more columns, see The BIGGEST Car Database.

Naming: Make, Model, Version, Years, Source of data, Sold in, Class.

Body data: Body type, No. of doors, No. of seats, Engine place, Drivetrain.

Engine data: Cylinders (all), Displacement (all), Power kW (all), Power PS (all), Torque Nm (all), Catalytic (1986-1993).

Performance data: Engine type (all).

Car Models, Engines, Dimensions, Performance

Download SAMPLE:
Car Models, Engines, Dimensions, Performance .XLS

Buy FULL database + FREE updates for one year:

Last release: 6 May 2019, including 564 versions launched in 2018 and 190 versions launched in 2019. Full change log.

Coverage

Cars sold in Europe. Being based on AutoKatalog (the German car magazine), the database has 100% coverage for cars sold in Germany, which is the biggest car market in Europe. 99% of cars sold in any European country are also sold in Germany. I also added unique cars sold only in the UK, communist cars sold only in the Eastern bloc, as well as special cars with 2.0 liter engines made to comply with the Italian tax system.

If you are outside Europe, you can check the list of car models included in the Excel file and report missing models (sold in your country but not in Europe) so I can add their data if available, but I cannot guarantee that I can add them all.

Data fields included

The database contains the following columns, each shown with the period for which it includes data. Completion is nearly 99% for every column in the period covered. For more columns, see The BIGGEST Car Database.

Naming: Make, Model, Version, Years, Source of data, Sold in, Class.

Body data: Body type, No. of doors, No. of seats, Engine place, Drivetrain.

Engine data: Cylinders (all), Displacement (all), Power kW (all), Power PS (all), Power rpm (all), Torque Nm (all), Torque rpm (all), Catalytic (1986-1993), No. of gears manual (all), No. of gears automatic (all).

Dimensions and weight data: Wheel base (all), Length (all), Width (all), Height (all), Curb weight (all), Gas tank (all).

Performance data: 0-100 kmph (all), Max speed (all), Fuel efficiency overall (all), Engine type (all), CO2 g/km (2007-present).

Australian car database

Small database – see SAMPLE

BIG database – see SAMPLE

Contact me for custom packages! For example, you can ask for model naming, engine power and torque, wheels and tire dimensions, for 1990-2019, 2005-2019 or whatever you need. Price will be 0.3 eurocents / model.

Australia car database

I have been making vehicle databases for Europe since 2003, for America since 2014, for India since 2015… what is next to do in 2016? Maybe Australia?

I created this page in May 2016 as a marketing experiment, to see how many people view the Australia page and decide whether it is worth my effort to create a car database for a country with such a small population, which may translate into just a few sales per year.

The page got very low traffic for one year, but during the last days of May 2017 2 people were interested in purchasing an Australian car database, so I started studying possible sources of data. Another 2 people left comments on 3 and 4 June; they asked me to scrape data from several possible websites (despite the legal issues of scraping – HAD TO DO IT to serve people who had no other choice).

Due to anti-scraping measures on the source website, my programmer partner’s scraper did not work, so I had to look online for another scraper and found one that was slow and buggy, crashing a lot, forcing me to run it in small batches of about 1000-2000 cars and assemble them. Scraping took about 2 weeks, and I published the database on 16 June 2017. Of the initial 4 customers, only 1 ended up purchasing, but new customers came, and the number of sales in the first months was surprisingly high for a country with just 1/13 of the population of the United States.

In November 2017 the first customer asked for an update, and when I tried to do it, I noticed that the source website had removed the META tags for year, make and model, so the only place where this essential information is displayed is the page title, where year, make, model and badge are in a single field, requiring me to separate them manually after each update. An update takes 1-2 days and involves scraping only the last year of cars. I was hugely lucky to have scraped all 90000+ cars in June 2017, getting make, model and year separately and automatically.
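
Part of that manual separation could be assisted by a script. The sketch below is not the method I use; it assumes the title has the form "YEAR MAKE MODEL BADGE" and relies on a hypothetical list of known makes to handle two-word names. Multi-word model names would still be split wrongly, so a manual check remains necessary:

```python
# Illustrative subset; a real list would come from the existing database.
KNOWN_MAKES = ["Alfa Romeo", "Land Rover", "Toyota", "Ford"]


def split_title(title):
    """Split 'YEAR MAKE MODEL BADGE' into fields; return None if unrecognised."""
    year, _, rest = title.strip().partition(" ")
    if not (year.isdigit() and len(year) == 4):
        return None
    # Try longer make names first so two-word makes are matched whole.
    for make in sorted(KNOWN_MAKES, key=len, reverse=True):
        if rest.lower().startswith(make.lower() + " "):
            model, _, badge = rest[len(make) + 1:].partition(" ")
            return {"year": int(year), "make": make, "model": model, "badge": badge}
    return None


print(split_title("2017 Land Rover Discovery SE"))
# {'year': 2017, 'make': 'Land Rover', 'model': 'Discovery', 'badge': 'SE'}
```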

At April 2018 update the source website removed several data fields such as VIN, added them back few days later, removed again after 1 month, etc.

In February 2019 I met a student from Australia who asked few questions about my business, I answered them thinking that he is a customer looking to buy a database, but turned a student and later he threatened me that will build his own car database selling website if I don’t help him (help with what?), he claim that scraped from ******* with Python. ,but we decided to work together rather than competing each other. He also said that if I have more customers interested in web scraping I can pass to him, and I did passed one who probably paid him another $500.

He gave me a BETA .py scraper which was not working (I paid for it), I told him to fix and after few days he came with idea to host scraper on AWS and let script run automatically on schedule, he gave me me username/password where I could export data as CSV and sell via my website. I did this, providing an update for all my customers on 24 March (118 columns) and promising to all customers monthly updates that involve re-scraping ALL 1960-present cars, not just latest year, Private price guide and Trade in price guide will be updated accordingly.

2 customers reported missing data for most cars in the Standard / Optional Equipment columns. I asked the student to fix the errors; on 18 April 2019 he replied “I’ve been really busy lately with Uni exams, interviews for jobs and the insurance project. Sorry Ill get to the Australian database as soon as I can” and that was the LAST day I heard from him. He did not sign in anymore. The AWS account was suspended, probably because AWS offers free service and bills you at the end of the month, and he did not pay the bills, so all I had was a non-working BETA scraper. I think that he may also have died in a car crash: he told me that he had passed his driving exam recently, and given that his dad owned a business, he may have got a very powerful car.

While I can update the database again using my old method (96 columns only, without average trade-in value, without colors and features, etc), adding new cars and not updating older records, I hope to find another Python expert to correct his scraper and make it run properly.

I found someone in India experienced in Python. I paid him $50 and $100 for two small projects that he completed successfully, then in July 2019 I gave him the BETA scraper from the Australian student and we agreed on $250 to fix it… He said that it was the most difficult project he had ever done in Python; he also made many errors, and we were rarely online at the same time. Only in September 2019, after paying another $150, could I say that he had fixed most errors and that I could extract most of the data from the source website. I decided to provide you a TEMPORARY update with 2019 models alongside the March 2019 update (1960-2019), and I ask all my customers to report errors so I can tell the programmer to fix them before scraping all 1960-2019 cars with this new scraper, so that the whole database will be harmonized.

When I tried to scrape 1960s cars, I noticed that the scraper breaks when it comes across a car with NO image. I had to pay the Indian programmer again to fix it, then I found more errors. His experience is very low… I need to mention that he offered to sell me a couple of databases for $$; in October I figured out that “his portfolio” consisted mostly of databases available for free download on various sites, but he lied to me that he had created them himself, which misled me into thinking that he was very experienced (many customers I told this story to said that he is really an idiot and a scammer to lie and charge money for databases he did not create). I spent sooo much time testing his scraper and reporting errors in the Australian database that customers of the European and American car databases complained about delayed updates for their databases. So I need to take a break from the Australia project to serve the Americans.
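A scraper can avoid breaking on cars without an image by treating the image as optional. The sketch below is not the code of the scraper described above, which is not shown on this page. It only illustrates the idea with Python's standard `html.parser`, under the assumption that the car image is the first `<img>` tag with a `src` attribute on the page. The names `FirstImage` and `image_url_or_none` are hypothetical.

```python
from html.parser import HTMLParser

class FirstImage(HTMLParser):
    """Remember the src of the first <img> tag that has one."""

    def __init__(self):
        super().__init__()
        self.src = None

    def handle_starttag(self, tag, attrs):
        if tag == "img" and self.src is None:
            # attrs is a list of (name, value) pairs; src may be absent.
            self.src = dict(attrs).get("src")

def image_url_or_none(html):
    """Return the first image URL found in the page, or None if there is none."""
    parser = FirstImage()
    parser.feed(html)
    return parser.src
```

With this approach a car without a photo gets an empty Image URL cell instead of stopping the whole run.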

As of 5 November I provided another update with 2019 models, and I can re-scrape 2019 models anytime you require. The 1960-2018 cars I will re-scrape when I am less busy. I need to mention that the source website has an IP blocker which was not present in March 2019, when the Australian student did the same job. Due to this IP blocker I cannot leave the scraper running 24/7 and have the data ready in a few days; I need to actively monitor the scraper and run it in small batches.
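Running in small batches can be sketched as follows. This is only an illustration, not the actual scraper. The `fetch` function is supplied by the caller, and the batch size, the delay between pages and the pause between batches are all assumed values that would have to be tuned against the real IP blocker.

```python
import time

def chunked(items, size):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]

def scrape_in_batches(urls, fetch, batch_size=1000, page_delay=3.0,
                      batch_pause=600.0, sleep=time.sleep):
    """Call fetch(url) for every URL, pausing between pages and batches.

    Stops at the first failing batch and returns (rows, next_index), so the
    run can be resumed from urls[next_index:] once the block is lifted.
    """
    results = []
    done = 0
    batches = list(chunked(urls, batch_size))
    for number, batch in enumerate(batches):
        batch_rows = []
        try:
            for url in batch:
                batch_rows.append(fetch(url))
                sleep(page_delay)
        except Exception as error:
            # A blocked IP usually shows up as an error; keep finished batches only.
            print(f"batch {number} failed: {error}")
            return results, done
        results.extend(batch_rows)
        done += len(batch)
        if number < len(batches) - 1:
            sleep(batch_pause)
    return results, done
```

Keeping only complete batches means a failed batch is simply run again later, which makes it easier to assemble the partial results.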

If anyone else is willing to help me, please help!

List of car makes included

Abarth, Alfa Romeo, Alpina, Alpine, Armstrong Siddeley, Asia Motors, Aston Martin, Audi, Austin, Austin Healey, Australian Classic Car, Bedford, Bentley, Bertone, Blade, BMC, BMW, Bolwell, Bufori, Bugatti, Buick, Cadillac, Caterham, Chery, Chevrolet, Chrysler, Citroen, Commer, CSV, Daewoo, Daihatsu, Daimler, Datsun, De Tomaso, Dodge, Elfin, Eunos, Ferrari, Fiat, Ford, Ford Performance Vehicles, Foton, FSM, Geely, Giocattolo, Great Wall, Haval, HDT, Hillman, Hino, Holden, Holden Special Vehicles, Honda, Humber, Hummer, Hyundai, Infiniti, International, ISO, Isuzu, Jaguar, Jeep, Jensen, JMC, Kia, KTM, Lada, Lamborghini, Lancia, Land Rover, LDV, Lexus, Leyland, Lightburn, Lincoln, Lotus, Mahindra, Maserati, Maybach, Mazda, McLaren, Mercedes-Benz, MG, MINI, Mitsubishi, Morgan, Morris, Nissan, Noble, NSU, Opel, Pagani, Peugeot, Pontiac, Porsche, Proton, RAM, Rambler, Renault, Robnell, Rolls-Royce, Rover, Saab, Seat, Simca, Skoda, smart, SsangYong, Studebaker, Subaru, Suzuki, Tata, TD 2000, Tesla, Toyota, TRD, Triumph, TVR, Vanden Plas, Vauxhall, Volkswagen, Volvo, Wolseley, ZX Auto.

Data fields included

Percentages are calculated for the 1960-2017 database. If you buy 1990-2017, 2000-2017, etc, you will get a higher completion ratio. Certain data fields are available for recent cars only; for example, Fuel Consumption is available starting from the 2000s.

Naming: Full car name 100%, ID 100%, Make 100%, Model 100%, Year 100%, Price 97.16%, Image URL 69.76%.

Description: Body 100%, Engine 99.94%, Transmission and Drivetrain 100%, Fuel Type 100%, Fuel Consumption 52.84%.

Overview: Badge 100%, Series 100%, Body 100%, No. Doors 99.97%, Seat Capacity 98.53%, Transmission 100%, Number of Gears 99.99%, Drive 99.99%, FuelType 100%, Recommended RON Rating 55.10%, Release Year 100%, VIN 77.14%, Country of Origin 99.97%, ANCAP Safety Rating 29.29%, Overall Green Star Rating 32.33%, Text Description 25.81%.

Engine: Engine Type 99.99%, Engine Location 97.77%, Engine Size 99.92%, Induction 99.94%, Engine Configuration 78.95%, Cylinders 99.93%, Camshaft 86.72%, Valves/Ports per Cylinder 86.19%, Compression ratio 74.18%, Engine Code 58.01%, Power 85.10%, Torque 81.67%, Power to Weight Ratio 81.42%, Acceleration 0-100km/h 37.14%, Maximum Speed 0.39%.

Fuel: Fuel Type 100%, Fuel Capacity 78.03%, RON Rating 55.10%, Maximum Ethanol Blend 78.03%, Fuel Delivery 99.92%, Method of Delivery 99.90%, Fuel Consumption Combined 52.84%, Fuel Consumption Extra Urban 56.27%, Fuel Consumption Urban 58.10%, Fuel Average Distance (km) 52.61%, Fuel Maximum Distance 55.69%, Fuel Minimum Distance 57.33%, CO2 Emissions Combined 51.83%, CO2 Extra Urban 30.36%, CO2 Urban 30.40%, Greenhouse Rating 32.05%, Air Pollution Rating 32.07%, Green Star Rating 32.32%, Emission Standard 25.57%.

Dimensions and weight: Length 83.34%, Width 83.31%, Height 83.03%, Wheelbase 83.64%, Track Front 77.29%, Track Rear 77.28%, Tare Mass 61.18%, Kerb Weight 75.25%, Gross Vehicle Mass 58.79%, Gross Combination Mass 31.62%, Payload 46.69%, Boot Load Space Min 9.93%, Boot Load Space Max 10.96%, Towing Capacity braked 62.81%, Towing Capacity Unbraked 59.22%, Load Length 4.22%, Load Width 4.17%, Width Between Wheel Arches 2.28%.

Warranty: Warranty in Years from First Registration 72.28%, Warranty in Km 70.45%, Warranty Customer Assistance 47.05%, Warranty Anti Corrosion from First Registration 13.23%, Free Scheduled Service 0.98%, First Service Due in Km 42.11%, First Service Due in Months 34.40%, Regular Service Interval in Km 43.86%, Regular Service Interval in Months 39.66%.

Steering and Wheels: Steering 65.04%, Rim Material 39.66%, Front Rim Description 79.46%, Rear Rim Description 79.46%, Front Tyre Description 75.54%, Rear Tyre Description 75.55%.
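
If you want to check the completion ratio for the year range you bought, a minimal sketch is shown below. It assumes that the percentage of a field is the share of cars with a non-empty cell in that column, and that the database rows are loaded as dictionaries (for example with `csv.DictReader`). The function name `completion_ratios` is hypothetical.

```python
def completion_ratios(rows):
    """Return {column: percent of rows with a non-empty value}, rounded to 2 decimals.

    rows is a list of dicts that share the same keys.
    """
    if not rows:
        return {}
    total = len(rows)
    ratios = {}
    for column in rows[0].keys():
        filled = 0
        for row in rows:
            value = row.get(column)
            # Count a cell as filled unless it is missing or blank text.
            if value is not None and str(value).strip() != "":
                filled += 1
        ratios[column] = round(100.0 * filled / total, 2)
    return ratios
```

Filtering the rows by year before calling the function gives the ratios for a smaller package such as 2000-2017.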

List of updates

122 makes, 92885 cars, as of 16 June 2017. Initial launch.
123 makes, 93901 cars, as of 20 November 2017. After about 10 sales, the first customer asked for an update.
123 makes, 96604 cars, as of 20 April 2018. I planned to update every 3 months, but the source website wasn’t working in February and March, so I was only able to do the update in April.
123 makes, 97610 cars, as of 30 August 2018.
123 makes, 98225 cars, as of 10 December 2018.
127? makes, 99925 cars, as of 24 February 2019, a test with the new scraper, not published.
124 makes, 100492 cars, as of 24 March 2019, published with 118 columns instead of 96.
3002 cars (2019 models only), temporary update as of 18 September 2019.

How often do you want updates? (Australia car database)

Every month
Every 3 months
Every 6 months
Every year
One-time project, not interested in updates
Other
Please Specify:

Car photo database

Everyone looking for a car database with pictures is invited to discuss how I should do this job!

A few years ago I was thinking of adding photos collected from Wikipedia to the book-style Car Models Encyclopedia.DOC, but I was never sure if this was the right thing to do! The .DOC format proved unpopular.

Update: in 2015 I learned web scraping, allowing me to quickly create new databases by copying data from various websites. The new databases for India, the Middle East and Australia, made via scraping, do contain image URLs, and if you want to bulk download all image files, copy the URLs from my database into the Tab Save extension for Chrome.

2013 idea… should I collect car photos from Wikipedia?

There are plenty of images on Wikipedia that anyone can collect themselves, but numerous people asked ME for a database of car images. Probably these people are too busy to dig for images on Wikipedia, or they want all images to be cropped to the same size.

OK, I want to help you! But what should I do? Should I collect photos of every car model, resize them to the same resolution, and sell them as a .RAR archive? Should they be linked in some way with the Excel database?

I love working with fixed data, but in the case of photos there are a lot of variables. On the internet you can find car images taken from various angles, showing cars of various colors, in different places (driving / parked / showroom); which images should I collect? Then, if the car has multiple body variants, should the photo show the base variant, a random variant, or one photo of each version?

The Car Models & Bodies Database contains over 3000 car model body versions produced 1945-present, so theoretically I need to collect 3000+ photos?

All these dilemmas should be settled BEFORE starting this huge work the wrong way… and being disappointed at not seeing anyone purchase it.

During early 2013, 4 people contacted me about a photo database, but only one told me something: “I dont think that will work? Unless you have a SQL database with pics linked into it.” then “thanks anyways” and quit the chat… without telling me anything regarding the format of the photos themselves. I hate such lazy people! How am I supposed to know what the correct format of a photo database is? I am not familiar with SQL.

In May 2013, finally one customer told me what to do: find a car website from which to save photos (but isn’t this a copyright issue??!! I would rather choose free photos from Wikipedia), one photo for each body style, try to choose the same angle and environment in all photos, then rename the photos with the correct car model name, then crop and resize thumbnails (I am confused about this part).

So, I created this: Car Photos Database SAMPLE
90 MB worth of photos, including all Volkswagen models since the 1970s. I selected photos of at least 1024×768, with a side-front angle for most cars and a side-back angle for body derivatives. The rest of the brands will be done after I get more feedback.

Just 1 week later, another customer told me to STOP the car images database, saying that the above sample is NOT right, that the images collected are useless and may lead to copyright troubles. He was looking to purchase licensed images, shot from fixed angles with the background removed… unfortunately I do not know how I can do that. Do you?

Maybe I will continue if a few customers say that it is OK to collect images from Wikipedia (although it may still be a copyright issue). I hope you don’t want me to go out on the street with a camera to take one image of every distinct car model myself? Who has time for that, especially hunting rare cars that are never seen in my country?

2015 idea… using web scraping software to get car photos

In 2015 I learned about data scraping from websites… initially scraping only the text data, and I created the American and Indian car databases as well as the Motorcycle database. In 2016 I figured out that I can scrape image URLs too, and because customers asked, I added an image URL column to the above 3 databases.

So I am selling a database of image URLs, and if you want to bulk download all image files, use the Tab Save extension for Chrome and copy-paste the URLs from my database.
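
For programmers who prefer a script over a browser extension, the bulk download can be sketched with Python's standard library. This is only an illustration under stated assumptions: each row is a pair of car ID and image URL taken from the database, files are named after the ID, and the fallback extension `.jpg` is a guess. The function names `target_name` and `download_images` are hypothetical.

```python
import os
import urllib.request
from urllib.parse import urlparse

def target_name(car_id, url):
    """Build a local file name from the car ID, keeping the URL's extension."""
    extension = os.path.splitext(urlparse(url).path)[1] or ".jpg"
    return f"{car_id}{extension}"

def download_images(rows, folder="car_images"):
    """Download every image; rows is an iterable of (car_id, image_url) pairs.

    Rows with an empty URL are skipped, and a failed download is reported
    and skipped instead of stopping the whole run.
    """
    os.makedirs(folder, exist_ok=True)
    saved = []
    for car_id, url in rows:
        if not url:
            continue
        path = os.path.join(folder, target_name(car_id, url))
        try:
            urllib.request.urlretrieve(url, path)
        except (OSError, ValueError) as error:
            print(f"skipped {url}: {error}")
            continue
        saved.append(path)
    return saved
```

Please respect the terms of the website hosting the images and add a pause between downloads if you fetch many files.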

In the same way I can use web scraping software to get the URLs of all images from any website you want. This will save me from spending a large amount of time digging for images on Wikipedia.

In December 2016 a customer asked me to scrape a used cars website, to get image URLs besides Make, Model, Year. It took only a FEW HOURS and I got over 100,000 car images, all in the same resolution. He told me to keep it private and not to publish or resell it on my website. So I am telling you only the idea. If anyone wants to scrape car images in this way, let me know what website to scrape!

I am glad that I did not waste a week collecting 3000 photos from Wikipedia, since there are better methods available.

Car photo survey

Would it be OK to collect FREE photos from Wikipedia, rename the files to car model names, and sell them for MONEY as a fee for the time required to sort them out?

YES, what is on Wikipedia is FREE to use; it is OK to collect photos from Wikipedia and sell them, as long as you charge money as a collecting and sorting fee and not for the photos themselves. For a few thousand photos you (and we too) won’t get into copyright troubles.
NO, this is a copyright issue, don’t risk running into trouble. Moreover, such a car photo database would be useless even if offered FREE. We are NOT going to use a “car photo database” with photos stolen from Wikipedia and get ourselves into copyright troubles.
Other
Please Specify:


What type of photos?

Photos of driving cars
Photos of parked cars
Photos in showroom
Any of them… does not matter
Other
Please Specify:


How many photos per car?

One random photo, angle does not matter
One photo from side or maybe slightly oriented to front
Two photos per car, front-side and back-side
As many photos as possible from many angles (do you realize how much time it takes?)
Other
Please Specify:


Second American car database

Database starts at 1990 with model naming and prices, but specifications are available starting from 1996. Last release: 14 June 2019, including 2744 models of MY 2019 and 645 of MY 2020.

Download free SAMPLES:
American-Car-Database-No-Specs-by-Teoalida-SAMPLE.xls (5 columns)
American-Car-Database-Basic-Specs-by-Teoalida-SAMPLE.xls (28 columns)
American-Car-Database-Full-Specs-by-Teoalida-SAMPLE.xls (210 columns)

Buy FULL database + FREE updates for one year:

Description

I have offered a car database for the USA market since 2014, named Year-Make-Model-Trim-Specs, but it lacks several things that some customers were looking for, for example MSRP (manufacturer’s suggested retail price), which it did not have until the April 2018 update.

I discovered another easy-to-scrape website in October 2017, when a customer interested in MSRP suggested that I scrape that website and create a new database for the American market. Then in November 2017 another customer asked me for specific data, so I offered to sell him the same database, re-scraped to include all specifications. After selling privately to 2 customers, both interested in 2017-2018 models only, I decided to scrape all 1997-2018 models and publish the database on the website so everyone can purchase it.

In March 2018 I noticed that the source website had added specifications for 1996 models and also 1990-1995 models without specifications (only model names and prices). I do not know if they will add specifications for pre-1996 cars too in the future.

Accuracy notes

This car database is made via web scraping and sold “as is” without corrections or additions, because any changes made would be lost at the next update. While specifications for each individual car are pretty accurate, filtering cars of a specific class may be troublesome, because the class field contains multiple values with the same meaning: “midsize car”, “midsize cars”, “mid-size cars”. The model hierarchy is slightly messy; for example, Jetta is 1996-2005, 2009-2014, 2017-2018, New Jetta is 1999, Jetta Sedan 2002-2016, etc.
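
Because the database itself is left uncorrected, buyers who need to filter by class can normalize the labels on their side. The sketch below is one simple way to do it, assuming that variants differ only in letter case, hyphens, spaces and a trailing plural "s". The function names and the column name `class` are hypothetical.

```python
import re

def normalize_class(label):
    """Collapse spelling variants of a class label into one key.

    Lower-cases the text, drops spaces and hyphens, and drops a trailing 's'.
    """
    key = re.sub(r"[\s\-]+", "", label.strip().lower())
    return key[:-1] if key.endswith("s") else key

def group_by_class(rows, field="class"):
    """Group row dictionaries by their normalized class label."""
    groups = {}
    for row in rows:
        groups.setdefault(normalize_class(row[field]), []).append(row)
    return groups
```

With this rule “midsize car”, “midsize cars” and “mid-size cars” all map to the same key, `midsizecar`. The rule is crude: a label that naturally ends in "s" would also lose its last letter, so the resulting groups should be reviewed once.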

You are advised to buy Year-Make-Model-Trim-Specs if you care about accuracy, unless you need certain data available only in the 230-column package of the Second American Car Database.

Besides its nearly perfect specs accuracy, Year-Make-Model-Trim-Specs is based on a manually built list of car model URLs, thus I was able to improve the model hierarchy, correct errors and add data not included in the original website.

List of car makes

Acura, Alfa Romeo, AM General, Aston Martin, Audi, Bentley, BMW, Buick, Cadillac, Chevrolet, Chrysler, Coda, Daewoo, Dodge, Eagle, Ferrari, FIAT, Fisker, Ford, Genesis, Geo, GMC, Honda, HUMMER, Hyundai, INFINITI, Isuzu, Jaguar, Jeep, Kia, Lamborghini, Land Rover, Lexus, Lincoln, Lotus, Maserati, Maybach, Mazda, McLaren, Mercedes-Benz, Mercury, MINI, Mitsubishi, Nissan, Oldsmobile, Panoz, Plymouth, Pontiac, Porsche, Ram, Rolls-Royce, Saab, Saturn, Scion, smart, Subaru, Suzuki, Tesla, Toyota, Volkswagen, Volvo.

List of updates

1136 models, 7224 model years, of which 6608 with specs, ? with trims, 46140 trims, as of December 2017.
1191 models, 8390 model years, of which 8074 with specs, 6644 with trims, 46804 trims, as of 12 March 2018, published 18 March.
1210 models, 8651 model years, of which 8313 with specs (? full specs), 56363 trims (? full specs) as of 20 September 2018.
1215 models, 8694 model years, of which 8313 with specs (? full specs), 56369 trims (? full specs) as of 26 December 2018 (the source website did not add more specs in 3 months?)
1225 models, 8762 model years, of which 8439 with specs (7008 full specs), 57181 trims (49381 full specs) as of 1 Feb 2019.
1229 models, 8790 model years, of which 8454 with specs (7023 full specs), 57312 trims (49512 full specs) as of 28 March 2019.
1235 models, 8877 model years, of which 8547 with specs (7116 full specs), 58006 trims (50209 full specs) as of 4 June 2019.

Each update involves re-scraping all makes, models, years and trims; the job takes about 50 hours at a speed of 3 seconds per page.
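
The arithmetic behind this estimate can be checked with a few lines of Python. The sketch assumes roughly one scraped page per trim, which is my simplification and not a statement about how the source website is organized. The function names are hypothetical.

```python
def pages_for(hours, seconds_per_page):
    """Number of pages that fit in the given time at a fixed pace."""
    return int(hours * 3600 / seconds_per_page)

def hours_for(pages, seconds_per_page):
    """Time in hours needed to scrape the given number of pages."""
    return pages * seconds_per_page / 3600

# About 50 hours at 3 seconds per page corresponds to 60000 pages.
print(pages_for(50, 3))
# The 58006 trims of the June 2019 update, one page each, need about 48.3 hours.
print(round(hours_for(58006, 3), 1))
```

Both numbers agree with the "about 50 hours" figure, so the trim pages account for nearly all of the scraping time.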