Download FREE sample (one make):
Car make, model, version, no specs (5 columns)
Car make, model, version, basic specs (28 columns)
Car make, model, version, full specs & features (188 columns)
Alternate formats: CSV and SQL (full specs & features)
Buy FULL database (all makes) + FREE updates every month:
Coverage: oldest car included in database that indicate year is Daewoo from 1994, the FIRST foreign car manufacturer to enter in Indian market, followed by Ford and Opel (1996), Fiat (1997), Honda and Hyundai (1998). It may include pre-1994 models from Indian domestic brands but no year is indicated.
Makes included: Ashok Leyland, Aston Martin, Audi, Bentley, BMW, Bugatti, Caterham, Chevrolet, Chrysler, Datsun, Daewoo, DC, Eicher Polaris, Ferrari, Fiat, Force Motors, Ford, Honda, Hindustan Motors, Humber, Hyundai, ICML, Isuzu, Jaguar, Jeep, Kia, Lamborghini, Land Rover, Lexus, Mahindra, Mahindra-Renault, Maini, Maruti Suzuki, Maserati, Maybach, Mercedes-Benz, MG, Mini, Mitsubishi, Opel, Nissan, Porsche, Premier, Renault, Rolls-Royce, San, Skoda, Ssangyong, Tata, Toyota, Volkswagen, Volvo, Willys.
Download FREE sample (one make):
Bike make, model, version, full specs (93 columns)
Buy FULL database (all makes) + FREE updates when someone ask an update:
Coverage: since NO bike have production years indicated, I cannot answer which is oldest bike included in database, but I assume that 2000s-present at least.
Makes included: Aprilia, Ather, Avan Motors, Avanturaa Choppers, Bajaj, Benelli, BMW, Cleveland CycleWerks, Ducati, F.B Mondial, Harley-Davidson, Hero, Hero Electric, Hero Honda, Honda, Hyosung, Indian, Jawa, Kawasaki, KTM, LML, Mahindra, Moto Guzzi, MV Agusta, Norton, Okinawa, Royal Enfield, Suzuki, SWM, Tork, Triumph, TVS, UM, Vespa, Yamaha, Yo.
In August 2015 I learned to scrap data from websites, the India car database was my first scraping project, as opposite of the manually-written European car databases that I am making since 2003. Lots of people were asking me if I have an indian car database, that times nobody was selling.
Source of data: Carwale.com and Bikewale.com.
Unfortunately India do not have a quality car website that display years produced for every car version (as I personally wanted). Database contains production years indicated only for discontinued models and only if there are multiple generations of same model. Not good in my opinion but there was no other alternative. HAD TO DO IT in this way. If you know other better source of data please tell me.
One of the first people who bought the Indian car database in August 2015 wanted a 2-wheeler database too. Initially I said NO because I was personally interested only in cars, but once I mastered my data scraping skills, I decided to offer web scraping services for individual customers. In January 2016 another customer wanted to scrap bike specs from Bikewale.com, this was the moment I created bike database.
Carwale removed discontinued models in March 2017, so my database contains valuable data that you cannot get yourself from Carwale anymore.
In April 2019 I made new scraper for Bikewale, adding individual versions in the Indian bikes database (in the previous editions, if a bike had multiple versions, database contained only base version).
Future updates will only add new cars, no updates for existing records
Indian car database launched for sale in August 2015, and after doing ~8 sales and 3 updates, I decided in May 2016 to offer monthly updates on 1st day of month. Each update takes 8 fours of data scraping + 2 hours post-scraping manual work. I scrap every make to get models, every model URL to get versions, every version URL to get specifications, remove all data from previous update and put new data. See list of updates.
In February 2017 Carwale decided to hide discontinued models from website. I kept updating database by scraping for new versions URL, add them into database, compare the unique ID, delete duplicates, then scrap all versions URL to get specifications including current price and last recorded price that indicate whenever a car is in production or discontinued.
In November 2017 Carwale removed unique ID from each URL, causing all URLs to be changed and redirected, in 10 cases the old version URLs redirect to 404 Not Found, in 197 cases the old version URLs is redirecting to a different car that it should (multiple old URLs redirect to same new URL), making me impossible to re-scrap old cars for updates without risking loss of model versions. I will continue to update database monthly by adding new cars, but without updates for old cars, database quality will go down over time, when a model is discontinued and replaced with a new model with same name, discontinued model name will not be updated with production years to differentiate from current production model, specifications including prices will also not be updated for old cars, etc.
Car data fields included
Naming: ID, Make, Model, Version, Status 100%.
Price: Production cars 30.65%, Discontinued cars (last recorded price) 69.35%.
Body: Length (mm) 99.70%, Width (mm) 99.70%, Height (mm) 99.67%, Wheelbase (mm) 99.54%, Ground clearance (mm) 59.27%, Kerb weight (kg) 61.11%, Bootspace (litres) 48.75%, No of doors 99.54%, Seating capacity 99.57%, No of seating rows 72.36%.
Engine: Displacement (cc) 99.21%, Max power (bhp) 99.43%, Max power (rpm) 99.21%, Max torque (Nm) 99.43%, Max torque (rpm) 99.73%, Transmission type 99.78%, No of gears 97.15%, Drivetrain 86.74%, Engine type 87.39%, Cylinders 72.17%, Bore x Stroke (mm) 13.18%, Compression ratio 9.16%, Valves per cylinder 69.73%, Dual clutch 60.92%, Sport mode 61.74%, Fuel system 31.33%, Turbocharger/supercharger 50.60%, Turbocharge type 50.16%, Driving modes 51.01%, Manual shifting for automatic 50.03%, Engine start-stop 49.78%.
Fuel: Fuel type 99.67%, Alternate fuel type 63.53%, Mileage (kmpl) 87.31%, Fuel tank capacity (litres) 96.30%.
Drivetrain: Suspension front 91.30%, Suspension rear 90.68%, Brake type front 98.48%, Brake type rear 98.23%, Steering type 50.87%, Turning radius (m) 80.14%, Wheels 50.95%, Spare wheel 68.53%, Tyres front 71.28%, Tyres rear 71.22%.
Others: Colour names 93.89%, Colour RGB 93.89%, Image URL 87.85% (you can use Tab Save extension for Chrome to download image files).
Features: 131 columns, see SAMPLE file, I do not list them here to overload the page with too much text.
Bonus: Car class, Body style 100.00% (added manually from my personal experience in new+old cars package, NOT scraped from website).
Percentages as 1 January 2017 (3680 cars).
Note: Car class and Body style are NOT available in new cars only package, because would take a lot of time to re-add them for 1200 cars every monthly update, so I add them only in new+old cars package for the cars added each month (about 20-50 cars per monthly update). Cars launched after June 2019 have some engine columns merged into 1 column due to changes in Carwale website.
Trucks and buses databases
I made them in September 2016 for a customer who never paid for the job he requested me to do. First person to purchase them came in January 2018 so I updated them for first time, future updates will be done at request.
As August 2018 I noticed that CarDekho made each version URL to redirect to main model URL, effectively making me impossible to scrap specifications of other versions than base version. Probably I will never update them again. Only 3 people purchased them so does not worth my effort to keep maintaining.
Buy FULL database:
Bikes and cars DEALERS database
Several people told me to scrap dealer information from Carwale and Bikewale. Here is the database containing dealer name, street address, email and phone number.
Buy FULL database + FREE updates when someone ask an update:
DO NOT ask me about car owners database!
A number of people have trouble understanding what I am selling or don’t bother to check samples of what I am selling (database of car MODELS with specifications and features), they ask me straight to sell them a database of car OWNERS with registration number, name, address, profession, phone, email, insurance expiry date, etc. Strangely I do not get such questions from Europe and America, but ONLY from India.
I DO NOT have registration / owners data, and the companies who does have (car dealerships and insurance companies) must follow personal data protection laws and DO NOT share data of their customers to third-parties.
If you do a google search “car owners database” you see at least 10 sites selling personal data illegally, all them from India, the only country in the world where people have no respect for personal data and email/SMS spamming is national sport. But I am skeptical about how real is this data, how it was obtained and how updated it is, considering that vehicle registration authority do not keep records of emails and phone numbers, but only of residence address. Furthermore, most drivers do not even use email! They may be databases of users registered in some website that was hacked or a corrupted employee try to make extra money by selling their internal database.
The only way to get car registration data legally and up-to-date is to apply in vahan.nic.in “The Ministry has decided to offer the services to different stake holders like Banks, Insurance Companies etc on payment basis.” If you do apply, please inform me what are their prices.
News: in April 2019 been contacted by a strange person claiming to have database of customers from all new car dealerships across India, 1.5 million records. He was speaking very bad english mixed with hindi and we didn’t understood each other, when I asked for a SAMPLE with few rows to see what details he have about each car buyer/owner, he said “bulk orders only” and asked me to do bank transfer. How I can pay without knowing what I get?
I have his phone and email, he is living in Mumbai. I am not from India so I cannot meet him. Is there anyone from Mumbai who can help me and meet him in person, check database before paying, then share database with me (I can pay you half of the price of database) then I will sell it to other people interested?