Want your startup Funded? Dive into the data…

***Special thanks go to Michael Schwartzman who runs his own scraping business, specializing in recovering websites from the Wayback Machine***

I recently came across some startup data from CrunchBase (data complete till Q4 of 2014) which is the most comprehensive online database for startups. Kudos to them for sharing their data. They even have an API that you can use to create startup orient web frameworks (more on that at the end of the post…)

Whenever I find an interesting data set on the web you know how itchy I get… Dying to get insights, squeeze the data in my analysis machines and get out useful results…So let’s get started…

I used Python Pandas for the data importing & data munging and analysis and matplotlib for creating the graphs. All these in a nice iPython –pylab environment for scraping the data. For those interested I can give you my iPython notebooks, which were kindly provided to me by the developer of Wayback Machine Downloader

Moreover, everything here is solely my own opinion. Feel free to discuss with me in the comments section.

Want your startup funded? Choose your region wisely…

 

This is a graph showing the total number of startups that got funded (timeframe: 1900 – now) per region. San Francisco is the king with more than twice the number of funded startups than the next best region. London comes 4th and it is the only European city among the first ten. The competition in San Francisco is huge according to my experience though. So if you want to give it a shot there you’d better be very well prepared before you visit it. Perfect example of this is Gate Keeper a SAAS for  managing contracts with clients

Want an impressive quick calculation? The total number of companies for SF Bay Area is 6600. The population of it is 825,863. Say that we have 2 founders minimum per company. So there is one founder of a funded company in SF Bay area every 825,863/(6660*2) = 62. Gosh, this place is boiling with entrepreneurship….

One founder of a funded company in every 62 people in SF Bay Area!!!

Want your startup funded? Choose your market wisely…

So you want to create a startup. Which market should you choose? Of course the one you are passionate about…. But before you start let’s see what the data suggest. Which are the 10 most funded markets ever?

Market
Total Funding in $ Billion
Biotechnology
52.5
Mobile
46.35
Software
37.61
Clean Technology
26.88
Health Care
20.67
E-Commerce
18.3
Enterprise Software
16.6
Advertising
11.31
Hardware + Software
10.41
Semiconductors
10.07
Finance
9.76

It seems that if you are not a biotechnology company the second best thing you could do is create for mobile or software or, why not, mobile software. If this is the case you might want to test your product fast (hint: google Lean Startup ). Try using Twitter Bootstrap to create a website which is mobile friendly. Add google analytics to your website. If most of your users come from mobile then reiterate your software code to native Android or iOS according to your traffic data. Thats the good thing with data. You always have a sense of direction….

Another point of interest in this table is the huge amount of investment into cleantech. It seems that there is hope for humanity after all. 😉

Below you can see a graph of the first 4 most funded market sectors and their total funding per year for the time period between 1985 – 2013

Again here you can see the impressive 2 peaks of funding in clean energy, one in 2001 and one in 2006. Along with the impressive peaks of software funding in 1998 (just before the dotcom bubble) and again in 2006.

Want your startup supported for longer?
Prefer to iterate faster? Choose your region wisely…

The graph below displays the average funding for all startups and all the years for SF Bay Area, New York, Boston and London, stratified for acquired companies, companies that have closed and those still in operation.

The graph is full of insights again. It is like a behavioral analysis of the biggest startup hubs worldwide. SF Bay Area again is the king with biggest support to companies that are closed. This means that if you get funded in SF, chances are that you will be supported for longer with subsequent rounds even if you fail at the end.

On the other hand New York City seems to have lower average funding for companies that are closed. Probably they filter out the companies that are failing quicker. If you want to iterate faster then search for investors in New York city.

London also seems to have the lower average funding before acquisition which means that acquisitions happen faster and with lower overall funding there. It seems that in London you will get acquired faster and with lower average funding that in the other cities.

 

Bonus No1: Hottest Markets of last 5 years

 

Which market were the 3 hottest (funded most) the last 5 years worldwide?
Take a look at the table below.
 

Year
Market
Total Funding in $ Million
2014
News
253.051
Clean Technology
100.18
Pharmaceutical
43.07
2013
Biotechnology
781.38
Software
583.16
Oil & Gas
500.17
2012
Oil & Gas
1498.51
E-Commerce
1461.17
Biotechnology
937.04
2011
Biotechnology
1890.620
E-Commerce
1157.51
Mobile
1107.24
2010
Biotechnology
2341.80
E-Commerce
1893.10
Mobile
1594.23

Seems that biotechnology got the lion’s share for the 3 out of 5 years. Cleantech also seems to be revived at least for 2014. Mobile was super trendy in 2010 and 2011 and what a surprise, news market seems to be coming first for 2014.

 

Bonus No2: Most Funded Companies Everrrrr!!

Did you know that Cardinal Health along with Verizon Communications are the most funded companies ever? That Facebook is 8th most funded company ever? That Uber ranks 14th in this list? Groupon 24rd? Tesla Motors that has been so accused for heavy funding is below Zynga and Survey Monkey. Take a look at the table below. The most known companies (at least to me) are highlighted with red.

 

Rank
Company Name
Total Funding in $ Billion
1
Cardinal Health
78.79
2
Verizon Communications
30.07
3
Sberbank
5.8
4
Clearwire
5.7
5
First Data Corporation
3.5
6
COFCO
3.2
7
sigmacare
2.6
8
Facebook
2.42
9
Carestream
2.40
10
Quad/Graphics
1.9
11
Flipkart
1.75
12
Charter Communications
1.66
13
Solyndra
1.56
14
Uber
1.50
15
Venari Resources
1.49
16
Fisker Automotive
1.45
17
O3b Networks
1.37
18
Flipkart
1.75
19
Cloudera
1.201
20
Terra-Gen Power
1.20
21
Cape Wind
1.20
22
Twitter
1.16
23
Groupon
1.13
24
Alibaba
1.11
25
Dropbox
1.107
26
Xerox
1.100
27
Sirius XM Radio, Inc.
1.055
28
Wave Broadband
1.050
29
SolarCity
1.04
30
AOL
1.00
31
Blackberry
1.00
32
Bloom Energy
0.98
33
The Exchange
0.94
34
LivingSocial
0.93
35
Better Place
0.92
36
SurveyMonkey
0.90
37
Palantir Technologies
0.89
38
58.com
0.87
39
Zynga
0.86
40
BrightSource Energy
0.84
41
FireEye
0.82
42
Tesla Motors
0.82

 

Conclusive thoughts….

The possibilities are limitless when you analyse data. For example, we could keep on analyzing the time lapse between starting time of companies and first round of funding to check the average to funding time. Stratify that time according to market or type of the company or according to region, etc etc. In general you can ask every possible question to your data and get answers.

And then a general pattern emerges. How about modelling this pattern? Thinking in reverse. Since you can get insights from data, how about creating models with machine learning and feed your models with new data to get predictions? How about scraping more data about these companies from the web to create more data points for more realistic models?

What other questions could we ask the CrunchBase data? Throw me some ideas in the comments below and I will do my best to include the answers in the future….

IF YOU HAVE MORE QUESTIONS – CALL US RIGHT NOW!