Getting Emotional About Microstock – Some Thoughts on Supply & Demand
Posted on November 30th, 2008 in Uncategorized | 7 Comments »
Matt’s series on the Long Tail and microstock got me thinking and I decided to do some quick searches on human emotions on Dreamstime to see if anything interesting showed up in the data.
I did searches for 5 keywords (angry, bored, happy, sad, scared), ordered the results by download and captured views and downloads for the top 10 images. I also recorded the number of results returned. While this is completely non-rigorous analysis, there are some interesting observations.
If anyone has thoughts on how to better interpret this data, please don’t be shy. Also, if anyone would like a spreadsheet of the data, let me know and I’ll send it to you.
The things that jumped out at me from this exercise (in no particular order)
- There are way more images returned from a search for ‘happy’ than for all of the other terms I looked at. While this does make sense from a marketing perspective, the skew seems out of whack when compared to the relative download counts.
- Conversion rates for ‘bored’ and ‘scared’ are very high and image counts are very low. To only have 3,000 images for a generic emotion like ‘bored’ in a 4 million plus image universe is surprising to me.
- Don’t ignore emotional variations when conducting shoots. Collectively, they account for a significant number of downloads and there are fewer results in the pool, so in theory standing out should be easier. Long live the Long Tail!
- Search terms that have high conversion & relatively low result counts may indicate areas of opportunity but (much) more testing is needed before you can draw any conclusions.
- PIcNiche is a really cool site and they basically automate what I’ve done here using Fotolia data. I think it’s well worth a visit as you plan your next shoot. (Thanks for the pointer to it Lee.)
Image Count (number of results returned when searching for a term)

The number of images that are returned when you search for ‘happy’ versus the other keywords is staggering. There are over 15 times as many images returned for ‘happy’ versus ‘sad’. While this lines up with intuition (happy people are better for marketing) the spread took me by surprise.
Average Downloads & Views for Top 10 Images
To get this data, I just sorted the results for each search by download count, recorded the download and view numbers for the top 10 results and then calculated the average. There are also some interesting things to note in the individual image data which I discuss in the next section.

One thing to note here is that even though there are over 15 times as many images results for ‘happy’ as there are for ‘sad’, the difference between average DLs for the top 10 images is only 3x. While ‘happy’ clearly drives more downloads than the other emotions, if you incorporate them into your shoots, there’s an opportunity to capture incremental downloads.

The results for ‘sad’ were skewed by two images that had an extremely high number of views. When I removed those two data points, the average number of views for sad was: 1,900 which is more in line with the other results. Those two images were in numerous collections and my guess is they were featured on the home page or as editors’ picks at some point.
Average Conversion Rate
For this post, I defined conversion rate at the ratio of Downloads to Views. To some degree, this metric is a measure of quality/keyword relevance. If something is viewed a lot, but not downloaded that much, it either means that it is a low quality image (people viewed it but decided not to buy it) or it could mean that the image is not relevant to a high volume search keyword.

Again, the results for ‘sad’ are skewed by the two high view images. If I remove them, the average conversion for ‘sad’ rises to 9.6%. It’s interesting to look at the PicNiche data for the same search terms. PicNiche rocks in my opinion and I’m looking forward to seeing how it evolves. I think they are using Fotolia data and have access to the total downloads and views for a given term as opposed to the sample approach that I took.

If you superimpose Average Downloads per Image (Left axis) and Average Conversion Rate for the search term (Right axis) you can see that even though all terms apart ‘happy’ have similar download numbers, ‘bored’ and ‘scared’ have significantly higher conversion rates than the others. There are also far fewer of these images than in the other two cases. Clearly more analysis is required, but the hypothesis that high conversion & low image count = opportunity is probably worth exploring further.

Per Image Detail for the Top 10 Images (For the Data Geeks out there…)
Perhaps the most interesting observation here is that there isn’t a ton of variation between the top ranked image and the 10th ranked image in terms of download counts. Another thing that jumped out was how close to each other the download counts for the searches other than ‘happy’ were.

7 Responses
Now that’s what I’m talking about! Great post and great analysis! I love it. So there are opportunities to take the same model, same situation, turn the emotions and get more submittable accepted images with sales that will do fairly well and continue your tail’s growth with less competition. Love it!
Thanks for the kudos, Matt. It was a pretty interesting exercise and it does seem as though shooting offbeat variants at low incremental cost makes sense.
Heya Rahul
Thanks for the mention, I’m loving the positive feedback I keep hearing about picNiche
Glad you find it useful
I’m also very happy to see you’re finding a correlation between the manual approach and my results (this is pretty much what I was doing before I built a public version of the tool
I did run some tests with the Dreamstime (and the rest of the big 6) data in alpha which showed similar results, but dreamstime declined my request to include them in online stats
Very much looking forward to checking out your lookstat tool, it looks pretty awesome on Lee’s site
Can’t wait for your queue to get processed (I had similar server issues with queries sent to fotolia api a while back
Stick at it mate, it’s worth the effort
Thanks muchly, not just for your tool, but for your efforts too, I’m right here with you
Rob
x
Hi Rob,
Thanks for the good wishes. You’re doing great work and I’m happy to acknowledge that. I’m really interested in the overlap between a contributors strengths and the market’s needs and I’d love to chat sometime about what you’re learning and what you’ve got planned for the site.
We’re grinding through our queue as quickly as we can and I look forward to getting your feedback once you get in there and start poking around.
Cheers,
Rahul
[...] hilfreiche Statistiken erstellen, wie der LookStat-Gründer Rahul Pathak in seinem Blog beweist. Hier vergleicht er, von welchen Emotionen am meisten Motive bei den Bildagenturen vorhanden sind. Es [...]
beware looking at download numbers at DT when they are over 100, as contest images automatically get 100 downloads added to their listing so they go to level 5. This could really skew your data, especially for a keyword that isn’t downloaded much but did figure in a contest at one time.
Interesting analysis though, and certainly a good idea with a model that can do it to run through a list of emotions.
–=Tom
Hi there – thanks a ton for that nugget of information. You're absolutely right about the skew and it could have a big impact in low sample size pockets. I'm guessing the analysis is directionally correct – making sure it was statistically valid will take some more doing.
Thanks for taking the time to write in.
Rahul