Entries in Data (48)
Beautiful data visualizations of some very scary data!
An anonymous hacker under the pseudonym of “Carna Botnet” has posted a comprehensive Internet Census 2012 report of over 460 million internet connected devices that responded to PING requests or were found to have open ports. He was able to create a botnet using over 30,000 Internet devices that had remote administration available using the Telenet and still had the factory installed standard passwords. He found several hundred thousand open devices, but didn’t need that many.
Abstract: While playing around with the Nmap Scripting Engine (NSE) we discovered an amazing number of open embedded devices on the Internet. Many of them are based on Linux and allow login to standard BusyBox with empty or default credentials. We used these devices to build a distributed port scanner to scan all IPv4 addresses. These scans include service probes for the most common ports, ICMP ping, reverse DNS and SYN scans. We analyzed some of the data to get an estimation of the IP address usage.
All data gathered during our research is released into the public domain for further study.
The visualizations he was able to create using the gathered data are fantastic. Check out the IMAGES page of the report for beautiful, high-resolution images.
The map visualization above shows the geolocation data of all 460 million devices that responded to the queries from the botnet, clustered around population centers as you might expect. The animated GIF below shows the geolocated devices that responded during the course of a day, showing that many devices are turned off overnight but many more are just left on constantly.
My favorite visualization from the data is the Hilbert Map, which uses the a 2-dimensional Hilbert Curve to map out the continuous sequence of IP4 addresses into a square area, and then color-codes the address blocks that responded to a PING request. There’s even a cool zoomable viewer of the Hilbert Map that lets you drill into the details.
This form of mapping was inspired by the xkcd Map of the Internet, which shows the Internet addresses that were distributed to major corporations in the 1990s before the Regional Internet Registries took over the allocation.
On February 12, 2013, President Obama gave his annual State of the Union speech, but this year it was “enhanced” with charts, data visualizations and additional information in a sidebar of the display (full video above). The team at NPR (@nprapps) published a great review a few days later called Chart Check: Did Obama’s Graphics ‘Enhance’ His Big Speech? They also included opinions from a couple of the best data visualization experts Stephen Few (PerceptualEdge.com) and Nathan Yau (FlowingData.com)
I will say that I think the use of the charts was very successful and does make the President’s speech more effective. By their very nature, the charts imply that the President has data behind his message, and that can be a very persuasive, compelling tactic. You’ll also notice the wide array of chart styles so they are each memorable for different topics in the speech. We didn’t get 27 bar charts, because the audience wouldn’t have been able to tell them apart after the speech. We got different data visualizations for different types of data. Stacked bars, line charts, area charts and grids colored icons.
The key frame from the video (above) is what first caught my eye. This is the still image shown before you start playing the video. I was instantly concerned about all of the charts after seeing this one about Deficit Reduction. It may be because I work with data visualizations every day, but I could see instantly that the chart was wrong. How can the $500 Billion part of the stacked bar be larger than the $600 Billion part? That can’t be right! Seriously, I look at this stuff all the time, and this jumps out at me in a big way. Welcome to my life.
Here’s the full chart:
One of the biggest risks with data visualizations and infographics is what I call the Risk of Negative Impression. The idea is that while good visuals can quickly leave a good impression with your audience, if your visualizations are incorrect or flawed, you can leave a bad impression just as quickly and effectively. The audience thinks, “if they messed up this chart, why should I trust anything else they have to say?” Then they feel like they have to carefully scrutinize every chart, and you have lost all credibility with your audience.
The NPR piece does a great job of breaking down 14 of the 27 charts from the speech, and even created some corrected charts to show a more realistic real visualization of the data. I highly recommend you read the whole article on the NPR site.
I’ll mention one more example. By visualizing data, the designer adds context and bias to the information. The best designers try to minimize the bias, but even the choices about what data to include in the visualization help frame the audience’s understanding. One common way to skew perception of the data is to change the scale of one or both of the axes. A number of slides from the speech don’t start at zero, so the chart exaggerates the changes. This is a common practice when charting stock values so the audience can see the small changes, but they often make the changes feel much bigger than they actually are. That was the intent with this chart that only shows the range of values from 400,000 to 550,000.
Stephen Few redesigned this corrected chart for the NPR piece, and I think he nailed it. By expanding the y-axis to start at 0, he puts the changes over time into a different perspective for the audience.
The White House has published all of the 107 slides as a scribd.com presentation:
Found on the White House blog
The LEGO Minifigs infographic is a history of LEGO Minifigs (Mini Figures). The infographic designed by Hot Butter Studio for visual.ly includes the dates when characters were introduced or when a certain feature was added. An added bonus to the infographic is the information about female LEGO minifigs and the FRIENDS line that was designed primarily for girls.
This is a fun infographic with some interesting factoids that will keep readers engaged with the design. LEGOs are cool right now, so the timing for this design is good. It’s also a topic that has not been well covered in infographics, so it stands out as unique information.
I wish a few of the data points were visualized like number of minifigs sold each year or space events along an actual timeline visual.
Thanks to Karyn for sending in the link!
If you have any thoughts of attending the 2013 Strata Conference in Santa Clara, CA, the discount code “COOL20” will cut 20% off the registration price for readers of Cool Infographics! Big data, visualization, privacy, science and business! What’s not to love?!?
This is an expensive conference, so the 20% discount is a BIG deal; saving hundreds of dollars! The 2013 conference will run from February 26-28, 2013 in Santa Clara, CA. If you can register early…
Early REGISTRATION Prices END THURSDAY, January 17TH!
Strata Conference 2013 - Feb 26-28, 2013 in Santa Clara, CA
Join the best minds in data for the latest in the data revolution: trends, tools, new practices, careers, and culture. Bringing together decision-makers, practitioners, and leading vendors from enterprise and the web, Strata provides three days of training, breakout sessions, and plenary discussions, along with an expo hall showcasing the new data ecosystem.
Check out the video including clips from some of the prior presentations!
The new Top 20 Field Service Management Software infographic from Capterra continues their series of “Top 20 Most Popular” infographics for different software categories. I’ve previously posted about the Top 20 Marketing Automation Software Solutions and the Top 20 Medical Records Software Solutions.
This design shows a little more detail behind how they score and rank the different software options.
Field Service Management software serves companies that send technicians or other employees into the field by helping them automate scheduling and dispatching. Below is a look at the most popular options as measured by a combination of their total number of clients, active users and online presence. In order to see a comprehensive list, please visit our Field Service Management Software Directory.
Capterra developed a popularity index consisting of three components to rank the field service management providers: number of customers (40%), number of end users (40%), and online presence (20%). The online presence metrics included traffic estimates from Compete.com, as well as the company’s number of LinkedIn followers, Facebook page likes, Twitter followers, and Klout scores – each weighted equally to comprise 20% of the vendor’s overall score.
The stacked bars to showing the three separate score metrics works nicely, and is a clear visual of the descending total scores. The Rank numbers are in colored boxes that match the primary brand colors of each software company, but the readers don’t know that. Without seeing the actual company logos, the colors just look random and create some unnecessary visual noise.
By not showing the company logos throughout the design, it’s harder for the reader to see where a particular company appears in the different sections. A visual logo would be easy to recognize at a glance, but in just text, the reader has to read every entry to try to find a match.
Nice, clear Call-To-Action at the end of the design, so the target audience readers know what they should with this information. The footer should also include a copyright statement and the URL link to the infographic landing page on Capterra’s site so readers can find the original, full-size version.
Old age meets new age. Defending the Death Star with A little Data Center Design takes a concept that people can relate to (Star Wars) and applies modern day data center technologies. Brought to you by Data Center Reports.
When Aristotle first explained the concept of “hamartia” in Poetics, he probably didn’t know just how many hero and villain stories would be driven forward by fatal flaws in character, judgment or planning. The Star Wars saga is an epic tale that is powered by fatal flaws – yet we couldn’t help but wonder how things might have turned out had the Empire used a little of today’s security insights to better protect the superweapon better known as the Death Star.
This infographic design tells a good story with illustrations. No big data sets to visualize, but a simple story that’s incredibly easy for the reader to understand. The Star Wars comparison puts the different technologies into context for the reader.
The footer should include a copyright and the URL link to the original infographic so readers can easily find the original high-resolution version.
Found on Data Center Reports
MHPM Project Managers has taken a different approach with the release their first CSR (Corporate Sustainability Report). Instead of the normal text report that other companies release, MHPM created an infographic poster with all of their sustainability information. It serves as a great example to their clients of how even CSR data can be designed in an engaging way.
MHPM Project Leaders passed a milestone towards integrating sustainable practices into its business operations today, with the release of its first annual corporate sustainability report.
MHPM’s corporate sustainability report evaluates MHPM’s impact on the environment and the community, its transportation practices and workplace policies. The findings reveal areas for improvement and provide a benchmark against which to measure future performance.
Designed by InfoNewt, this poster was printed at 24” x 36” by MHPM and is also available online on the Corporate Sustainability Report section of their website. The front side of the post visualizes all of the data, and puts most of it into context by comparing to prior year results. The back side of the design includes all of the required text, which keeps the front side less cluttered and easy to read. The entire design is inspired by the Global Reporting Initiative’s guidelines.
This is a fantastic use of infographic design principles! The full size, high-resolution poster is available as a PDF download from the CSR page.
Did you know that dynamic content can improve conversions by 50%? Or that emails sent on a Saturday get the highest number of click-throughs? Were you aware that when influencers share your content on social it can result in a dramatic increase in traffic and conversions?
That’s why we produce a chart every week that modern marketers can easily learn from and use. And it’s why we’ve gone back with our friends at JESS3 to reproduce the most indispensable data points, coming up with 40 understandable, actionable charts. Wide-ranging in scope, the charts hit the most important topics hitting marketers today – from social media to email.
With that in mind, we offer the charts in two forms. You can head over to the custom-made website and explore the charts by topic, getting to the data that matters most to you. Or you can download all 40 charts in a free eBook. It’s well worth keeping near your desk.
Even those each of these is only exploring one data set, these are actually good infographic designs that follow some important design rules.
First, each one tells one story really well, and the Key Message is easy to understand. Most readers of infographics are only looking at a design for less than 5 seconds, and a good infographic design will successfully communicate their primary message in that short time.
Second, each design is easy to share. Each of the 40 designs has it’s own landing page on Eloqua’s site, and their own dedicated social media sharing buttons. This is fantastic for SEO, and much easier to utilize the information for users. So, if you’re interested in a data set about email to customers, you can share that information with colleagues without also sharing a data set about Pinterest.
You can read more about the series on the Eloqua blog.
A couple things I do think are missing from each design that would help Eloqua in the future.
- Since these are being shared individually, the URL back to the original on Eloqua site should be included in the image.
- A copyright statement to clarify usage rights. Does everyone have permission from Eloqua to include these charts and data in their own presentations?
- Much of the data is proprietary to Eloqua, so the raw data behind the designs isn’t available to the public. It would be great for Eloqua to make the data behind each chart available publicly (like in a Google Docs spreadsheet) since they are making the data public in the charts.
- A conclusion. I think the subtle implication is that companies should call Eloqua for help using this information, but a short statement suggesting what action companies should take based on each data set would close each design nicely.
Thanks to Jarred for sending in the link!
If you have seen the comedy show The Big Bang Theory, then you know Sheldon…. If you haven’t seen it, go watch it and get back to us. Have you ever noticed his wardrobe? The Sheldon’s T-Shirts infographic from fibers.com tells you his favorite shirts, how often he wears his shirts, and even what colors he wears the most!
Graphs, Charts and illustrated T-Shirts with correlating sizes to wearing frequency - would there be any other way to visualize Sheldon Cooper’s t-shirt collection from The Big Bang Theory? We think not.
Big thanks to Sheldon’s Shirts where we got most of the data for this graphic. You can find a lot of Sheldon’s Shirts for purchase on the following websites:
This is just a fun infographic that uses some data visualization to appeal to fans of the show. Good design using publicly available data that has been complied in an engaging way.
The charts actually very well done. Charts are color-coded to match the data. Icons are included on the bars or in the pie slices, so no chart legends are needed. This makes the data faster and easier to understand.
Found on Fibers.com