These are the screenshots for 2017. It covers 879 hospitals in Seed #2. A quick note of interest is that more and more sites are using fixed top navigations that stay in place as the user scroll down the page. Because of this, some screenshots appear to have the top navigation bar duplicated.
In this post I explore the site structure of 474 sites from Seed 1 and look at things such as the most common folder names, number of branches from its root, and the number of links on each.
This is the last post in a series that analyzes images on hospital’s home pages, and it makes use of face detection with IBM’s Watson Visual Recognition library. The idea is to look the largest image on each home page and see how many of them contain faces. The visual recognition library not only can detect a face, but it can also determine if the face it’s male or female.
Continuing the analysis of images on hospital site’s home pages, this post takes a look at the type of images that are presented. It does so by making use of IBM’s Watson Visual Recognition library, which can identify objects in an image and make classifications accordingly. Each one of the images referred to in the previous post was used to come up with the following classifications.
This is the first on a series of posts that analyzes the images on the home pages of hospital sites. This post in particular takes a look at all the images that make up a web page and finds the largest on each as determined by its size in kilobytes. Using the list on Seed #2, I created a script that accomplishes this task.
One of the tasks done by the scripts I use to explore hospital websites is that of taking screenshots so I can compare them historically through time. By doing this, one can quickly observe the tendencies on website designs and the differences between them. The gallery below shows these screenshots.
I’ve collected a new list of hospitals and their respective URLs to make this ‘Seed #2’ This list is created on June 2016 and it contains 946 hospitals.
Here’s the full list:
The following analysis was done using the seed #1 dataset and it determines the most commonly used HTML tags. This dataset contains the home page HTML of 1,657 hospitals. By parsing the source code of these sites, I was able to determine the most frequently used and the site with most tags on its home page.
First, these are the top 10 hospitals with the most tags. The average number of tags on pages in this dataset is 390 tags, while the median is 320.
I've been working on the healthcare space for more than 10 years and I am passionate about the technology that runs it.