These are the screenshots for 2017. It covers 879 hospitals in Seed #2. A quick note of interest is that more and more sites are using fixed top navigations that stay in place as the user scroll down the page. Because of this, some screenshots appear to have the top navigation bar duplicated.
0 Comments
In this post I explore the site structure of 474 sites from Seed 1 and look at things such as the most common folder names, number of branches from its root, and the number of links on each.
This is the last post in a series that analyzes images on hospital’s home pages, and it makes use of face detection with IBM’s Watson Visual Recognition library. The idea is to look the largest image on each home page and see how many of them contain faces. The visual recognition library not only can detect a face, but it can also determine if the face it’s male or female.
Continuing the analysis of images on hospital site’s home pages, this post takes a look at the type of images that are presented. It does so by making use of IBM’s Watson Visual Recognition library, which can identify objects in an image and make classifications accordingly. Each one of the images referred to in the previous post was used to come up with the following classifications.
This is the first on a series of posts that analyzes the images on the home pages of hospital sites. This post in particular takes a look at all the images that make up a web page and finds the largest on each as determined by its size in kilobytes. Using the list on Seed #2, I created a script that accomplishes this task.
Page resources are those elements that are loaded as soon as the HTML of the page gets loaded into the browser. These could be images, CSS files, JavaScript files, and other type of resources needed for the page to be rendered. Using the list on Seed #2, I studied all the resources called for each one of the hospital sites and analyzed the number of resources needed for each page. The type of calls and percentages as compared to each page is shown below.
One of the tasks done by the scripts I use to explore hospital websites is that of taking screenshots so I can compare them historically through time. By doing this, one can quickly observe the tendencies on website designs and the differences between them. The gallery below shows these screenshots.
I’ve collected a new list of hospitals and their respective URLs to make this ‘Seed #2’ This list is created on June 2016 and it contains 946 hospitals.
Here’s the full list:
Sniffing the user agent is a common practice to detect the browser being used to request a page. This technique can be done in a few ways and it is mostly used to send the user to a mobile specific site. With the advent of responsive design, this practice is sometimes no longer necessary as the site will render correctly on a mobile device. I took the sites on the seed #1 to see which sites would still try to sniff my user agent via JavaScript. Here are the results: 1,276 sites did try to detect the user agent, while 114 did not.
The following analysis was done using the seed #1 dataset and it determines the most commonly used HTML tags. This dataset contains the home page HTML of 1,657 hospitals. By parsing the source code of these sites, I was able to determine the most frequently used and the site with most tags on its home page.
First, these are the top 10 hospitals with the most tags. The average number of tags on pages in this dataset is 390 tags, while the median is 320. |
AuthorI've been working on the healthcare space for more than 10 years and I am passionate about the technology that runs it. Archives
June 2018
Categories |