Some people eat, sleep and chew gum, I do genealogy and write...

Thursday, March 19, 2015

Digitizing Genealogy -- Flatbed Scanner

CanoScan LiDE40 is Canon's A4 USB CIS flatbed image scanner.By Qurren (Qurren's file) [GFDL ( or CC-BY-SA-3.0 (], via Wikimedia Commons
The term "scanner" covers a huge assortment of electronic devices for digitizing all sorts of original documents. In fact, the term has now been extended to 3D scanners that digitize three dimensional objects for 3D printers or additive manufacturing. Wikipedia has ten different types of devices listed on its disambiguation page for the term "scanner."  Some of the devices, known as book scanners, actually incorporate high resolution cameras into the scanning process. The devices using the label "scanner" can cost under $100 or cost many tens of thousands of dollars. As a genealogist, unless you intend to get into the commercial business of making scanned images as a service, you will probably settle for one of the many models of "flatbed" scanners such as the one illustrated above. This type of device is sometimes called an "image scanner" to differentiate it from other types of scanners.

A flatbed scanner has a flat glass plate, similar to a copy machine, and the same type of copy machine light source. Some of the flatbed scanners today use a charge-coupled device (CCD) or a contact image sensor (CIS) as the image sensor. The light source for flatbed scanners is usually a xenon, LED or cold cathode fluorescent type. An alternate type of scanner uses a contact image sensor (CIS) scanning consists of a moving set of red, green and blue LEDs strobed for illumination and a connected monochromatic photodiode array under a rod lens array for light collection. See Wikipedia: Image scanner.

The main issue in using a flatbed scanner is the time it takes to make one digital copy of a document or photograph. It can become extremely tedious to raise and lower the lid and replace the documents or photos to be copied. The image quality is usually very good, depending on the software supplied by the manufacturer. Most of the flatbed scanners available today, scan in color, black and white an gray scale. The newer models have all moved well beyond the needs of high quality archival standard scanning. I will be talking a lot more about scanner resolution in future posts.

You could spend a considerable amount of time comparing flatbed scanner models, or you could simply walk into the nearest store selling the devices and buy the first one that caught your eye. You would probably end up with a perfectly adequate scanner. But I suggest a mid-road approach. Spend some time reading reviews and thinking about the following considerations:

  • How many documents do you expect to scan?
  • How long will you spend scanning?
  • Do you intend to use the scanner for purposes other than genealogy?
  • Are you concerned about archive quality or do you want quick and dirty?

As you think through the physical process of scanning, you must also consider what you are going to do with the computer files created by the scanner. I will also be posting more information about understanding and using the different file formats. It does no good to purchase a scanner and then fail to learn how to transfer the files and use them as media attached to your genealogical database whether online or on your own computer.

Depending on the amount of paper you need to scan and the time you have to spend doing the scanning, you may wish to look into the possibility of purchasing a sheet-fed scanner. This is also a topic for another post. As you can probably tell, scanning documents for use in a genealogical context can be a somewhat technical area. I only have to go online and look at the quality of the images uploaded to family tree websites to know that many researchers lack a basic understanding of the way to scan an image or how to handle the digitized image file. Of course, the scanned image can be no better than the original, but sometimes there is a high quality scanned image already available of the ancestor and someone has uploaded a very, very poor quality copy of exactly the same image. There are a lot of considerations in making images for genealogical purposes besides those involved in the technology of the various scanning devices.

Here are two images of exactly the same photograph from the FamilySearch Memories section. Both images are copies from an original photograph. A high quality scan of the original is available on the same page as these two photos.

This is apparently the photo from which the two copies were made.

There are a lot more considerations about image quality and choice than just the technology involved in making scanned images. 

Here is a list of the previous posts in this series:

1 comment:

  1. This is one of my fears about digital critical editions. The new digital ECM may be great now, but will it be great in ten years? Maybe, but how do we know? We can’t, because we don’t know the future. There is always talk of future-proofing our digital work. But let us be honest: that is a myth. When I worked on the CBGM, there were parts of the software for the Catholic Letters that only ran on Mac OS 9. What happens when the computer running that defunct operating system dies? cap digitizing