https://www.familysearch.org/en/search/full-text/
- The main catalog
- The historical record collections
- The Images collection
- Full-text Search
- Simple Search
- The Books collection
The bridge between our "old school" research standards and these powerful new AI-driven tools is precision. We have to stop treating the search box like an easy solution and start treating it like a surgical tool. FamilySearch’s Full-text tool is essentially a massive search engine for historical documents—deeds, wills, and probate records that were previously "locked" inside digital images. To maintain our research integrity, we must master the technical nuances of how to talk to this specific system. We must also be painfully aware that any search we make only reaches an unknown number of documents that have been processed and made available to the search.
The first step is understanding that the computer is literal. If you search for Sarah Miller in the Full-text box without constraints, the computer will find every "Sarah" and every "Miller" in a land deed or a court record. By using an Exact Phrase search—"Sarah Miller" in quotes—you are imposing a research standard on the machine. You are telling it that the relationship between those two words is non-negotiable. This is the only way to effectively search for a specific name in a sea of handwritten text that has been converted by AI.
But historical records are rarely that clean. In a probate record, your ancestor might be listed as "Sarah, the daughter of John Miller." This is where I find the Proximity operator to be a helpful tool for the modern genealogist. By searching for "Sarah Miller"~10, you are telling the FamilySearch engine that these two words must appear within ten words of each other. But the results of my use of all these Boolean tools are mixed I am not sure that FamilySearch's full-text search understands them because the results are inconsistent.
Using tips for finding names, for example in legal documents where titles or middle names often separate a first and last name. If they are applied consistently, they could bridge the gap between the rigid search and the messy reality of 19th-century legal phrasing.
We also have to contend with the "wildcard" nature of history. Spelling was often more of an art than a science in the past, and even the best AI transcription can misinterpret a letter. Using a Single Character Wildcard, such as Sm?th, allows you to find both Smith and Smyth. In the Full-text search environment, where an "o" might look like an "a" to a computer, these wildcards are essential for ensuring that no record is left behind simply because of a digital misread. Using this tool, may also prove frustrating when the returns start to number in the millions.
Finally, we must learn to curate our results by using Exclusion. If you are searching for a surname that is also a common place name or term, your search can quickly become cluttered. For example, if you are looking for a family named "Rice" in a county known for its agriculture, searching for Rice -paddy or Rice -planting allows you to strip away the irrelevant data and focus on the human beings.
The examples I am giving here illustrate that any AI implementation is only as useful as the ability of the user to maintain control. Meanwhile, give the state of affairs with the collections on the FamilySearch website, relying on Full-text Search for all of your research is lamentably impossible.
No comments:
Post a Comment