Powered by Zoomin Software. For more details please contactZoomin

The Semaphore Fact Extraction Framework (FACTS)

A methodology (Extractors)

  • Last Updated: May 29, 2026
  • 1 minute read
    • Semaphore
    • Documentation

Extractor Methodology

This is where the real art of fact extraction takes place!

At this point, you should know:

  1. How the content breaks down into distinctive document types.
  2. How to identify those document types.
  3. Which document types have which facts.
  4. How your facts should be structured.

All that remains is to write the extractors.

How does CS “see” your content?

The first crucial step is this: after processing your content through CS and examining it in CAT / CSTI, carefully note how your fact now appears.

Tip: Use how CS tokenizes your content to determine if the fact can be found through its atomic structure. Is it, in its entirety (as a single fact if simple, or multiple facts if complex), matchable against some pattern of concept, taxonomy, wildcard, or entity facts with no other anchors?

Extraction Decision Flow

  1. Is the fact matchable as described above?
    1. Yes:
    2. No:
TitleResults for “How to create a CRG?”Also Available inAlert