STAR Criteria Recording Knowledge
Searchable. Transferable. Archiveable. These are the three attributes that are indispensable in an information storage system.
Searchable: a) The ability to do automated searches for phrases within texts. <-Increasingly this includes the ability to search for similar pictures as well). b) Able to be found in general search results (i.e. at the lowest common denominator of search: (in the US in 2023 this is a Google search).
Transferable: Able to transfer between formats (e.g. from .docx to PDF, etc.)
Archivable: To be able to archive something entails having a stable reference to that item, and usually means having a digest of that item to aid in search-ability.
Redundant: Resistant to loss through backups, storage in multiple places, etc.
The STAR system attempts to identify best practices and provide an objective scoring system on which to rate various forms of storage.
Table of Contents:
§1 Searchable, Transferable, Archivable, and Redundant
Background
STAR was born out of my frustration with college textbooks in their various forms.
Most of my professors assigned physical textbooks, on which I was unable to perform computer-aided searches unless their contents were also available in a publicly available repository such as Google Books, which happened rarely. Some professors assigned electronic textbooks (either ones of their own devising, or from distributors such as McGraw Hill), but those that I worked with had horrible search support. They also worked on the basis of timed-subscriptions which made them un-archivable from a consumer viewpoint.
HTML sources such as Wikipedia were searchable and transferable, but their archivability was hampered by the lack of a standardized versioning system, in an environment where data changes constantly.
In 2013, I added "Redundant" to this list, with the rise of Dropbox and other services, the thought that your information resides in any single place became worrying. However, the weak point of these systems is still archivability: full, human-readable journaling is even more important in collaborative environments where people have 'delete' access.
Example Score Matrix (Simple "Yes" "No"):
FAIR criteria: "Findable, Accessible, Interoperable, and Reusable." Originally published in "FAIR Guiding Principles for scientific data management and stewardship" in Scientific Data (2016). Learn more about preparing FAIR data for reuse and reproducibility. (Source: Glossary of data management terms, Cornell University)
STAR: FAIR:
Searchable Findable
Transferable Accessible
Archivable Interoperable
Redundant Reusable
(Source: "FAIR Guiding Principles for scientific data management and stewardship" in Scientific Data (2016))
The FAIR Guiding Principles
To be Findable:
F1. (meta)data are assigned a globally unique and persistent identifier
F2. data are described with rich metadata (defined by R1 below)
F3. metadata clearly and explicitly include the identifier of the data it describes
F4. (meta)data are registered or indexed in a searchable resource
To be Accessible:
A1. (meta)data are retrievable by their identifier using a standardized communications protocol
A1.1 the protocol is open, free, and universally implementable
A1.2 the protocol allows for an authentication and authorization procedure, where necessary
A2. metadata are accessible, even when the data are no longer available
To be Interoperable:
I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation.
I2. (meta)data use vocabularies that follow FAIR principles
I3. (meta)data include qualified references to other (meta)data
To be Reusable:
R1. meta(data) are richly described with a plurality of accurate and relevant attributes
R1.1. (meta)data are released with a clear and accessible data usage license
R1.2. (meta)data are associated with detailed provenance
R1.3. (meta)data meet domain-relevant community standards
§2 Egregious Bacronyms.
2.1 STAR for Textbooks: Searchable, Transformable, Annotatable, and Readable
Searchable: The ability to do automated searches for phrases within texts is a larger topic than I can do justice to at the moment.
Transformable: Able to cut, copy, paste; apply emphasis (bold, italic, etc.), and otherwise manipulate the text to aid in readability and recall.
Annotatable: Ability to add comments, diagrams, extra titles, hyperlink, and otherwise add information to the document in ways that go beyond transforming above.
Readable: There's a STAR critera for readability all ready for you: see below.
2.2 STAR for Readability: Signposted, Titled, Arranged (Organized), and Relational (Linked)
Signposting is not just about phrases: It is about appropriate structural cues which include titling, emphasis, highlighting, font, and layout to aid fast scanning of a document for content.
Titled: Proper use of titling allows fast scanning and allows the reader to get to exactly the place they need to be to obtain the information they want.
A brilliant example of titling can be found in the imperative language that can be found in the Handbook of Principles of Organizational Behavior (check out the table of contents).
Wikipedia articles also frequently contain great examples of this.
Organization means logical flows and discrete elements, with a minimum number of elements that loop back on themselves (unless the work is primarily aesthetic rather than informational).
Relational: Hyperlinking is the single most powerful tool that has arisen from the information age. With it, one can link to the specific definition that they want to get across, a disambiguation page, a movie; almost anything. They have an unmatched ability to bring context to a situation, when used in accordance to a logical and intuitive set of guidelines.