Filenaming Structures Summary
Relevant Sections from RFP 96-18
- Section C.10 Filenames and Delivery Directories (pp. C-23 to C-25)
- Section J. Attachment 4 (pp. J-24 to J-35)
Note on Filenames
These structures will be used wherever possible. Additional structures may be added if the existing structures lack certain features required by a given collection.
All item IDs and filenames are lowercase and consist of not more than 8 characters.
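The two rules above can be checked mechanically. The following is a minimal sketch and not part of the RFP. It assumes the 8-character limit applies to the base name before the extension, as examples such as 00050003.tif suggest; the function name `is_valid_name` is hypothetical.

```python
def is_valid_name(name: str, max_base_len: int = 8) -> bool:
    """Check an item ID or filename against the two rules stated above.

    Assumption: the 8-character limit covers only the part before the
    first dot, so "00050003.tif" passes with an 8-character base.
    """
    base, _, _ext = name.partition(".")
    is_lowercase = name == name.lower()
    return bool(base) and len(base) <= max_base_len and is_lowercase


# Two names taken from this summary, then two deliberately invalid ones.
for candidate in ("10204", "0040000n.tif", "0001D.tif", "000500031.tif"):
    print(candidate, is_valid_name(candidate))
```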
Filename Pattern Legend:
| Symbol | Meaning |
| --- | --- |
| c | Control page number |
| p | Print page number |
| f | Feature identifier |
| x | Horizontal grid coordinate, alpha (for segmented images) |
| y | Vertical grid coordinate, numeric (for segmented images) |
- Filename/Directory Structure 1: Unnumbered Documents in Folders
- Typically used: manuscript documents in folders described to the folder level.
- Existing collection using this structure: Margaret Mead Collection
- Item ID: Based on original container and folder number, becomes a directory name.
- Example 1 (4th folder found in container 102): 10204
- Example 2 (10th folder found in container 50): 05010
- Filename Pattern: ccccf
- Based on the sequential position of the page within the folder, with a feature identifier marking each page that starts a new document.
- Example 1 (1st page in the folder, starts new document): 0001d.tif
- Example 2 (3rd page in the folder): 0003.tif
- Example 3 (12th page in folder, starts new document): 0012d.tif
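The folder-based structure above might be generated along these lines. This is a sketch under stated assumptions, not the project's actual tooling: the 3-digit container and 2-digit folder widths are inferred from the examples 10204 and 05010, the feature letter d is taken from the filename examples, and both function names are hypothetical.

```python
def folder_item_id(container: int, folder: int) -> str:
    """Build an item ID (directory name) from container and folder numbers.

    Assumption: 3-digit container followed by 2-digit folder, as the
    examples 10204 and 05010 suggest.
    """
    return f"{container:03d}{folder:02d}"


def folder_page_filename(page: int, starts_document: bool = False,
                         ext: str = "tif") -> str:
    """Build a ccccf filename: 4-digit page plus an optional feature letter."""
    feature = "d" if starts_document else ""
    return f"{page:04d}{feature}.{ext}"


print(folder_item_id(102, 4))
print(folder_page_filename(1, starts_document=True))
print(folder_page_filename(3))
```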
- Filename/Directory Structure 2A: Bibliographic Record/Print-Page Number Structure
- Typically used: books without machine-readable text
- Existing collection using this structure: none
- Item ID: Based on LCCN, becomes a directory name.
- Example 1 (California all the way back to 1828. LCCN: ca4-14356): 014356
- Example 2 (Along the bowstring or south shore of Lake Superior. LCCN: 03-6059): 06059
- Filename Pattern: cccppppf
- Based on sequential pages in the book and print page numbers and special features of pages (e.g., title page, table of contents, etc.)
- Example 1 (4th page, no print page number, table of contents): 0040000n.tif
- Example 2 (25th page, print page number 18, no feature): 0250018.tif
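As an illustration of the cccppppf pattern, the sketch below assembles a filename from a control page number, an optional print page number, and an optional feature letter. It assumes, based on Example 1, that a missing print page number is written as 0000; the function name `print_page_filename` and the range checks are hypothetical additions.

```python
from typing import Optional


def print_page_filename(control_page: int,
                        print_page: Optional[int] = None,
                        feature: str = "",
                        ext: str = "tif") -> str:
    """Build a cccppppf filename.

    Assumption: a page without a print page number gets 0000 in the
    pppp field, as in the example 0040000n.tif.
    """
    if not 0 <= control_page <= 999:
        raise ValueError("control page must fit in three digits")
    pppp = 0 if print_page is None else print_page
    if not 0 <= pppp <= 9999:
        raise ValueError("print page must fit in four digits")
    if len(feature) > 1:
        raise ValueError("feature identifier is at most one character")
    return f"{control_page:03d}{pppp:04d}{feature}.{ext}"


print(print_page_filename(4, feature="n"))
print(print_page_filename(25, print_page=18))
```

The width checks keep the base name within the 8-character limit stated in the note on filenames.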
- Filename/Directory Structure 2B: Bibliographic Record/No-Print-Page Number Structure
- Typically used: books with machine-readable text
- Existing collection using this structure: Local History; Upper Midwest
- Item ID: Based on LCCN, becomes a directory name.
- Example 1 (California all the way back to 1828. LCCN: ca4-14356): 014356
- Example 2 (Along the bowstring or south shore of Lake Superior. LCCN: 03-6059): 06059
- Filename Pattern: cccc
- Based on sequential pages in the book
- Example 1 (4th page, no print page number, table of contents): 0004.tif
- Example 2 (25th page, print page number 18, no feature): 0025.tif
- Filename/Directory Structure 3A: Serial Page Images/Print Pages Tracked
- Typically used: serials without machine-readable text
- Existing collection using this structure: none
- Item ID: Based on issue year and number (yyyynn)
- Filename Pattern: cccppppf
- Filename/Directory Structure 3B: Serial Page Images/No Print Page Numbers Tracked
- Typically used: serials with machine-readable text
- Existing collection using this structure: none
- Item ID: Based on issue year and number (yyyynn)
- Filename Pattern: cccc
- Filename/Directory Structure 3C: Collation Records and/or Cumulative Indexes
- Typically used: serial collation records and indexes
- Existing collection using this structure: none
- Item ID: Based on issue year and number (yyyynn)
- Filename Pattern: cccpppp
- Filename/Directory Structure 4: Copyright Registration and Technical Document Number Structure
- Typically used: Copyright deposits, technical reports
- Existing collection using this structure: none
- Item ID: Based on registration number or document number
- Filename Pattern: cccppppf
- Filename/Directory Structure 5: Large Volumes
- Typically used: Law Library materials
- Existing collection using this structure: none
- Item ID: Based on Law Library collation item number
- Example 1 (House Journal volume 1): 001
- Example 2 (Congressional Globe Volume 5): 005
- Note: Although item numbers repeat for every group of material, items are distinguished by their aggregate names.
- Filename Pattern: ccccpppp
- Based on sequential pages in the volume and print page numbers
- Example 1 (5th page scanned, print page number 3): 00050003.tif
- Example 2 (200th page, print page number 185): 02000185.tif
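Going the other way, a ccccpppp filename can be split back into its two numbers. The following is a rough sketch only: the regular expression, the lowercase alphanumeric extension rule, and the function name `parse_large_volume_filename` are assumptions made for illustration.

```python
import re

# Four digits of control page, four digits of print page, then an extension.
LARGE_VOLUME_RE = re.compile(
    r"(?P<control>\d{4})(?P<print>\d{4})\.(?P<ext>[a-z0-9]+)"
)


def parse_large_volume_filename(name: str) -> tuple:
    """Return (control_page, print_page) for a ccccpppp filename."""
    match = LARGE_VOLUME_RE.fullmatch(name)
    if match is None:
        raise ValueError(f"not a ccccpppp filename: {name!r}")
    return int(match["control"]), int(match["print"])


print(parse_large_volume_filename("00050003.tif"))
print(parse_large_volume_filename("02000185.tif"))
```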
- Target Filenames
- ID Targets: 0000.tif or 0000.jpg
- Resolution Targets: tg300bt.tif (see details on p. J-35)
- Segmented images
- Filename Pattern: cccfxy
- Example 1 (bottom half of the 79th page, scanned in two parts): 079sb1.tif
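To show how the cccfxy fields line up against the legend, the sketch below splits a segmented-image filename into its parts. It assumes a single letter for the feature identifier, a single letter for x, and a single digit for y, which is what the pattern and the one example show; the function name `parse_segment_filename` is hypothetical.

```python
import re

# ccc = control page, f = feature, x = alpha coordinate, y = numeric coordinate.
SEGMENT_RE = re.compile(
    r"(?P<control>\d{3})(?P<feature>[a-z])(?P<x>[a-z])(?P<y>\d)"
    r"\.(?P<ext>[a-z0-9]+)"
)


def parse_segment_filename(name: str) -> dict:
    """Split a cccfxy filename into its four fields."""
    match = SEGMENT_RE.fullmatch(name)
    if match is None:
        raise ValueError(f"not a cccfxy filename: {name!r}")
    return {
        "control_page": int(match["control"]),
        "feature": match["feature"],
        "x": match["x"],
        "y": int(match["y"]),
    }


print(parse_segment_filename("079sb1.tif"))
```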
- SGML and Associated Files
- SGML filename: itemid.sgm
- Page Info Group filename: itemid.pgi
- Reference filename: itemid.ref
- Omission Report filename: itemid.omi
- Entity filename: itemid.ent
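Because every associated file shares the item ID as its base name, the full set can be derived from the ID alone. This is a minimal sketch; the dictionary keys and the function name `associated_filenames` are hypothetical labels, while the five extensions are those listed above.

```python
# Extensions as listed in this summary; the keys are illustrative labels.
ASSOCIATED_EXTENSIONS = {
    "sgml": "sgm",
    "page_info_group": "pgi",
    "reference": "ref",
    "omission_report": "omi",
    "entity": "ent",
}


def associated_filenames(item_id: str) -> dict:
    """Map each associated file type to its filename for one item ID."""
    if item_id != item_id.lower() or not 1 <= len(item_id) <= 8:
        raise ValueError("item ID must be lowercase, 1 to 8 characters")
    return {kind: f"{item_id}.{ext}"
            for kind, ext in ASSOCIATED_EXTENSIONS.items()}


for kind, filename in associated_filenames("10204").items():
    print(kind, filename)
```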
Last revised April 23, 1997