Converts a local file to Markdown with ragnar and collapses chunk text for term screening.
Arguments
- local_path
Path to a local source file.
- origin
Origin label passed to ragnar.
- target_size, target_overlap, segment_by_heading_levels
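Chunking controls: the target chunk size, the overlap between adjacent chunks, and the heading levels at which to segment.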
- screening_cache_dir
Optional directory for cached screening text.
- use_screening_cache
Logical; use cached screening text when available.
- refresh_screening_cache
Logical; ignore existing cache entries and rewrite them.
Examples
if (FALSE) { # \dontrun{
  text <- extract_file_text_for_screening(
    local_path = file.path("data", "local_sources", "report.pdf"),
    origin = "local-file://report.pdf"
  )
} # }
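The three cache arguments interact, so a small sketch may help show one plausible reading of them. The helper below is hypothetical and is not the package's implementation: the name `read_or_build_screening_text`, the `key` and `build_text` parameters, and the one-file-per-key `.txt` layout are all assumptions made for illustration. It also assumes that turning off `use_screening_cache` without requesting a refresh bypasses the cache entirely.

```r
# Hypothetical helper illustrating the cache flags; not the package's code.
read_or_build_screening_text <- function(key, build_text,
                                         screening_cache_dir = NULL,
                                         use_screening_cache = TRUE,
                                         refresh_screening_cache = FALSE) {
  # No cache directory, or caching switched off without a refresh: just rebuild.
  if (is.null(screening_cache_dir) ||
      (!use_screening_cache && !refresh_screening_cache)) {
    return(build_text())
  }

  dir.create(screening_cache_dir, recursive = TRUE, showWarnings = FALSE)
  cache_file <- file.path(screening_cache_dir, paste0(key, ".txt"))

  # Reuse a cached entry only when reading is allowed and no refresh is requested.
  if (use_screening_cache && !refresh_screening_cache && file.exists(cache_file)) {
    return(paste(readLines(cache_file, warn = FALSE), collapse = "\n"))
  }

  # Otherwise rebuild the text and (re)write the cache entry.
  text <- build_text()
  writeLines(text, cache_file)
  text
}

# Usage with a stand-in builder that counts how often it runs.
cache_dir <- tempfile("screening-cache-")
builds <- 0
build <- function() {
  builds <<- builds + 1
  "collapsed chunk text"
}

first <- read_or_build_screening_text("report", build, cache_dir)
second <- read_or_build_screening_text("report", build, cache_dir)
third <- read_or_build_screening_text("report", build, cache_dir,
                                      refresh_screening_cache = TRUE)
```

In this sketch the first call builds the text and writes the cache entry, the second reads the entry without rebuilding, and the third ignores the entry and rewrites it.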
