Hi,
I currently have large PDFs of technical documents that I'm going to have to rewrite in a more 'Pinecone friendly' format. I've been told Markdown or HTML works well, but I think I'll still have issues with data being taken out of context.
Below is a snippet of the docs. If you look at some of the paragraphs separately, you'd have no idea they relate to the relevant heading. Short of putting the heading on every paragraph and making sure each one is smaller than my chunk size, is there a good way to get around this?
ZONE TEMPERATURE CONTROLLER INTERFACES
GENERAL:
The Air Conditioning System Controller (ACSC) communicates with other systems through hardware interfaces.
SDAC:
System data information is transmitted to the System Data Acquisition Concentrator (SDAC) via ARINC buses for system monitoring. Temperature, valve position, and other data are used for warnings and display.
DMU:
The ACSCs send system main status data to the Data Management Unit (DMU) for maintenance monitoring functions. The ACSC sends trim-air Pressure Regulating Valve (PRV) position, pack flow, water extractor and pack compressor discharge temperatures, BYPass valve, and ram air inlet flap positions to the DMU.
In HTML it would still have the same issue:
*Edit: of course the HTML is being rendered here, but you get the drift.
ZONE TEMPERATURE CONTROLLER INTERFACES
GENERAL:
The Air Conditioning System Controller (ACSC) communicates with other systems through hardware interfaces.
SDAC:
System data information is transmitted to the System Data Acquisition Concentrator (SDAC) via ARINC buses for system monitoring. Temperature, valve position, and other data are used for warnings and display.
DMU:
The ACSCs send system main status data to the Data Management Unit (DMU) for maintenance monitoring functions. The ACSC sends trim-air Pressure Regulating Valve (PRV) position, pack flow, water extractor and pack compressor discharge temperatures, BYPass valve, and ram air inlet flap positions to the DMU.
Keep in mind that this is a tiny snippet. There's generally one heading per page, and some paragraphs are a lot larger.
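For reference, the brute-force workaround I mentioned (putting the heading on every paragraph and keeping each piece under the chunk size) could be sketched roughly like this in Python. It assumes headings are all-caps lines and subheadings are all-caps lines ending in a colon, as in my snippet. The names `chunk_with_context` and `max_chars` are just illustrative and not part of Pinecone or any library, and the size limit is counted in characters rather than tokens.

```python
import re


def chunk_with_context(text, max_chars=500):
    """Split text into chunks, each prefixed with its 'HEADING > SUBHEADING' path.

    Assumption: an all-caps line is a heading, and an all-caps line ending
    in ':' is a subheading. Everything else is treated as a paragraph.
    """
    heading, subheading = "", ""
    chunks = []

    def emit(prefix, body):
        # Prepend the heading path so the chunk carries its own context.
        chunks.append(f"{prefix}\n{body}" if prefix else body)

    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        if line.isupper():
            if line.endswith(":"):
                subheading = line.rstrip(":")
            else:
                heading, subheading = line, ""
            continue

        prefix = " > ".join(part for part in (heading, subheading) if part)
        budget = max_chars - len(prefix) - 1  # room left for the body text

        # Pack whole sentences into a chunk until the budget would be exceeded.
        # A single sentence longer than the budget is emitted as-is.
        current = ""
        for sentence in re.split(r"(?<=[.!?])\s+", line):
            if current and len(current) + 1 + len(sentence) > budget:
                emit(prefix, current)
                current = sentence
            else:
                current = f"{current} {sentence}".strip()
        if current:
            emit(prefix, current)

    return chunks
```

Run on the snippet above, each chunk would start with something like `ZONE TEMPERATURE CONTROLLER INTERFACES > SDAC` followed by the paragraph text, so the paragraph no longer loses its heading when retrieved on its own. The downside is the one I'm worried about: the heading text is repeated in every chunk.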