This made me think of Microsoft's MarkItDown tool/library [1], which seems to overlap quite a bit in supported file formats [2,3] and conversion options [4]. I couldn't find any reference to the library on the website, so it may be useful to know about.
[1] https://github.com/microsoft/markitdown
[2] https://tomd.io/#supported-extensions
[3] https://github.com/microsoft/markitdown/tree/main/packages/m...
[4] https://github.com/microsoft/markitdown/blob/8a9d8f15936b206...
Looks great. Any plans to offer it as an API?
The API feature is already in the pipeline. Could you share the scenarios where you would need the API?