Eugene Yurtsev
423f497168
Add BlobParser abstraction ( #3979 )
...
This PR adds the BlobParser abstraction.
It follows the proposal described here:
https://github.com/hwchase17/langchain/pull/2833#issuecomment-1509097756
2023-05-05 21:43:38 -04:00
Eugene Yurtsev
7cce68a051
Add minimal file system blob loader ( #3669 )
...
This adds a minimal file system blob loader.
If looks good, this PR will be merged and a few additional enhancements will be made.
2023-05-01 21:37:26 -04:00
Harrison Chase
bd7e0a534c
Harrison/csv loader ( #3771 )
...
Co-authored-by: mrT23 <tal.r@codium.ai >
2023-04-28 21:54:24 -07:00
Eugene Yurtsev
708787dddb
Blob: Add validator and use future annotations ( #3650 )
...
Minor changes to the Blob schema.
---------
Co-authored-by: Zander Chase <130414180+vowelparrot@users.noreply.github.com >
2023-04-27 14:33:59 -04:00
Eugene Yurtsev
5d02010763
Introduce Blob and Blob Loader interface ( #3603 )
...
This PR introduces a Blob data type and a Blob loader interface.
This is the first of a sequence of PRs that follows this proposal:
https://github.com/hwchase17/langchain/pull/2833
The primary goals of these abstraction are:
* Decouple content loading from content parsing code.
* Help duplicated content loading code from document loaders.
* Make lazy loading a default for langchain.
2023-04-27 09:45:25 -04:00
Harrison Chase
f281033362
rm pandas dependency ( #2102 )
2023-03-28 08:38:19 -07:00
Harrison Chase
30e3b31b04
Harrison/document cleanup ( #2062 )
...
Co-authored-by: Delip Rao <delip@users.noreply.github.com >
2023-03-27 16:32:55 -07:00
Harrison Chase
cc423f40f1
Harrison/youtube loader ( #1545 )
...
Co-authored-by: Julian Wustl <57504258+Julianwustl@users.noreply.github.com >
2023-03-08 20:53:27 -08:00