Matt Robinson
a97e4252e3
feat: add UnstructuredExcelLoader for .xlsx and .xls files ( #5617 )
...
# Unstructured Excel Loader
Adds an `UnstructuredExcelLoader` class for `.xlsx` and `.xls` files.
Works with `unstructured>=0.6.7`. A plain text representation of the
Excel file will be available under the `page_content` attribute in the
doc. If you use the loader in `"elements"` mode, an HTML representation
of the Excel file will be available under the `text_as_html` metadata
key. Each sheet in the Excel document is its own document.
### Testing
```python
from langchain.document_loaders import UnstructuredExcelLoader
loader = UnstructuredExcelLoader(
"example_data/stanley-cups.xlsx",
mode="elements"
)
docs = loader.load()
```
## Who can review?
@hwchase17
@eyurtsev
2023-06-03 12:44:12 -07:00
..
2023-06-03 12:44:12 -07:00
2023-05-02 15:24:53 -07:00
2023-06-01 00:54:42 -07:00
2023-05-02 15:24:53 -07:00
2023-05-29 07:25:17 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-02 15:24:53 -07:00
2023-05-02 15:24:53 -07:00
2023-05-02 15:24:53 -07:00
2023-05-25 00:21:31 -07:00
2023-05-18 09:56:51 -04:00
2023-05-05 17:44:54 -07:00
2023-05-26 18:55:21 -07:00
2023-05-05 17:44:54 -07:00
2023-05-02 15:24:53 -07:00
2023-05-29 07:19:43 -07:00
2023-05-05 17:44:54 -07:00
2023-05-02 15:24:53 -07:00
2023-05-05 17:44:54 -07:00
2023-05-29 07:19:43 -07:00
2023-05-31 17:54:05 -07:00
2023-06-01 13:06:17 -07:00
2023-05-02 15:24:53 -07:00
2023-05-02 15:24:53 -07:00
2023-05-05 17:44:54 -07:00
2023-05-19 14:28:17 -07:00
2023-06-03 12:44:12 -07:00
2023-05-31 17:54:05 -07:00
2023-05-02 15:24:53 -07:00
2023-05-18 09:55:14 -04:00
2023-05-02 15:24:53 -07:00
2023-05-02 15:24:53 -07:00
2023-05-29 20:11:21 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-18 09:27:53 -04:00
2023-05-02 15:24:53 -07:00
2023-05-05 17:44:54 -07:00
2023-05-16 23:42:53 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-24 11:47:01 -07:00
2023-05-24 12:31:55 -07:00
2023-05-23 21:56:41 -07:00
2023-05-05 17:44:54 -07:00
2023-05-02 15:24:53 -07:00
2023-05-22 16:43:07 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-02 15:24:53 -07:00
2023-05-14 18:26:25 -07:00
2023-05-02 15:24:53 -07:00
2023-05-23 21:56:41 -07:00
2023-05-05 17:44:54 -07:00
2023-05-15 09:47:02 -04:00
2023-05-21 09:13:20 -07:00
2023-05-31 15:02:57 -07:00
2023-05-02 15:24:53 -07:00
2023-05-31 17:54:05 -07:00
2023-05-02 15:24:53 -07:00
2023-05-31 15:27:38 -07:00
2023-05-05 17:44:54 -07:00
2023-05-02 15:24:53 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-16 14:35:25 -07:00
2023-05-23 21:56:41 -07:00
2023-05-05 17:44:54 -07:00
2023-05-29 19:47:56 -07:00
2023-05-05 17:44:54 -07:00
2023-05-31 15:03:21 -07:00
2023-04-13 22:15:03 -07:00
2023-05-23 15:57:33 -07:00
2023-05-05 17:44:54 -07:00
2023-05-05 17:44:54 -07:00
2023-05-06 09:32:45 -07:00
2023-05-05 17:44:54 -07:00