requests-html

mirror of https://github.com/kennethreitz/requests-html.git synced 2026-06-05 23:00:20 +00:00

Author	SHA1	Message	Date
Alessandro Romano	f4d9de9f9c	Delete old class member browser_argsverify	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	36a647e6ea	Fix tests	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	b769fc3dac	Update docs	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	1faa61b7d0	Add asyncsession.run method	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	3d6b82862d	Fix r.html.next() for next url	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	69bc5dcb20	Fix HTML class to use async iter and render on bare mode	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	99f9c89766	Add arender method to HTML	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	a25b3737ad	Add async iterator to HTML class	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	2d4c58deb8	Add HTMLSession.browser runtime exception, AsyncSession an async close method	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	47a3646405	Create a base session	2018-09-19 15:45:17 +02:00
Ordanis Sanchez	d0a5642de7	Fix merge errors on HTMLSession	2018-09-19 15:43:13 +02:00
Alessandro Romano	5cc1ca4a70	Add verify parameter coherently with ignoreHTTPSErrors pyppeteer parameter	2018-09-19 15:42:20 +02:00
kennethreitz	aba5c8cbb5	Merge pull request #212 from pigna90/add-ignoreHTTPSError-parameter Added ignoreHTTPErrors parameter	2018-09-18 12:00:00 -04:00
Alessandro Romano	52ddd80824	Merge branch 'master' into add-ignoreHTTPSError-parameter	2018-09-18 17:31:17 +02:00
kennethreitz	7353466355	Merge pull request #153 from clarksun/patch-1 Automatically?	2018-09-18 02:56:27 -04:00
kennethreitz	51afd9e474	Merge pull request #157 from CodeMogul/fixes Made basic edits	2018-09-18 02:56:12 -04:00
kennethreitz	29acbaabc7	Merge pull request #160 from SN9NV/patch-1 Create LXML from raw_html	2018-09-18 02:54:05 -04:00
kennethreitz	f760df2be2	Merge pull request #162 from SN9NV/patch-2 Replace errors when decoding raw_html	2018-09-18 02:52:37 -04:00
kennethreitz	a693658f21	Merge pull request #163 from Norbinsh/master Minor typo fix	2018-09-18 02:52:00 -04:00
kennethreitz	625910e1a5	Merge branch 'master' into master	2018-09-18 02:51:54 -04:00
kennethreitz	c47fd127b1	Merge pull request #176 from wasabigeek/patch-1 Docs: Add note to install Linux packages	2018-09-18 02:47:34 -04:00
kennethreitz	c6f6858ea0	Merge pull request #200 from meetmangukiya/patch-1 requests_html.py: Typo HTTPSession -> HTMLSession	2018-09-18 02:44:41 -04:00
kennethreitz	e37b40e59f	Merge pull request #189 from carrionc/patch-2 Multiple chromium tab fix	2018-09-18 02:42:54 -04:00
kennethreitz	e05933acfc	Merge pull request #191 from montenegrodr/patch-1 fix: typo	2018-09-18 02:42:43 -04:00
kennethreitz	5de8c88a1f	Merge pull request #203 from rachmadaniHaryono/master exclude html files for github linguist	2018-09-18 02:42:34 -04:00
kennethreitz	5d7c859975	Merge pull request #193 from pennyarcade/master Update requests_html.py	2018-09-18 02:41:02 -04:00
kennethreitz	16c0dbe13d	Merge pull request #201 from timotk/patch-1 Fix minor typo	2018-09-18 02:40:16 -04:00
kennethreitz	87b183c7fc	Merge pull request #205 from leven-cn/develop Add "tag" attribute for Element objects	2018-09-18 02:40:06 -04:00
kennethreitz	95a113cfc3	Merge pull request #217 from m9mhmdy/master Fix a small typo	2018-09-18 02:37:40 -04:00
kennethreitz	ee34f3f9ea	Update README.rst	2018-09-17 08:07:49 -04:00
kennethreitz	d159d2045a	Update README.rst	2018-09-17 08:07:22 -04:00
kennethreitz	6190a47eef	Update README.rst	2018-09-17 08:06:43 -04:00
m9mhmdy	cb6e5fb557	Fix a small typo	2018-08-30 16:25:25 +02:00
Alessandro Romano	b1a7acf33a	Added ignoreHTTPErrors parameter	2018-08-09 15:03:15 +02:00
Li Yun	1c21f63672	Add "lineno" attribute for Element object	2018-07-04 11:30:59 +08:00
Li Yun	116a4b08eb	Add "tag" attribute for Element object	2018-07-04 11:20:08 +08:00
rachmadaniHaryono	96973aaf4e	chg: dev: generic html	2018-07-02 21:49:28 +08:00
rachmadaniHaryono	6fef1d8583	new: dev: ignore html count	2018-07-02 21:44:00 +08:00
Timo	71e2571d3a	Fix minor typo lopp -> loop in init docstring of AsyncHTMLSession.	2018-06-25 13:18:57 +02:00
Meet Mangukiya	4db2931ddc	requests_html.py: Typo HTTPSession -> HTMLSession	2018-06-24 22:23:47 +05:30
Martin Rotwang	96dbba8fbd	Update requests_html.py e.g. to add a proxy setting usage: s=Session(browser_args=['--no-sandbox', '--proxy-server=127.0.0.1:9876']) @see: https://github.com/GoogleChrome/puppeteer/issues/336	2018-06-05 12:39:46 +02:00
Robson D. Montenegro	a1c5e6ac8b	fix: typo	2018-06-03 21:52:51 +01:00
carrionc	956e60054c	Multiple chromium tab fix Within the render function, the page is rendered through the _async_render function. This function will try to render content by first creating a page, and currently will only close said page if the content is generated. However, if at any point there's a timeout beforehand, the current page isn't closed, and instead _async_render will be called again [as per the # assigned to retries in render()] and end up leaving behind an unused page. This change will enable render to close the "failed" attempt BEFORE opening a new page to try again, and should fix the issue of massive cpu buildup with multiple chromium instances. Sorry if this is messy, it's my first time using git to make a change.	2018-05-30 00:40:37 -04:00
Nicholas	81998d84c4	Docs: Add note to install Linux packages I ran into `pyppeteer.errors.BrowserError: Failed to connect to browser port:` and after a bit of snooping found that some Linux packages needed to be installed on my machine for pyppeteer to run. Suggest to add a note to save others time!	2018-05-07 01:32:59 +08:00
Angus Dippenaar	2a7d08722d	Initialize PyQuery with lxml PyQuery with XML sites also has the same issue that LXML does with unicode encoded strings because it uses LXML to parse the page. The fix has already been applied to LXML, so we can fix the issue with PyQuery by passing the already parsed LXML into PyQuery.	2018-04-14 21:32:00 +02:00
Shay Elmualem	50c9058d04	Minor typo fix	2018-04-07 22:18:46 +03:00
Angus Dippenaar	05ff6e87ca	Replace errors when decoding raw_html Some websites don't have valid bytes, even when the encoding is specified. I'm not 100% sure if replacing "bad" bytes is the correct way to fix the problem. It seems to fix the issues I've run into with some sites.	2018-04-07 17:15:51 +02:00
Angus Dippenaar	c21f0784cd	Create LXML from raw_html Create LXML from `self.raw_html` instead of `self.html` to allow LXML to process plain XML pages as per beda42's findings in issue https://github.com/kennethreitz/requests-html/issues/145 I have tested this change with 200 sites and it seems to fix the issue. HTML pages seem to all be working as expected. I haven't run into an issue with any that I've tested.	2018-04-05 13:47:39 +02:00
Siddhesh Nachane	cb55034b42	Made basic fixes 1. Corrected Comments and DocStrings Spell Errors. 2. Added .vscode folder to .gitignore 3. Replaced `i` with place holder `_` (as i is never used)	2018-03-31 22:51:34 +05:30
Sun Wei	132e5cb522	Automatically?	2018-03-26 17:57:50 +08:00

1 2 3 4 5 ...

393 Commits