dirkf
3d649843fe
[YouTube] Simplify pattern for nsig function name extraction
2024-12-13 03:15:05 +00:00
dirkf
9bc8fcf23c
[YouTube] Pass nsig value as return hook, fixes player 3bb1f723
2024-12-13 03:15:05 +00:00
dirkf
a93fff3052
[YouTube] Handle player 3bb1f723
...
* fix signature code extraction
* raise if n function returns input value
* add new tests from yt-dlp
Co-authored-by: bashonly
2024-12-13 03:15:05 +00:00
dirkf
067ad7439c
[jsinterp] Strip /* comments */ when parsing
...
* NB: _separate() is looking creaky
2024-12-13 03:15:05 +00:00
dirkf
7ab85d88d6
[jsinterp] Fix and improve "methods"
...
* push, unshift return new length
* impove edge cases for push/pop, shift/unshift, forEach, indexOf, charCodeAt
* increase test coverage
2024-12-13 03:15:04 +00:00
dirkf
4bf85ca5ba
[jsinterp] Fix and improve split/join
...
* improve split/join edge cases
* correctly implement regex split (not like re.split)
2024-12-13 03:15:04 +00:00
dirkf
9993e8c5a9
[jsinterp] Fix and improve arithmetic operations
...
* addition becomes concat with a string operand
* improve handling of edgier cases
* arithmetic in float like JS (more places need cast to int?)
* increase test coverage
2024-12-12 11:42:15 +00:00
dirkf
7d3045ce12
[jsinterp] Support multiple indexing (eg a[1][2])
...
* extend single indexing with improved RE (should probably use/have used _separate_at_paren())
* fix some cases that should have given undefined, not throwing
* standardise RE group names
* support length of objects, like {1: 2, 3: 4, length: 42}
2024-12-12 11:42:15 +00:00
dirkf
527c137e4e
[jsinterp] Fix and improve loose and strict equality operations
...
* reimplement loose equality according to MDN (eg, 1 == "1")
* improve strict equality (eg, "abc" === "abc" but 'abc' is not 'abc')
* add tests for above
2024-12-12 11:42:15 +00:00
dirkf
7e93935dda
[jsinterp] Implement typeof
operator
2024-12-12 11:42:15 +00:00
dirkf
dbc08fba83
[jsinterp] Improve slice implementation for player b12cc44b
...
Partly taken from yt-dlp/yt-dlp#10664, thx seproDev
Fixes #32896
2024-08-06 20:51:38 +01:00
Aiur Adept
71223bff39
[Youtube] Fix nsig extraction for player 20dfca59 ( #32891 )
...
* dirkf's patch for nsig extraction
* add generic search per yt-dlp/yt-dlp/pull/10611 - thx bashonly
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2024-08-01 19:18:34 +01:00
dirkf
e1b3fa242c
[Youtube] Find n
function name in player 3400486c
...
Fixes #32877
2024-07-25 00:16:00 +01:00
dirkf
16f5bbc464
[YouTube] Fix nsig processing for player b22ef6e7
...
* improve extraction of function name (like yt-dlp/yt-dlp#10390 )
* always use JSInterp to extract function code (yt-dlp/yt-dlp#10396 , thx seproDev, pukkandan)
2024-07-11 00:50:46 +01:00
dirkf
d35ce6ce95
[jsinterp] Support functionality for player b22ef6e7
...
* support `prototype` for call() and apply() (yt-dlp/yt-dlp#10392 , thx Grub4k)
* map JS `Array` to `list`
2024-07-11 00:50:46 +01:00
dirkf
756f6b45c7
[jsinterp] Re-align JSInterp and tests (esp.) with yt-dlp
...
Thx: various yt-dlp authors
2024-07-11 00:50:46 +01:00
dirkf
4652109643
[core,utils] Implement unsafe file extension mitigation
...
* from https://github.com/yt-dlp/yt-dlp/security/advisories/GHSA-79w7-vh3h-8g4 , thx grub4k
2024-07-02 15:38:50 +01:00
dirkf
ad01fa6cca
[jsinterp] Add Debugger from yt-dlp
...
* https://github.com/yt-dlp/yt-dlp/commit/8f53dc4
* thx pukkandan
2024-06-20 20:03:49 +01:00
dirkf
3bde6a5752
[test] Improve download test
...
* skip reason can't be unicode in Py2
* remove duplicate assert...Equal functions
2024-06-11 12:52:13 +01:00
dirkf
21924742f7
[InfoExtractor] Misc yt-dlp back-ports, etc
...
* add _yes_playlist() method
* avoid crash using _NETRC_MACHINE
* use _search_json() in _search_nextjs_data()
* _search_nextjs_data() default is JSON, not text
* test for above
2024-05-30 15:46:36 +01:00
dirkf
eee9a247eb
[utils] Split out traversal.py dummy and traversal tests
2024-05-30 15:46:36 +01:00
dirkf
06da64ee51
[utils] Update traverse_obj() from yt-dlp
...
* remove `is_user_input` option per https://github.com/yt-dlp/yt-dlp/pull/8673
* support traversal of compat_xml_etree_ElementTree_Element per https://github.com/yt-dlp/yt-dlp/pull/8911
* allow un/branching using all and any per https://github.com/yt-dlp/yt-dlp/pull/9571
* support traversal of compat_cookies.Morsel and multiple types in `set()` keys per https://github.com/yt-dlp/yt-dlp/pull/9577
thx Grub4k for these
* also, move traversal tests to a separate class
* allow for unordered dicts in tests for Py<3.7
2024-05-30 15:46:36 +01:00
dirkf
c2766cb80e
[test/test_download] Support 'playlist_maxcount:count' expected value
...
* parallel to `playlist_mincount'
* specify both for a range of playlist lengths
* if max < min the test will always fail!
2024-04-22 01:34:26 +01:00
dirkf
21792b88b7
[external/FFmpeg] Fix and improve --ffmpeg-location handling
...
* pass YoutubeDL (FileDownloader) to FFmpegPostProcessor constructor
* consolidate path search in FFmpegPostProcessor
* make availability of FFmpegFD depend on existence of FFmpegPostProcessor
* detect ffmpeg executable on instantiation of FFmpegFD
* resolves #32735
2024-03-27 13:11:17 +00:00
dirkf
acc383b9e3
[utils] Let int_or_none() accept a base, like int()
2024-03-08 13:03:42 +00:00
Hubert Hirtz
f0812d7848
[utils] Handle user:pass in URLs ( #28801 )
...
* Handle user:pass in URLs
Fixes "nonnumeric port" errors when youtube-dl is given URLs with
usernames and passwords such as:
http://username:password@example.com/myvideo.mp4
Refs:
- https://en.wikipedia.org/wiki/Basic_access_authentication
- https://tools.ietf.org/html/rfc1738#section-3.1
- https://docs.python.org/3.8/library/urllib.parse.html#urllib.parse.urlsplit
Fixes #18276 (point 4)
Fixes #20258
Fixes #26211 (see comment)
* Align code with yt-dlp
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2024-03-04 01:27:55 +00:00
dirkf
4eaeb9b2c6
[InfoExtractor] Support byte range for DASH
...
* adapted from https://github.com/ytdl-org/youtube-dl/pull/30279
* thx former GH user kikuyan
2024-02-02 12:36:05 +00:00
dirkf
c58b655a9e
[InfoExtractor] Support DASH subtitle extraction (yt-dlp back-port)
2024-02-02 12:36:05 +00:00
dirkf
dc512e3a8a
[YouTube] Fix like_count
extraction using likeButtonViewModel
...
* also fix various tests
* TODO: check against yt-dlp tests
2024-01-22 11:10:34 +00:00
dirkf
6651871416
[compat] Rework compat for method
parameter of compat_urllib_request.Request
constructor
...
* fixes #32573
* does not break `utils.HEADrequest` (eg)
2024-01-22 11:10:34 +00:00
dirkf
66ab0814c4
[utils] Revert bbd3e7e
, updating docstring, test instead
2023-09-03 23:15:19 +01:00
dirkf
21caaf2380
[test] Remove redundancy from lambda expected value regex
2023-09-03 01:13:40 +01:00
dirkf
abef53466d
[utils] Rework URL path munging for ., .. components
...
* move processing to YoutubeDLHandler
* also process `Location` header for redirect
* use tests from https://github.com/yt-dlp/yt-dlp/pull/7662
2023-07-29 14:27:26 +01:00
dirkf
e7926ae9f4
[utils] Rework decoding of Content-Encoding
s
...
* support nested encodings
* support optional `br` encoding, if brotli package is installed
* support optional 'compress' encoding, if ncompress package is installed
* response `Content-Encoding` has only unprocessed encodings, or removed
* response `Content-Length` is decoded length (usable for filesize metadata)
* use zlib for both deflate and gzip decompression
* some elements taken from yt-dlp: thx especially coletdjnz
2023-07-29 14:27:26 +01:00
dirkf
a25e9f3c84
[compat] Use compat_open()
2023-07-25 13:19:43 +01:00
dirkf
aac33155e4
[build] Add and use devscripts/utils
2023-07-25 13:19:43 +01:00
dirkf
2b7dd3b2a2
[utils] Fix update_Request() with empty data (not None)
2023-07-25 13:19:43 +01:00
dirkf
44faa71b19
[test/test_execution.py] Use compat_subprocess_get_DEVNULL()
2023-07-25 13:19:43 +01:00
dirkf
7bce2ad441
[build] Fix various Jython CI and test issues
2023-07-25 13:19:43 +01:00
dirkf
b2741f2654
[InfoExtractor] Add search methods for Next/Nuxt.js from yt-dlp
...
* add _search_nextjs_data(), from https://github.com/yt-dlp/yt-dlp/pull/1386
thanks selfisekai
* add _search_nuxt_data(), from https://github.com/yt-dlp/yt-dlp/pull/1921 ,
thanks Lesmiscore, pukkandan
* add tests for the above
* also fix HTML5 type recognition and tests, from
222a230871
,
thanks Lesmiscore
* update extractors in PR using above, fix tests.
2023-07-19 22:14:50 +01:00
dirkf
d9d07a9581
[utils] Improve js_to_json, align with yt-dlp
...
* support variable substitution, from https://github.com/yt-dlp/yt-dlp/pull/#521 etc,
thanks ChillingPepper, Grub4k, pukkandan
* improve escape handling, from https://github.com/yt-dlp/yt-dlp/pull/#521
thanks Grub4k
* support template strings from https://github.com/yt-dlp/yt-dlp/pull/6623
thanks Grub4k
* add limited `!` evaluation (eg, !!0 -> false, see tests)
2023-07-19 22:14:50 +01:00
dirkf
825a40744b
[utils] Align traverse_obj() with yt-dlp
...
Thanks Grub4k for these:
* traverse `Iterable`s, from https://github.com/yt-dlp/yt-dlp/pull/6902 , etc
* traverse `set` key for transformations/filters, `re.Match` group names, from
776995bc10
, etc
* traverse `re.Match`es, from https://github.com/yt-dlp/yt-dlp/pull/5174
* always return list when branching, from https://github.com/yt-dlp/yt-dlp/pull/5170
2023-07-19 22:14:50 +01:00
dirkf
1d8d5a93f7
[test] Fixes for old Pythons
2023-07-18 10:50:46 +01:00
bashonly
21438a4194
[downloader/external] Fix cookie support
2023-07-18 10:50:46 +01:00
Simon Sawicki
8334ec961b
[core] Process header cookies on loading
2023-07-18 10:50:46 +01:00
dirkf
b383be9887
[core] Remove Cookie
header on redirect to prevent leaks
...
Adated from yt-dlp/yt-dlp-ghsa-v8mc-9377-rwjj/pull/1/commits/101caac
Thx coletdjnz
2023-07-18 10:50:46 +01:00
dirkf
46fde7caee
[core] Update redirect handling from yt-dlp
...
* Thx coletdjnz: https://github.com/yt-dlp/yt-dlp/pull/7094
* add test that redirected `POST` loses its `Content-Type`
2023-07-18 10:50:46 +01:00
dirkf
1720c04dc5
[test] Make skipped tests in test_execution work with Py 2.6
2023-07-18 10:50:46 +01:00
dirkf
d5ef405c5d
[core] Align error reporting methods with yt-dlp
2023-07-18 10:50:46 +01:00
dirkf
f47fdb9564
[utils] Add {expected_type} and Iterable support to traverse_obj()
2023-07-18 10:50:46 +01:00