Octocurl

Octocurl (formerly octoget) is a wrapper around cURL for bulk downloads. If you have sufficient bandwidth, you can download multiple files at once, each with its own worker.

How to use

Usage: octocurl [options] url1 [url2] [url...]

-c x Specify the number of concurrent downloads. (Default: 4)

-s Prefetch the file sizes and sort so that larger files are downloaded first.

-p Do not prefetch the file sizes; fetch each size only when the file comes up in the queue. (Ignored when -s is used.)

-r Regular expression applied to each URL. (Requires the -o option.)

-o Form of the output file name. (Use * as a placeholder for the regex match.)

Examples:

octocurl http://example.com Downloads the webpage and saves it as 'example.com'

octocurl -c 3 http://example.com/{1,2,3,4}.ts Downloads 1.ts, 2.ts, 3.ts, 4.ts with three workers.

octocurl -s http://example.com/{1,2,3,4}.ts Prefetches the file sizes before downloading and sorts the queue in descending order. This can be useful when downloading many files with widely varying sizes.

octocurl -p http://example.com/{1,2,3,4}.ts Fetches each file's size only when it comes up in the queue.

octocurl -r '[0-9].ts' -o './*' http://vid.com/{1,2}.ts-foo-bar Uses the regex to save the files as 1.ts and 2.ts.
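The options compose, so a sort-by-size download with a larger worker pool would look something like the following (a sketch based on the flags described above, not an example from the project itself):

octocurl -s -c 8 http://example.com/{1,2,3,4,5,6,7,8}.ts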

Build

Prerequisite

You need the ncurses, libcurl, and pthreads libraries to compile this.
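On a Debian or Ubuntu system, the development headers can typically be installed as shown below (a sketch; package names vary by distribution, and pthreads ships with glibc, so no separate package is needed for it):

sudo apt install build-essential libncurses-dev libcurl4-openssl-dev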

Compilation (Make)

Currently, installation is not automatic.

Compiling for development can be done by issuing

make

This creates the binary at bin/octocurl.

Compiling for standard use can be done by issuing

make release

This also creates the binary at bin/octocurl.
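Since there is no install target yet, one way to place the release binary on your PATH manually (an assumption, not something the project's Makefile provides) is:

install -m 755 bin/octocurl /usr/local/bin/octocurl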

Compilation (CMake)

If you prefer to use CMake:

mkdir build

cd build

cmake -S .. -B .

make

This creates the binary at build/octocurl.
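Equivalently, assuming CMake 3.13 or newer, the same out-of-source build can be driven from the repository root without changing directories:

cmake -S . -B build
cmake --build build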

Testing

# You can test if it works by:
make test

# You can clean up the files downloaded during testing by:
make testclean

Example

Downloading the Linux From Scratch packages (v11.2), as listed in https://linuxfromscratch.org/lfs/view/stable/chapter03/introduction.html, with the command

wget --input-file=wget-list-sysv --continue

took 1 minute and 42.4695 seconds for all the packages.

On the other hand, using octocurl with the following command:

bin/octocurl $(cat wget-list-sysv)

it took 29.0505 seconds. (gcc was the bottleneck here; everything else downloaded quickly. With the worker count set to 25, it took only 22.076 seconds.)

Note that this sort of performance gain depends on running the program in a high-bandwidth environment.

Here are results from more controlled trials (consistent background network activity, etc.):

Command                                        Time        Comment
wget --input-file=wget-list-sysv --continue    1m34.072s   Sequential download (run at a later time)
bin/octocurl $(cat wget-list-sysv)             0m29.048s   octocurl with 4 workers
bin/octocurl -c 25 $(cat wget-list-sysv)       0m20.049s   octocurl with 25 workers
bin/octocurl -c 86 $(cat wget-list-sysv)       0m19.070s   octocurl with one worker per file

Note that above a certain number of workers there is little further speedup; you may even see some performance drop.
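Timings like these can be reproduced with the shell's time builtin (a sketch; that this was the measurement method used above is an assumption):

time wget --input-file=wget-list-sysv --continue
time bin/octocurl -c 25 $(cat wget-list-sysv)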
