Releases · ewels/clusterflow

This repository has been archived by the owner on Apr 11, 2023. It is now read-only.

10 Apr 09:45

ewels

v0.5

545a73e

Cluster Flow v0.5 Latest

Latest

Salmon - New module!
- Written by @darogan
Software versions now explicitly cleaned and printed to logs for all tools.
- Final summary e-mails list versions at bottom.
- New MultiQC module parses these, plus more.
Launch logfile cf_[pipeline_id]_submissionlog.txt now lists environment information
- Prints loaded environment modules, current PATH, user and system information
Made use of $RealBin instead of $Bin
- Cluster Flow now works when the main cf executable is symlinked
Additional optional trimming parameters for TrimGalore! module
Entirely new section of documentation describing module parameters
Fixed a bug in parsing command line merge regexes.
picard_dedup uses the environment TMPDIR if available
Increased memory requirement for BWA alignment module.
Many minor bugfixes

Assets 2

14 Jul 14:40

ewels

v0.4

31cd58a

Cluster Flow v0.4

This is a huge release that got lost on it's way. A great number of things have changed since the
v0.3 release, so please be prepared for a bit of tinkering if you're updating.

I'll try to make releases smaller and more frequently in the future to avoid this happening again!

Download

Download Cluster Flow v0.4 (.zip)
Download Cluster Flow v0.4 (.tar.gz)

Changelog

Warning: Break of backwards compatability
- The way that genome references are handled has been rewritten.
  - Genome references are no longer tied to specific types, they are now agnostic.
    to the type of reference, making it far easier to whatever type of reference you need.
    Additionally, the wizard to add genome paths has been written and is now largely automated,
    making it super fast to add new genomes.
  - A consequence of this change is any genomes.config files written before v0.3 of
    Cluster Flow will no longer work. Thankfully the fix is easy! Replace @bowtie_path
    with @reference bowtie. @gtf_path changes to @reference gtf and so on.
    @genome_path changes to @reference fasta.
  - If you have any custom pipelines these will also need to be updated. @require_bowtie
    changes to @require_reference bowtieand so on. See updated example module files
    for examples on how to update custom modules.
  - Apologies for any inconvenience that this change incurs. Feel free to get in touch
    if you have any problems.
- ~/clusterflow/ directory moved to ~/.clusterflow/ to reduce home directory clutter.
  - Cluster Flow won't find your old config file - run mv ~/clusterflow/ ~/.clusterflow/ to fix.
New Stuff
- New website! http://clusterflow.io
  - New home of cluster flow documentation
  - Docs are now versioned along with CF version.
  - Online interactive demo and other cool stuff. Code can be found at https://github.com/ewels/clusterflow-website
- You can now run Cluster Flow locally (new @cluster_environment local )
  - Tested on Mac OSX and Linux. Includes --qstat and --qdel functionality
  - Allows easy testing and use of pipelines for those without access to a HPC cluster.
  - New --environment command line option allows you to set this at run time.
- The --make_config wizard has been renamed to --setup and does a lot more stuff
  - Should make first-run of Cluster Flow much easier - just download and run cf --setup
- Support for STAR RNA-seq aligner (thanks to @stu2)
- New ChIP-Seq analysis pipeline (thanks to @orzechoj)
  - Includes six new modules: bedToNrf, bedtools_bamToBed, deeptools_bamCoverage, deeptools_bamFingerprint, phantompeaktools_runSpp and picard_dedup
- Modules are given more information via the run file to help
  decide the amount of memory and cores they bid for (eg. number of files, reference)
- Running job stats (cf --qstat) now has a summary at the bottom, and counts running and completed jobs.
- New config option @time_multiplier to globally adjust time requests for slow clusters
  - To stop things getting silly, there's also @max_time to put a top limit on what's possible..
- Added a special case value null for the @environment_module_alias config option, to prevent Cluster Flow from loading that specific module.
- All perl scripts now have env perl in shebang to increase portability
- Modules can now have file extensions, as long as they have .cfmod at the end of the basename
  - This helps editing tools with syntax highlighting, amongst other things
- Python comes to Cluster Flow! The first Python module is up and running, along with a Helpers.py module file
  - See the example_module.py file for help in writing your own modules in Python
  - The basic Perl module helpers are now available in the Python packages as well, more translation to follow
- Now using GRIDEngine h_vmem memory option instead of vf
  - Gives a hard memory limit instead of a request limit at job submission time
  - Thanks to @stu2 and @s-andrews
- Support for explicit GRIDEngine queue nomination on the command line
- Modules now print their software versions to the log where possible.
- New --merge (command line) and @merge_regex (config file) options to automatically merge input files.
  - This is implemented using a new module, cf_merge_files, which can also be used in pipelines
  - If the supplied regexes only match single files, the module can be used to simply rename files
- New --runfile_prefix option to help avoid potential filename clashes.
- New @cluster_project config option to specify project for cluster jobs.
- Added compatability with GRIDEngine ~/.sge_request files (by ignoring them).
  Thanks to @s-andrews
- New tophat module called tophat which introduces a workaround for buggy MAPQ
  reporting by tophat whilst keeping unique alignments. Thanks to @FelixKrueger.
  - The previous tophat module is still available if you're not interested in MAPQ scores and
    would like slightly faster processing. This is now called tophat_broken_MAPQ.cfmod.
- Pipeline completion e-mails are now written to disk as well (HTML and plain text)
- New log file containing the job submission commands as well as the output received from the cluster at submission (usually numeric job identifiers)
- Removed the --qstatcols command line option and added the @colourful config option to replace it
  - The config wizard is also updated to add this to your personal config
- Added checks to make sure that we have at least one config file, and that the cluster environment is set
- Added new @environment_module_always config option to always load certain environment modules at run time.
- Added new @require_python_package pipeline option to check that a Python pacakge is installed before pipeline launch
Bugs Squashed
- Fixed output filename problem in tophat with output cleaning
- Fixed bugs causing minimum memory allocation regardless of availability
- Fixed bug causing Bowtie2 to fail if Bowtie1 index absent
- Cleaned up some unrecognised output that always made it into the log file
- Allow multiple merge regexes

Assets 2

11 May 15:02

ewels

v0.3docs

39e35fd

Redeveloped documentation for v0.3 Pre-release

Pre-release

This tag marks the finish point of the docs redevelopment, as part of moving to http://clusterflow.io

Note that the actual code is v0.4devel.

Assets 2

11 Jul 09:39

ewels

v0.3

3e3dbea

Version 0.3

Version v0.3 of Cluster Flow. It's a big one!

Download

Download Cluster Flow v0.3 (.zip)
Download Cluster Flow v0.3 (.tar.gz)

Changelog

New Stuff
- Awesome new HTML report e-mails
  - Much more readable HTML report e-mails which look super-snazzy (see example)
  - Any errors are highlighted making them quick to identify (see example)
  - Custom strings set in the config can flagged as highlights or as warnings
  - Designed to work on desktop and mobile phone screens
- Cluster Flow now re-orders the log file so that output from different modules doesn't overlap
  - Made each module prepend its stdout and sterr with a CF module flag
  - Made the cf_run_finished module parse the above flag and print out module by module
- Rewrote how the environment module loading works - now much more robust
  - Uses modulecmd to prepare perl syntax commands
  - Moved code into the load_environment_modules function in Helpers.pm
- Added environment module aliases
  - This allows you to load specific environment module versions or use different names to those specified within CF modules
  - eg. Replace fastqc with FastQC/0.11.2
Updates
- Made the bismark_methXtract module create genome-wide coverage reports if GTF path is available
- Added the -q parameter to the FastQC module to make the log files cleaner
- Removed the now uneccesary bismark_tidy module and renamed bismark_messy to bismark_report
Bugs Squashed
- Fixed dependency bug introduced in v0.2 which was making all downloads fire simultaneously
- Fixed issue where modules using the CF::Constants Perl Module couldn't load the central config file
- Fixed typo in environment module loading in bismark_align module
- Reordered loading of the environment modules in trim_galore so that FastQC is loaded first, fixing dependency issues

Assets 2

29 May 18:11

ewels

v0.2

928e637

Version 0.2

Version 0.2, released 2014-05-29.

New Stuff
- Now compatable with SLURM
- Customise batch job commands in the config (see docs for more info)
- Created new GitHub pages website to hold documentation: http://ewels.github.io/clusterflow/
Updates
- Ported repository to github: https://github.com/ewels/clusterflow
- Wrote new readme for github
Bugs Squashed
- Custom modules in ~/clusterflow/modules/ weren’t being found
- General code clean-ups all over the place

Assets 2

16 May 09:36

ewels

v0.1

88f2e8e

Version 0.1

Initial release of Cluster Flow v0.1

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Download

Changelog

Download

Changelog

Navigation Menu

Releases: ewels/clusterflow

Cluster Flow v0.5

Cluster Flow v0.4

Download

Changelog

Redeveloped documentation for v0.3

Version 0.3

Download

Changelog

Version 0.2

Version 0.1