Skip to content

GHDNA is an experimental hash function that takes a DNA sequence as an input and provides a unique signature in the output. The signature provided by the function is a constant length sequence of digits.

License

Notifications You must be signed in to change notification settings

Gagniuc/GHDNA-hash-function

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

94 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GHDNA hash function

A cryptographic hash function is a one-way function which provides a unique signature for any variable length sequence from a given finite set. Note that "one-way" means that an input alwais leads to the same output, but not vice versa. Thus, any input leads to a unique output signature of a constant length. GHDNA is an experimental hash function that takes a DNA sequence as an input and provides a unique signature in the output. The signature provided by the function is a constant length sequence of digits, such as:

Input output
TATTCGGATCACGGACGTACC 74499889294608
TATTCGGATCACGGACGTACA 40651499483769
ATTCGGATCACGGACGTACC 44170498343430
TTCGGATCACGGACGTACC 98964487625810
ATCACGGAC 59451027176382

The GHDNA project contains a series of independent applications that use the GHDNA hashing function. Some of them are used for testing, and others are used as a demo for applications. The main quality that a cryptographic hashing function must have is to evenly distribute the hashing keys over the domain range. This distribution is determined using the GHDNA Domain test application. The GHDNA and GHDNA DATA BLOCK applications represent the simple version that shows how the GHDNA function can be used directly. The GHDNA Avalanche test application demonstrates how tiny changes in the input sequence can generate totally different and unpredictable hashing keys. Collisions are also tested, where it is checked if a hashing key is also associated with another previous imput. The GHDNA Speed test application measures the processing time of the GHDNA function in order to be able to compare it with other cryptographic functions (please see the attached article). In terms of processing time, GHDNA is very fast compared to the existing ones, but this speed is a bit relative and may also be due to the lack of complexity when compared to the often used cryptographic hashing functions. The GHDNA database engine application uses the GHDNA cryptographic function to perform a demonstration within a primitive database engine. This experimental hash function also uses a new algorithm called Dynamic Block Allocation (DBA), which can be found [here]. Note: in the BASIC family of computer languages, the "^" character represents exponentiation.

References

  • Paul A. Gagniuc and Constantin Ionescu-Tîrgovişte. GHDNA: a hash function for DNA segment-based aligments and motif search. Proc. Rom. Acad., Series B, 2014, 16(3), p. 155–167.
  • P. Gagniuc and C Ionescu-Tirgoviste. Dynamic block allocation for biological sequences. Proc. Rom. Acad., Series B, 2013, 15(3), p. 233-240.

About

GHDNA is an experimental hash function that takes a DNA sequence as an input and provides a unique signature in the output. The signature provided by the function is a constant length sequence of digits.

Resources

License

Stars

Watchers

Forks

Sponsor this project

 

Packages