You can obtain multiple amino acid windows of a defined size (each shifted by one residue position towards the C-terminus) from a protein sequence using the ``SequencePreprocessor().get_sliding_aa_window()`` method. We first create an example sequence and the ``SequencePreprocessor()`` object as follows:

In [1]:
import aaanalysis as aa

seq = "ABCDEFGHIJ"
sp = aa.SequencePreprocessor()

Provide the sequence as the ``seq`` parameter and specify a stop position using the ``slide_stop`` parameter:

In [2]:
# Get all 6 amino acid windows of size 5
list_windows = sp.get_sliding_aa_window(seq=seq, slide_start=0, slide_stop=9)
print(list_windows)

['ABCDE', 'BCDEF', 'CDEFG', 'DEFGH', 'EFGHI', 'FGHIJ']
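
To make the window arithmetic explicit, the following is a minimal plain-Python sketch that reproduces the output above. The helper ``sliding_windows`` is hypothetical and not part of ``aaanalysis``; it assumes that ``slide_stop`` is an inclusive 0-based position and that the window size defaults to 5, as the example output suggests:

```python
def sliding_windows(seq, slide_start=0, slide_stop=None, window_size=5):
    """Hypothetical helper: windows fully contained in [slide_start, slide_stop]."""
    if slide_stop is None:
        # Assumption: without a stop position, slide until the last residue
        slide_stop = len(seq) - 1
    # Number of windows that fit between start and stop (both inclusive)
    n_windows = max(slide_stop - slide_start - window_size + 2, 0)
    return [seq[i:i + window_size] for i in range(slide_start, slide_start + n_windows)]


windows = sliding_windows("ABCDEFGHIJ", slide_start=0, slide_stop=9)
print(windows)
```

With 10 residues and a window size of 5, the last window can start at position 5, which gives 6 windows in total.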


You can change the start position (default=0) using the ``slide_start`` parameter:

In [3]:
# Get all 3 amino acid windows of size 5
list_windows = sp.get_sliding_aa_window(seq=seq, slide_start=3, slide_stop=9)
print(list_windows)

['DEFGH', 'EFGHI', 'FGHIJ']


You can adjust the window length using the ``window_size`` parameter:

In [4]:
# Get 2 amino acid windows of size 8 (starting from second residue)
list_windows = sp.get_sliding_aa_window(seq=seq, slide_start=1, window_size=8)
print(list_windows)

['BCDEFGHI', 'CDEFGHIJ']


If you wish to start counting residue positions from 1 instead of 0, set ``index1=True``:

In [5]:
# Get 3 amino acid windows of size 8 (starting from first residue)
list_windows = sp.get_sliding_aa_window(seq=seq, slide_start=1, window_size=8, index1=True)
print(list_windows)

['ABCDEFGH', 'BCDEFGHI', 'CDEFGHIJ']
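
The effect of 1-based indexing can be sketched in plain Python by shifting the positions down by one before slicing. The helper ``sliding_windows_index1`` below is hypothetical (not an ``aaanalysis`` function) and assumes that both start and stop positions are interpreted as 1-based and inclusive:

```python
def sliding_windows_index1(seq, slide_start=1, slide_stop=None, window_size=5):
    """Hypothetical helper: sliding windows with 1-based, inclusive positions."""
    # Convert 1-based positions to 0-based positions for slicing
    start0 = slide_start - 1
    stop0 = len(seq) - 1 if slide_stop is None else slide_stop - 1
    # Number of windows that fit between start and stop (both inclusive)
    n_windows = max(stop0 - start0 - window_size + 2, 0)
    return [seq[i:i + window_size] for i in range(start0, start0 + n_windows)]


windows = sliding_windows_index1("ABCDEFGHIJ", slide_start=1, window_size=8)
print(windows)
```

Under this assumption, ``slide_start=1`` refers to the first residue, so one more window is obtained than with the same value in 0-based counting.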


Windows that extend beyond the end of the sequence are padded with gap characters (default='-'). This happens when ``slide_stop`` lies beyond the last residue of the sequence. Gaps can be disallowed by setting ``accept_gap=False`` (enabled by default):

In [6]:
# Get 3 amino acid windows of size 10 (until residue position 12)
list_windows = sp.get_sliding_aa_window(seq=seq, slide_stop=11, window_size=10, accept_gap=True)
print(list_windows)

['ABCDEFGHIJ', 'BCDEFGHIJ-', 'CDEFGHIJ--']
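
The gap padding can be illustrated with a short plain-Python sketch: the sequence is extended with gap characters up to the stop position before the windows are cut. The helper ``sliding_windows_with_gaps`` is hypothetical and only mirrors the output shown above; it is not the library's implementation and assumes an inclusive 0-based ``slide_stop``:

```python
def sliding_windows_with_gaps(seq, slide_start=0, slide_stop=None, window_size=5, gap="-"):
    """Hypothetical helper: pad with gaps so windows may run past the sequence end."""
    if slide_stop is None:
        slide_stop = len(seq) - 1
    # Append one gap character for every position beyond the last residue
    padded = seq + gap * max(slide_stop + 1 - len(seq), 0)
    # Number of windows that fit between start and stop (both inclusive)
    n_windows = max(slide_stop - slide_start - window_size + 2, 0)
    return [padded[i:i + window_size] for i in range(slide_start, slide_start + n_windows)]


windows = sliding_windows_with_gaps("ABCDEFGHIJ", slide_stop=11, window_size=10)
print(windows)
```

Here the stop position 11 lies two positions beyond the last residue (position 9), so the last window ends with two gap characters.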
