Amino acid dipeptide frequency for Bat polyomavirus 6d

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
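As an illustration only, a minimal Python sketch of how such frequencies could be counted is given below. It assumes overlapping pairs counted within each protein, one-letter residue codes, and normalisation by the total number of pairs; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and this is not necessarily the procedure used to build the table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein.
        # Pairs never span the boundary between two proteins (an assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not real viral proteins.
toy = ["MAAK", "AAL"]
freqs = dipeptide_per_mille(toy)
```

With the toy input there are five pairs in total (MA, AA, AK, AA, AL), so `freqs["AA"]` is two fifths of the total.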

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
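The expected value and the colour rule can be sketched in a few lines of Python. This assumes the 400-pair baseline (1/400) is the threshold and that values are in per mille; the names `EXPECTED_PER_MILLE` and `classify` are hypothetical, and the exact rule used for the table colours may differ.

```python
STANDARD_PAIRS = 20 * 20                      # 400 ordered pairs of standard amino acids
EXPECTED_PER_MILLE = 1000.0 / STANDARD_PAIRS  # 2.5 per mille, i.e. 0.25%


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide as over- or underrepresented relative to the uniform baseline."""
    if observed_per_mille > expected:
        return "red"      # more frequent than expected
    if observed_per_mille < expected:
        return "blue"     # less frequent than expected
    return "neutral"      # exactly at the baseline


label_ala_ala = classify(9.229)  # AlaAla value taken from the table
label_ala_his = classify(0.659)  # AlaHis value taken from the table
```

Under this assumption AlaAla (9.229‰) would be marked red and AlaHis (0.659‰) blue.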

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
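For readers who want to work with the numbers programmatically, the conversion can be sketched as follows. The snippet assumes entries of the form `AlaAla: 9.229 ± 4.385` as they appear in the table; the pattern `ENTRY` and the function `parse_entry` are hypothetical helpers, not part of any Proteome-pI tooling.

```python
import re

# Two three-letter residue codes, then "value ± error" in per mille.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([\d.]+) ± ([\d.]+)")


def parse_entry(text):
    """Return (first, second, frequency, error), with both numbers as fractions."""
    match = ENTRY.search(text)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {text!r}")
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3


first, second, freq, err = parse_entry("9.229AlaAla: 9.229 ± 4.385")
```

Here `freq` is roughly 0.009229, i.e. AlaAla accounts for a little under 1% of all dipeptides.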

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.229AlaAla: 9.229 ± 4.385
1.978AlaCys: 1.978 ± 1.042
3.296AlaAsp: 3.296 ± 1.861
3.955AlaGlu: 3.955 ± 3.181
1.978AlaPhe: 1.978 ± 0.502
3.955AlaGly: 3.955 ± 2.336
0.659AlaHis: 0.659 ± 0.661
2.637AlaIle: 2.637 ± 1.087
3.955AlaLys: 3.955 ± 0.548
3.296AlaLeu: 3.296 ± 1.798
2.637AlaMet: 2.637 ± 0.908
3.296AlaAsn: 3.296 ± 1.95
0.659AlaPro: 0.659 ± 0.586
4.614AlaGln: 4.614 ± 1.936
3.296AlaArg: 3.296 ± 1.798
1.318AlaSer: 1.318 ± 1.322
5.274AlaThr: 5.274 ± 0.722
7.251AlaVal: 7.251 ± 3.221
0.659AlaTrp: 0.659 ± 0.396
1.978AlaTyr: 1.978 ± 1.029
0.0AlaXaa: 0.0 ± 0.0
Cys
0.659CysAla: 0.659 ± 1.026
0.0CysCys: 0.0 ± 0.0
1.318CysAsp: 1.318 ± 0.791
0.0CysGlu: 0.0 ± 0.0
1.318CysPhe: 1.318 ± 2.052
0.659CysGly: 0.659 ± 0.396
0.659CysHis: 0.659 ± 0.396
0.659CysIle: 0.659 ± 0.396
1.318CysLys: 1.318 ± 0.515
3.955CysLeu: 3.955 ± 2.084
1.318CysMet: 1.318 ± 0.955
0.659CysAsn: 0.659 ± 1.026
0.0CysPro: 0.0 ± 0.0
0.659CysGln: 0.659 ± 0.396
1.978CysArg: 1.978 ± 1.187
3.296CysSer: 3.296 ± 1.381
0.659CysThr: 0.659 ± 0.586
1.318CysVal: 1.318 ± 0.791
0.0CysTrp: 0.0 ± 0.0
2.637CysTyr: 2.637 ± 0.908
0.0CysXaa: 0.0 ± 0.0
Asp
3.296AspAla: 3.296 ± 1.117
1.318AspCys: 1.318 ± 0.791
3.296AspAsp: 3.296 ± 1.381
2.637AspGlu: 2.637 ± 1.254
5.933AspPhe: 5.933 ± 2.399
2.637AspGly: 2.637 ± 0.904
1.318AspHis: 1.318 ± 0.955
3.296AspIle: 3.296 ± 0.951
1.978AspLys: 1.978 ± 1.042
7.251AspLeu: 7.251 ± 2.31
2.637AspMet: 2.637 ± 1.593
1.978AspAsn: 1.978 ± 1.187
2.637AspPro: 2.637 ± 1.684
1.978AspGln: 1.978 ± 0.502
0.659AspArg: 0.659 ± 0.396
4.614AspSer: 4.614 ± 0.511
2.637AspThr: 2.637 ± 0.515
1.318AspVal: 1.318 ± 0.61
1.978AspTrp: 1.978 ± 2.002
1.318AspTyr: 1.318 ± 0.515
0.0AspXaa: 0.0 ± 0.0
Glu
3.955GluAla: 3.955 ± 1.004
1.318GluCys: 1.318 ± 0.791
6.592GluAsp: 6.592 ± 2.046
11.206GluGlu: 11.206 ± 2.516
2.637GluPhe: 2.637 ± 0.872
1.978GluGly: 1.978 ± 1.278
1.318GluHis: 1.318 ± 0.955
4.614GluIle: 4.614 ± 2.045
5.933GluLys: 5.933 ± 1.908
5.933GluLeu: 5.933 ± 1.925
0.0GluMet: 0.0 ± 0.0
6.592GluAsn: 6.592 ± 1.81
1.978GluPro: 1.978 ± 1.029
5.933GluGln: 5.933 ± 2.704
0.659GluArg: 0.659 ± 1.026
3.955GluSer: 3.955 ± 2.084
3.296GluThr: 3.296 ± 0.951
7.251GluVal: 7.251 ± 2.894
0.659GluTrp: 0.659 ± 0.396
0.659GluTyr: 0.659 ± 0.396
0.0GluXaa: 0.0 ± 0.0
Phe
1.318PheAla: 1.318 ± 0.742
3.955PheCys: 3.955 ± 2.084
2.637PheAsp: 2.637 ± 1.023
2.637PheGlu: 2.637 ± 1.087
2.637PhePhe: 2.637 ± 1.243
4.614PheGly: 4.614 ± 1.806
1.318PheHis: 1.318 ± 0.955
0.659PheIle: 0.659 ± 0.396
3.296PheLys: 3.296 ± 1.978
7.251PheLeu: 7.251 ± 1.604
0.659PheMet: 0.659 ± 0.396
3.955PheAsn: 3.955 ± 0.682
2.637PhePro: 2.637 ± 1.22
1.318PheGln: 1.318 ± 0.955
0.659PheArg: 0.659 ± 0.661
1.978PheSer: 1.978 ± 0.788
3.955PheThr: 3.955 ± 1.27
1.978PheVal: 1.978 ± 0.788
0.659PheTrp: 0.659 ± 0.661
0.659PheTyr: 0.659 ± 0.586
0.0PheXaa: 0.0 ± 0.0
Gly
3.955GlyAla: 3.955 ± 1.005
0.659GlyCys: 0.659 ± 1.026
2.637GlyAsp: 2.637 ± 1.055
5.274GlyGlu: 5.274 ± 1.938
2.637GlyPhe: 2.637 ± 0.515
7.251GlyGly: 7.251 ± 1.393
0.659GlyHis: 0.659 ± 1.026
5.933GlyIle: 5.933 ± 1.38
2.637GlyLys: 2.637 ± 1.023
5.274GlyLeu: 5.274 ± 1.1
0.0GlyMet: 0.0 ± 0.0
3.296GlyAsn: 3.296 ± 1.804
4.614GlyPro: 4.614 ± 2.834
3.296GlyGln: 3.296 ± 0.52
2.637GlyArg: 2.637 ± 1.894
1.978GlySer: 1.978 ± 1.029
1.318GlyThr: 1.318 ± 0.515
3.296GlyVal: 3.296 ± 1.518
0.0GlyTrp: 0.0 ± 0.0
1.318GlyTyr: 1.318 ± 0.742
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.659HisCys: 0.659 ± 1.026
0.0HisAsp: 0.0 ± 0.0
1.318HisGlu: 1.318 ± 1.082
1.318HisPhe: 1.318 ± 0.515
0.659HisGly: 0.659 ± 0.586
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
2.637HisLys: 2.637 ± 0.908
3.955HisLeu: 3.955 ± 1.866
0.0HisMet: 0.0 ± 0.0
0.659HisAsn: 0.659 ± 1.026
1.318HisPro: 1.318 ± 0.955
0.659HisGln: 0.659 ± 0.661
1.318HisArg: 1.318 ± 0.791
1.978HisSer: 1.978 ± 0.941
0.659HisThr: 0.659 ± 0.396
0.659HisVal: 0.659 ± 0.661
0.0HisTrp: 0.0 ± 0.0
0.659HisTyr: 0.659 ± 0.661
0.0HisXaa: 0.0 ± 0.0
Ile
1.318IleAla: 1.318 ± 1.082
0.659IleCys: 0.659 ± 0.396
3.955IleAsp: 3.955 ± 1.83
3.955IleGlu: 3.955 ± 1.27
1.318IlePhe: 1.318 ± 0.791
1.318IleGly: 1.318 ± 0.515
0.659IleHis: 0.659 ± 0.396
3.296IleIle: 3.296 ± 1.381
2.637IleLys: 2.637 ± 0.908
6.592IleLeu: 6.592 ± 1.81
0.0IleMet: 0.0 ± 0.786
0.659IleAsn: 0.659 ± 0.396
1.978IlePro: 1.978 ± 1.029
0.659IleGln: 0.659 ± 0.586
3.296IleArg: 3.296 ± 1.046
5.933IleSer: 5.933 ± 1.596
7.251IleThr: 7.251 ± 1.353
0.0IleVal: 0.0 ± 0.0
0.0IleTrp: 0.0 ± 0.0
0.659IleTyr: 0.659 ± 0.396
0.0IleXaa: 0.0 ± 0.0
Lys
1.318LysAla: 1.318 ± 1.123
1.978LysCys: 1.978 ± 1.187
1.318LysAsp: 1.318 ± 0.955
3.955LysGlu: 3.955 ± 1.124
1.318LysPhe: 1.318 ± 0.791
3.955LysGly: 3.955 ± 1.124
2.637LysHis: 2.637 ± 1.254
2.637LysIle: 2.637 ± 1.254
9.888LysLys: 9.888 ± 2.207
5.274LysLeu: 5.274 ± 0.953
1.978LysMet: 1.978 ± 0.941
3.955LysAsn: 3.955 ± 1.413
0.659LysPro: 0.659 ± 0.586
2.637LysGln: 2.637 ± 1.254
7.251LysArg: 7.251 ± 1.521
3.955LysSer: 3.955 ± 1.004
5.933LysThr: 5.933 ± 1.457
1.978LysVal: 1.978 ± 1.187
0.0LysTrp: 0.0 ± 0.0
2.637LysTyr: 2.637 ± 1.023
0.0LysXaa: 0.0 ± 0.0
Leu
8.57LeuAla: 8.57 ± 4.799
3.296LeuCys: 3.296 ± 1.381
6.592LeuAsp: 6.592 ± 2.413
7.251LeuGlu: 7.251 ± 2.835
8.57LeuPhe: 8.57 ± 1.643
3.955LeuGly: 3.955 ± 1.004
0.659LeuHis: 0.659 ± 0.586
4.614LeuIle: 4.614 ± 0.802
3.955LeuLys: 3.955 ± 1.791
12.525LeuLeu: 12.525 ± 1.355
2.637LeuMet: 2.637 ± 0.908
7.91LeuAsn: 7.91 ± 1.067
5.274LeuPro: 5.274 ± 0.363
5.274LeuGln: 5.274 ± 1.1
3.955LeuArg: 3.955 ± 0.83
9.888LeuSer: 9.888 ± 1.152
3.955LeuThr: 3.955 ± 1.266
3.296LeuVal: 3.296 ± 0.772
0.0LeuTrp: 0.0 ± 0.0
5.274LeuTyr: 5.274 ± 1.966
0.0LeuXaa: 0.0 ± 0.0
Met
1.978MetAla: 1.978 ± 1.162
0.0MetCys: 0.0 ± 0.0
3.296MetAsp: 3.296 ± 2.885
1.318MetGlu: 1.318 ± 0.791
0.0MetPhe: 0.0 ± 0.0
2.637MetGly: 2.637 ± 0.904
1.318MetHis: 1.318 ± 0.955
0.659MetIle: 0.659 ± 0.586
0.659MetLys: 0.659 ± 0.586
1.318MetLeu: 1.318 ± 1.123
0.0MetMet: 0.0 ± 0.0
0.659MetAsn: 0.659 ± 0.586
1.978MetPro: 1.978 ± 0.707
1.978MetGln: 1.978 ± 0.941
1.318MetArg: 1.318 ± 0.955
0.0MetSer: 0.0 ± 0.0
0.659MetThr: 0.659 ± 0.586
1.318MetVal: 1.318 ± 0.791
0.659MetTrp: 0.659 ± 0.586
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.955AsnAla: 3.955 ± 1.164
0.659AsnCys: 0.659 ± 0.396
1.318AsnAsp: 1.318 ± 0.515
2.637AsnGlu: 2.637 ± 2.342
3.296AsnPhe: 3.296 ± 1.013
1.978AsnGly: 1.978 ± 0.707
1.318AsnHis: 1.318 ± 1.123
2.637AsnIle: 2.637 ± 0.744
2.637AsnLys: 2.637 ± 1.029
5.933AsnLeu: 5.933 ± 1.79
2.637AsnMet: 2.637 ± 0.942
2.637AsnAsn: 2.637 ± 1.055
4.614AsnPro: 4.614 ± 1.161
1.318AsnGln: 1.318 ± 0.791
1.318AsnArg: 1.318 ± 1.123
0.659AsnSer: 0.659 ± 0.661
2.637AsnThr: 2.637 ± 1.243
4.614AsnVal: 4.614 ± 1.806
0.659AsnTrp: 0.659 ± 0.661
1.318AsnTyr: 1.318 ± 0.742
0.0AsnXaa: 0.0 ± 0.0
Pro
3.296ProAla: 3.296 ± 1.861
1.318ProCys: 1.318 ± 0.515
5.274ProAsp: 5.274 ± 1.488
5.274ProGlu: 5.274 ± 0.722
0.659ProPhe: 0.659 ± 0.396
4.614ProGly: 4.614 ± 1.34
0.0ProHis: 0.0 ± 0.0
3.296ProIle: 3.296 ± 1.438
3.296ProLys: 3.296 ± 0.772
5.274ProLeu: 5.274 ± 1.1
1.978ProMet: 1.978 ± 0.941
0.659ProAsn: 0.659 ± 0.586
4.614ProPro: 4.614 ± 1.518
1.318ProGln: 1.318 ± 0.515
1.978ProArg: 1.978 ± 1.757
0.659ProSer: 0.659 ± 0.661
2.637ProThr: 2.637 ± 1.087
1.318ProVal: 1.318 ± 1.171
0.0ProTrp: 0.0 ± 0.0
3.955ProTyr: 3.955 ± 0.769
0.0ProXaa: 0.0 ± 0.0
Gln
3.955GlnAla: 3.955 ± 1.866
1.318GlnCys: 1.318 ± 0.955
0.0GlnAsp: 0.0 ± 0.0
3.296GlnGlu: 3.296 ± 1.434
3.955GlnPhe: 3.955 ± 1.791
1.978GlnGly: 1.978 ± 0.941
0.0GlnHis: 0.0 ± 0.0
3.296GlnIle: 3.296 ± 0.52
1.318GlnLys: 1.318 ± 0.791
4.614GlnLeu: 4.614 ± 0.802
0.0GlnMet: 0.0 ± 0.0
1.978GlnAsn: 1.978 ± 1.029
3.955GlnPro: 3.955 ± 1.164
1.978GlnGln: 1.978 ± 0.788
3.296GlnArg: 3.296 ± 2.502
0.0GlnSer: 0.0 ± 0.0
1.318GlnThr: 1.318 ± 0.515
4.614GlnVal: 4.614 ± 1.975
0.659GlnTrp: 0.659 ± 0.396
2.637GlnTyr: 2.637 ± 0.515
0.0GlnXaa: 0.0 ± 0.0
Arg
3.955ArgAla: 3.955 ± 2.336
0.0ArgCys: 0.0 ± 0.0
3.296ArgAsp: 3.296 ± 1.381
4.614ArgGlu: 4.614 ± 1.804
1.978ArgPhe: 1.978 ± 0.788
1.318ArgGly: 1.318 ± 1.123
1.978ArgHis: 1.978 ± 0.788
0.659ArgIle: 0.659 ± 0.586
3.955ArgLys: 3.955 ± 2.312
3.955ArgLeu: 3.955 ± 0.682
1.978ArgMet: 1.978 ± 1.029
1.978ArgAsn: 1.978 ± 0.899
0.0ArgPro: 0.0 ± 0.0
0.0ArgGln: 0.0 ± 0.0
0.659ArgArg: 0.659 ± 0.661
4.614ArgSer: 4.614 ± 1.599
2.637ArgThr: 2.637 ± 0.515
3.955ArgVal: 3.955 ± 1.11
0.659ArgTrp: 0.659 ± 0.661
3.955ArgTyr: 3.955 ± 1.572
0.0ArgXaa: 0.0 ± 0.0
Ser
5.933SerAla: 5.933 ± 1.97
1.318SerCys: 1.318 ± 2.052
3.296SerAsp: 3.296 ± 1.7
1.978SerGlu: 1.978 ± 1.162
1.318SerPhe: 1.318 ± 0.515
1.318SerGly: 1.318 ± 0.742
1.318SerHis: 1.318 ± 0.61
2.637SerIle: 2.637 ± 0.908
1.978SerLys: 1.978 ± 1.187
6.592SerLeu: 6.592 ± 0.45
1.318SerMet: 1.318 ± 0.564
1.978SerAsn: 1.978 ± 1.187
2.637SerPro: 2.637 ± 1.257
6.592SerGln: 6.592 ± 2.654
3.296SerArg: 3.296 ± 1.353
5.274SerSer: 5.274 ± 1.641
1.978SerThr: 1.978 ± 1.029
4.614SerVal: 4.614 ± 1.34
2.637SerTrp: 2.637 ± 1.831
0.659SerTyr: 0.659 ± 0.661
0.0SerXaa: 0.0 ± 0.0
Thr
3.296ThrAla: 3.296 ± 1.798
0.659ThrCys: 0.659 ± 0.586
3.296ThrAsp: 3.296 ± 0.772
7.91ThrGlu: 7.91 ± 1.983
1.318ThrPhe: 1.318 ± 0.791
4.614ThrGly: 4.614 ± 1.212
0.659ThrHis: 0.659 ± 0.661
1.318ThrIle: 1.318 ± 0.515
1.318ThrLys: 1.318 ± 0.61
8.57ThrLeu: 8.57 ± 1.853
0.659ThrMet: 0.659 ± 0.586
3.296ThrAsn: 3.296 ± 1.861
5.274ThrPro: 5.274 ± 0.608
1.978ThrGln: 1.978 ± 0.707
3.955ThrArg: 3.955 ± 1.124
2.637ThrSer: 2.637 ± 1.023
7.91ThrThr: 7.91 ± 1.627
1.978ThrVal: 1.978 ± 0.502
1.318ThrTrp: 1.318 ± 1.123
0.659ThrTyr: 0.659 ± 0.661
0.0ThrXaa: 0.0 ± 0.0
Val
3.296ValAla: 3.296 ± 1.983
0.659ValCys: 0.659 ± 0.396
2.637ValAsp: 2.637 ± 1.087
3.955ValGlu: 3.955 ± 0.83
2.637ValPhe: 2.637 ± 1.055
5.274ValGly: 5.274 ± 1.689
1.318ValHis: 1.318 ± 1.171
1.978ValIle: 1.978 ± 1.072
6.592ValLys: 6.592 ± 1.709
5.274ValLeu: 5.274 ± 1.689
0.0ValMet: 0.0 ± 0.0
2.637ValAsn: 2.637 ± 1.023
3.296ValPro: 3.296 ± 0.772
1.318ValGln: 1.318 ± 0.791
0.659ValArg: 0.659 ± 0.586
3.296ValSer: 3.296 ± 1.117
4.614ValThr: 4.614 ± 1.514
1.318ValVal: 1.318 ± 0.515
1.318ValTrp: 1.318 ± 1.082
1.978ValTyr: 1.978 ± 1.042
0.0ValXaa: 0.0 ± 0.0
Trp
1.978TrpAla: 1.978 ± 1.983
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.978TrpGlu: 1.978 ± 0.502
1.318TrpPhe: 1.318 ± 0.955
0.659TrpGly: 0.659 ± 1.026
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.318TrpLys: 1.318 ± 0.791
1.978TrpLeu: 1.978 ± 0.899
0.659TrpMet: 0.659 ± 1.026
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.659TrpGln: 0.659 ± 0.661
1.318TrpArg: 1.318 ± 0.955
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.318TrpVal: 1.318 ± 1.123
0.659TrpTrp: 0.659 ± 0.396
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.659TyrAla: 0.659 ± 0.396
1.318TyrCys: 1.318 ± 0.515
0.659TyrAsp: 0.659 ± 0.661
1.318TyrGlu: 1.318 ± 0.955
2.637TyrPhe: 2.637 ± 1.485
3.296TyrGly: 3.296 ± 1.046
0.659TyrHis: 0.659 ± 0.396
1.978TyrIle: 1.978 ± 0.899
3.955TyrLys: 3.955 ± 0.548
2.637TyrLeu: 2.637 ± 1.254
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
3.296TyrPro: 3.296 ± 1.861
0.0TyrGln: 0.0 ± 0.0
2.637TyrArg: 2.637 ± 1.593
2.637TyrSer: 2.637 ± 0.904
3.296TyrThr: 3.296 ± 1.798
0.659TyrVal: 0.659 ± 0.396
1.318TyrTrp: 1.318 ± 0.791
0.659TyrTyr: 0.659 ± 0.661
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1518 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
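A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates; the source does not state these details, and the names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Spread of the frequency when proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    # Assumption: the error is the standard deviation over replicates.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome of four short sequences.
toy = ["MAAK", "AAL", "KKLA", "GAAG"]
error = bootstrap_error(toy, "AA")
```

With only four proteins, as in this proteome, each replicate can differ strongly from the original set, which is consistent with the relatively large errors in the table.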

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski