Amino acid dipepetide frequency for Banana streak OL virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.801AlaAla: 2.801 ± 0.936
0.934AlaCys: 0.934 ± 0.444
3.268AlaAsp: 3.268 ± 0.925
3.735AlaGlu: 3.735 ± 1.776
1.401AlaPhe: 1.401 ± 0.666
2.801AlaGly: 2.801 ± 2.472
1.401AlaHis: 1.401 ± 2.833
4.669AlaIle: 4.669 ± 1.994
3.735AlaLys: 3.735 ± 0.729
4.202AlaLeu: 4.202 ± 2.663
1.867AlaMet: 1.867 ± 0.888
1.401AlaAsn: 1.401 ± 0.666
2.801AlaPro: 2.801 ± 1.332
3.735AlaGln: 3.735 ± 1.946
2.334AlaArg: 2.334 ± 1.11
2.334AlaSer: 2.334 ± 1.11
3.735AlaThr: 3.735 ± 2.994
1.867AlaVal: 1.867 ± 1.101
0.467AlaTrp: 0.467 ± 0.222
2.334AlaTyr: 2.334 ± 1.11
0.0AlaXaa: 0.0 ± 0.0
Cys
0.934CysAla: 0.934 ± 0.444
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
2.334CysGlu: 2.334 ± 1.11
0.934CysPhe: 0.934 ± 0.444
1.401CysGly: 1.401 ± 0.666
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.867CysLys: 1.867 ± 0.888
0.934CysLeu: 0.934 ± 0.444
0.467CysMet: 0.467 ± 0.222
0.467CysAsn: 0.467 ± 0.222
0.0CysPro: 0.0 ± 0.0
0.467CysGln: 0.467 ± 0.222
2.334CysArg: 2.334 ± 1.11
0.0CysSer: 0.0 ± 0.0
0.934CysThr: 0.934 ± 1.393
0.467CysVal: 0.467 ± 0.222
0.0CysTrp: 0.0 ± 0.0
0.467CysTyr: 0.467 ± 0.222
0.0CysXaa: 0.0 ± 0.0
Asp
2.801AspAla: 2.801 ± 0.936
0.467AspCys: 0.467 ± 0.222
4.202AspAsp: 4.202 ± 1.998
5.135AspGlu: 5.135 ± 1.12
2.801AspPhe: 2.801 ± 0.736
3.268AspGly: 3.268 ± 0.925
0.467AspHis: 0.467 ± 0.222
5.135AspIle: 5.135 ± 1.327
2.334AspLys: 2.334 ± 1.11
4.669AspLeu: 4.669 ± 6.605
0.467AspMet: 0.467 ± 0.406
3.268AspAsn: 3.268 ± 1.554
1.867AspPro: 1.867 ± 1.101
2.334AspGln: 2.334 ± 0.834
2.334AspArg: 2.334 ± 0.997
4.202AspSer: 4.202 ± 1.998
3.735AspThr: 3.735 ± 0.967
2.801AspVal: 2.801 ± 2.277
1.867AspTrp: 1.867 ± 1.101
1.867AspTyr: 1.867 ± 0.973
0.0AspXaa: 0.0 ± 0.0
Glu
4.202GluAla: 4.202 ± 3.416
0.467GluCys: 0.467 ± 0.222
7.47GluAsp: 7.47 ± 2.105
14.006GluGlu: 14.006 ± 3.794
2.801GluPhe: 2.801 ± 0.736
3.268GluGly: 3.268 ± 1.554
1.867GluHis: 1.867 ± 0.888
5.135GluIle: 5.135 ± 1.327
9.804GluLys: 9.804 ± 1.922
10.271GluLeu: 10.271 ± 4.567
2.801GluMet: 2.801 ± 1.332
3.268GluAsn: 3.268 ± 1.554
2.334GluPro: 2.334 ± 1.11
3.268GluGln: 3.268 ± 2.106
5.135GluArg: 5.135 ± 0.652
2.801GluSer: 2.801 ± 0.936
3.735GluThr: 3.735 ± 1.776
8.87GluVal: 8.87 ± 1.185
2.334GluTrp: 2.334 ± 1.11
3.735GluTyr: 3.735 ± 0.729
0.0GluXaa: 0.0 ± 0.0
Phe
0.934PheAla: 0.934 ± 0.444
0.934PheCys: 0.934 ± 0.444
1.867PheAsp: 1.867 ± 0.888
3.268PheGlu: 3.268 ± 0.698
0.934PhePhe: 0.934 ± 0.444
0.934PheGly: 0.934 ± 0.444
1.401PheHis: 1.401 ± 0.666
3.268PheIle: 3.268 ± 0.698
1.401PheLys: 1.401 ± 0.666
2.334PheLeu: 2.334 ± 1.956
1.401PheMet: 1.401 ± 0.666
0.467PheAsn: 0.467 ± 0.222
1.401PhePro: 1.401 ± 0.666
1.867PheGln: 1.867 ± 2.177
0.934PheArg: 0.934 ± 0.444
0.467PheSer: 0.467 ± 0.222
2.801PheThr: 2.801 ± 1.332
0.934PheVal: 0.934 ± 0.444
0.467PheTrp: 0.467 ± 0.222
2.801PheTyr: 2.801 ± 1.332
0.0PheXaa: 0.0 ± 0.0
Gly
2.801GlyAla: 2.801 ± 0.936
0.934GlyCys: 0.934 ± 0.444
1.867GlyAsp: 1.867 ± 1.101
6.536GlyGlu: 6.536 ± 0.231
2.801GlyPhe: 2.801 ± 1.332
1.401GlyGly: 1.401 ± 1.236
0.934GlyHis: 0.934 ± 0.444
1.867GlyIle: 1.867 ± 1.101
5.602GlyLys: 5.602 ± 1.301
4.202GlyLeu: 4.202 ± 1.998
1.401GlyMet: 1.401 ± 0.89
2.334GlyAsn: 2.334 ± 1.11
0.934GlyPro: 0.934 ± 0.444
0.467GlyGln: 0.467 ± 0.222
2.334GlyArg: 2.334 ± 0.834
2.334GlySer: 2.334 ± 2.624
5.602GlyThr: 5.602 ± 3.302
3.735GlyVal: 3.735 ± 0.967
0.934GlyTrp: 0.934 ± 0.444
1.401GlyTyr: 1.401 ± 0.666
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.467HisAsp: 0.467 ± 1.514
2.334HisGlu: 2.334 ± 0.834
0.934HisPhe: 0.934 ± 0.444
0.467HisGly: 0.467 ± 0.222
0.0HisHis: 0.0 ± 0.0
1.401HisIle: 1.401 ± 1.139
1.867HisLys: 1.867 ± 0.888
1.867HisLeu: 1.867 ± 0.888
0.934HisMet: 0.934 ± 0.444
0.467HisAsn: 0.467 ± 1.514
0.934HisPro: 0.934 ± 0.444
2.334HisGln: 2.334 ± 1.11
1.867HisArg: 1.867 ± 0.888
0.0HisSer: 0.0 ± 0.0
0.467HisThr: 0.467 ± 0.222
2.801HisVal: 2.801 ± 1.332
0.467HisTrp: 0.467 ± 0.222
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.334IleAla: 2.334 ± 0.997
1.867IleCys: 1.867 ± 0.888
5.602IleAsp: 5.602 ± 1.495
6.536IleGlu: 6.536 ± 1.693
2.801IlePhe: 2.801 ± 0.936
5.135IleGly: 5.135 ± 1.12
1.401IleHis: 1.401 ± 1.139
9.337IleIle: 9.337 ± 0.506
4.669IleLys: 4.669 ± 2.22
4.202IleLeu: 4.202 ± 1.054
2.801IleMet: 2.801 ± 1.332
3.735IleAsn: 3.735 ± 1.297
1.867IlePro: 1.867 ± 0.888
4.669IleGln: 4.669 ± 3.564
4.202IleArg: 4.202 ± 1.998
5.135IleSer: 5.135 ± 0.652
4.202IleThr: 4.202 ± 1.054
2.801IleVal: 2.801 ± 0.736
0.0IleTrp: 0.0 ± 0.0
3.735IleTyr: 3.735 ± 1.776
0.0IleXaa: 0.0 ± 0.0
Lys
3.268LysAla: 3.268 ± 2.106
1.867LysCys: 1.867 ± 0.888
6.069LysAsp: 6.069 ± 4.793
8.87LysGlu: 8.87 ± 1.769
3.735LysPhe: 3.735 ± 1.946
3.735LysGly: 3.735 ± 0.967
4.202LysHis: 4.202 ± 1.998
7.937LysIle: 7.937 ± 1.537
7.003LysLys: 7.003 ± 2.501
6.536LysLeu: 6.536 ± 3.033
3.268LysMet: 3.268 ± 1.009
5.135LysAsn: 5.135 ± 1.12
1.867LysPro: 1.867 ± 0.888
3.268LysGln: 3.268 ± 0.698
2.334LysArg: 2.334 ± 0.834
3.735LysSer: 3.735 ± 1.776
3.735LysThr: 3.735 ± 1.297
4.669LysVal: 4.669 ± 2.445
0.934LysTrp: 0.934 ± 0.444
2.334LysTyr: 2.334 ± 1.11
0.0LysXaa: 0.0 ± 0.0
Leu
3.735LeuAla: 3.735 ± 2.882
0.934LeuCys: 0.934 ± 1.393
5.135LeuAsp: 5.135 ± 1.557
7.937LeuGlu: 7.937 ± 0.969
0.0LeuPhe: 0.0 ± 0.0
4.202LeuGly: 4.202 ± 1.998
2.334LeuHis: 2.334 ± 0.834
5.135LeuIle: 5.135 ± 2.441
10.271LeuLys: 10.271 ± 5.842
4.669LeuLeu: 4.669 ± 2.445
0.934LeuMet: 0.934 ± 1.321
4.669LeuAsn: 4.669 ± 0.863
2.801LeuPro: 2.801 ± 1.332
2.334LeuGln: 2.334 ± 0.834
6.069LeuArg: 6.069 ± 3.252
8.403LeuSer: 8.403 ± 4.194
3.735LeuThr: 3.735 ± 6.26
4.669LeuVal: 4.669 ± 1.667
1.401LeuTrp: 1.401 ± 1.139
2.334LeuTyr: 2.334 ± 1.11
0.0LeuXaa: 0.0 ± 0.0
Met
1.867MetAla: 1.867 ± 0.888
0.467MetCys: 0.467 ± 0.222
2.801MetAsp: 2.801 ± 1.332
2.801MetGlu: 2.801 ± 1.332
0.467MetPhe: 0.467 ± 0.222
0.934MetGly: 0.934 ± 1.321
0.467MetHis: 0.467 ± 0.222
2.334MetIle: 2.334 ± 1.11
3.735MetLys: 3.735 ± 1.776
1.401MetLeu: 1.401 ± 0.666
0.467MetMet: 0.467 ± 0.222
1.867MetAsn: 1.867 ± 0.888
1.867MetPro: 1.867 ± 0.888
0.467MetGln: 0.467 ± 0.222
0.467MetArg: 0.467 ± 0.222
0.467MetSer: 0.467 ± 1.567
3.735MetThr: 3.735 ± 1.776
1.401MetVal: 1.401 ± 1.139
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.801AsnAla: 2.801 ± 1.332
0.934AsnCys: 0.934 ± 0.444
2.334AsnAsp: 2.334 ± 0.834
4.202AsnGlu: 4.202 ± 1.054
0.467AsnPhe: 0.467 ± 0.222
2.334AsnGly: 2.334 ± 1.11
0.467AsnHis: 0.467 ± 0.222
3.268AsnIle: 3.268 ± 1.554
1.867AsnLys: 1.867 ± 0.973
5.135AsnLeu: 5.135 ± 3.922
0.934AsnMet: 0.934 ± 0.444
1.401AsnAsn: 1.401 ± 1.236
1.867AsnPro: 1.867 ± 0.888
0.934AsnGln: 0.934 ± 0.444
1.867AsnArg: 1.867 ± 2.177
3.268AsnSer: 3.268 ± 1.516
4.669AsnThr: 4.669 ± 2.59
3.268AsnVal: 3.268 ± 1.554
0.0AsnTrp: 0.0 ± 0.0
3.268AsnTyr: 3.268 ± 1.554
0.0AsnXaa: 0.0 ± 0.0
Pro
2.334ProAla: 2.334 ± 1.11
0.0ProCys: 0.0 ± 0.0
2.334ProAsp: 2.334 ± 1.11
1.867ProGlu: 1.867 ± 0.888
1.401ProPhe: 1.401 ± 0.666
1.401ProGly: 1.401 ± 1.236
0.934ProHis: 0.934 ± 0.444
1.401ProIle: 1.401 ± 0.666
2.334ProLys: 2.334 ± 0.834
2.801ProLeu: 2.801 ± 0.736
1.401ProMet: 1.401 ± 0.666
1.401ProAsn: 1.401 ± 0.666
2.334ProPro: 2.334 ± 1.11
1.867ProGln: 1.867 ± 1.101
4.202ProArg: 4.202 ± 1.054
3.268ProSer: 3.268 ± 1.554
1.401ProThr: 1.401 ± 0.666
1.867ProVal: 1.867 ± 0.888
0.934ProTrp: 0.934 ± 0.444
0.467ProTyr: 0.467 ± 1.567
0.0ProXaa: 0.0 ± 0.0
Gln
4.669GlnAla: 4.669 ± 0.863
0.0GlnCys: 0.0 ± 0.0
1.867GlnAsp: 1.867 ± 1.101
3.735GlnGlu: 3.735 ± 2.882
0.467GlnPhe: 0.467 ± 0.222
3.268GlnGly: 3.268 ± 0.925
1.401GlnHis: 1.401 ± 0.666
4.202GlnIle: 4.202 ± 1.079
3.735GlnLys: 3.735 ± 1.946
3.735GlnLeu: 3.735 ± 4.73
0.0GlnMet: 0.0 ± 0.0
1.867GlnAsn: 1.867 ± 3.761
3.268GlnPro: 3.268 ± 1.516
2.801GlnGln: 2.801 ± 4.18
3.268GlnArg: 3.268 ± 0.698
1.401GlnSer: 1.401 ± 0.666
1.401GlnThr: 1.401 ± 1.139
2.334GlnVal: 2.334 ± 1.11
0.467GlnTrp: 0.467 ± 0.222
1.867GlnTyr: 1.867 ± 0.888
0.0GlnXaa: 0.0 ± 0.0
Arg
1.867ArgAla: 1.867 ± 0.888
0.934ArgCys: 0.934 ± 0.444
2.334ArgAsp: 2.334 ± 0.834
3.268ArgGlu: 3.268 ± 1.554
0.934ArgPhe: 0.934 ± 0.444
2.801ArgGly: 2.801 ± 0.936
0.0ArgHis: 0.0 ± 0.0
7.003ArgIle: 7.003 ± 1.669
5.135ArgLys: 5.135 ± 3.922
3.735ArgLeu: 3.735 ± 1.297
2.334ArgMet: 2.334 ± 1.11
2.334ArgAsn: 2.334 ± 1.956
3.268ArgPro: 3.268 ± 0.698
2.801ArgGln: 2.801 ± 0.936
2.801ArgArg: 2.801 ± 0.736
5.135ArgSer: 5.135 ± 1.12
4.202ArgThr: 4.202 ± 1.054
3.735ArgVal: 3.735 ± 1.776
2.801ArgTrp: 2.801 ± 1.332
1.867ArgTyr: 1.867 ± 0.888
0.0ArgXaa: 0.0 ± 0.0
Ser
3.268SerAla: 3.268 ± 2.33
0.934SerCys: 0.934 ± 0.444
2.801SerAsp: 2.801 ± 1.736
6.536SerGlu: 6.536 ± 1.397
1.867SerPhe: 1.867 ± 0.888
2.334SerGly: 2.334 ± 0.834
0.467SerHis: 0.467 ± 0.222
4.669SerIle: 4.669 ± 1.178
5.135SerLys: 5.135 ± 2.228
5.135SerLeu: 5.135 ± 1.12
0.467SerMet: 0.467 ± 0.222
2.801SerAsn: 2.801 ± 1.736
2.334SerPro: 2.334 ± 0.997
3.268SerGln: 3.268 ± 1.516
6.536SerArg: 6.536 ± 1.864
4.669SerSer: 4.669 ± 0.863
2.334SerThr: 2.334 ± 1.11
1.401SerVal: 1.401 ± 1.236
0.0SerTrp: 0.0 ± 0.0
1.867SerTyr: 1.867 ± 0.888
0.0SerXaa: 0.0 ± 0.0
Thr
4.202ThrAla: 4.202 ± 1.054
0.467ThrCys: 0.467 ± 0.222
3.735ThrAsp: 3.735 ± 1.776
6.069ThrGlu: 6.069 ± 2.769
0.934ThrPhe: 0.934 ± 0.444
5.135ThrGly: 5.135 ± 2.393
0.467ThrHis: 0.467 ± 0.222
4.669ThrIle: 4.669 ± 1.178
3.268ThrLys: 3.268 ± 1.516
2.334ThrLeu: 2.334 ± 1.11
1.867ThrMet: 1.867 ± 0.888
2.334ThrAsn: 2.334 ± 1.11
2.334ThrPro: 2.334 ± 0.997
3.268ThrGln: 3.268 ± 3.2
2.801ThrArg: 2.801 ± 0.936
6.069ThrSer: 6.069 ± 3.764
3.735ThrThr: 3.735 ± 0.967
3.268ThrVal: 3.268 ± 0.925
0.934ThrTrp: 0.934 ± 1.321
0.934ThrTyr: 0.934 ± 1.321
0.0ThrXaa: 0.0 ± 0.0
Val
2.801ValAla: 2.801 ± 0.936
1.867ValCys: 1.867 ± 0.888
0.934ValAsp: 0.934 ± 0.444
3.735ValGlu: 3.735 ± 1.946
3.268ValPhe: 3.268 ± 0.925
4.669ValGly: 4.669 ± 1.178
1.401ValHis: 1.401 ± 0.666
2.334ValIle: 2.334 ± 1.11
5.135ValLys: 5.135 ± 2.441
7.003ValLeu: 7.003 ± 3.086
1.867ValMet: 1.867 ± 0.888
4.202ValAsn: 4.202 ± 1.054
1.401ValPro: 1.401 ± 0.666
2.801ValGln: 2.801 ± 2.277
2.334ValArg: 2.334 ± 0.834
2.334ValSer: 2.334 ± 0.834
2.801ValThr: 2.801 ± 1.332
3.735ValVal: 3.735 ± 1.776
0.467ValTrp: 0.467 ± 0.222
1.401ValTyr: 1.401 ± 1.236
0.0ValXaa: 0.0 ± 0.0
Trp
1.401TrpAla: 1.401 ± 0.666
0.0TrpCys: 0.0 ± 0.0
0.934TrpAsp: 0.934 ± 1.321
2.334TrpGlu: 2.334 ± 0.997
0.934TrpPhe: 0.934 ± 0.444
0.467TrpGly: 0.467 ± 0.222
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
2.334TrpLys: 2.334 ± 1.11
1.867TrpLeu: 1.867 ± 0.888
0.0TrpMet: 0.0 ± 0.0
0.467TrpAsn: 0.467 ± 0.222
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.934TrpArg: 0.934 ± 0.444
0.467TrpSer: 0.467 ± 0.222
1.401TrpThr: 1.401 ± 1.139
0.934TrpVal: 0.934 ± 0.444
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.801TyrAla: 2.801 ± 1.332
0.0TyrCys: 0.0 ± 0.0
0.467TyrAsp: 0.467 ± 0.222
2.334TyrGlu: 2.334 ± 1.11
0.934TyrPhe: 0.934 ± 0.444
0.467TyrGly: 0.467 ± 0.222
0.0TyrHis: 0.0 ± 0.0
3.268TyrIle: 3.268 ± 1.554
3.268TyrLys: 3.268 ± 0.925
4.669TyrLeu: 4.669 ± 1.667
1.867TyrMet: 1.867 ± 0.888
1.401TyrAsn: 1.401 ± 0.666
0.467TyrPro: 0.467 ± 0.222
2.801TyrGln: 2.801 ± 0.936
4.202TyrArg: 4.202 ± 0.821
2.334TyrSer: 2.334 ± 1.11
0.467TyrThr: 0.467 ± 0.222
0.934TyrVal: 0.934 ± 0.444
0.0TyrTrp: 0.0 ± 0.0
0.934TyrTyr: 0.934 ± 0.444
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2143 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski