Amino acid dipepetide frequency for Tamus red mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.029AlaAla: 7.029 ± 2.424
1.406AlaCys: 1.406 ± 2.522
2.343AlaAsp: 2.343 ± 0.819
7.498AlaGlu: 7.498 ± 2.576
2.812AlaPhe: 2.812 ± 1.461
5.155AlaGly: 5.155 ± 2.814
1.874AlaHis: 1.874 ± 1.028
3.28AlaIle: 3.28 ± 1.259
6.092AlaLys: 6.092 ± 2.679
9.372AlaLeu: 9.372 ± 3.23
1.874AlaMet: 1.874 ± 0.694
3.749AlaAsn: 3.749 ± 1.054
4.217AlaPro: 4.217 ± 1.751
2.812AlaGln: 2.812 ± 1.023
4.686AlaArg: 4.686 ± 1.279
3.749AlaSer: 3.749 ± 1.388
6.092AlaThr: 6.092 ± 1.575
3.749AlaVal: 3.749 ± 1.629
1.406AlaTrp: 1.406 ± 0.631
3.28AlaTyr: 3.28 ± 1.173
0.0AlaXaa: 0.0 ± 0.0
Cys
0.937CysAla: 0.937 ± 0.514
0.0CysCys: 0.0 ± 0.0
0.469CysAsp: 0.469 ± 0.921
1.874CysGlu: 1.874 ± 1.028
0.469CysPhe: 0.469 ± 0.257
0.937CysGly: 0.937 ± 1.194
0.0CysHis: 0.0 ± 0.0
0.937CysIle: 0.937 ± 1.203
0.469CysLys: 0.469 ± 0.257
1.406CysLeu: 1.406 ± 0.771
0.469CysMet: 0.469 ± 0.248
0.937CysAsn: 0.937 ± 0.778
1.874CysPro: 1.874 ± 1.766
0.469CysGln: 0.469 ± 0.257
0.469CysArg: 0.469 ± 0.791
0.0CysSer: 0.0 ± 0.0
0.469CysThr: 0.469 ± 0.257
0.469CysVal: 0.469 ± 0.257
0.0CysTrp: 0.0 ± 0.0
0.937CysTyr: 0.937 ± 1.678
0.0CysXaa: 0.0 ± 0.0
Asp
2.812AspAla: 2.812 ± 1.541
0.469AspCys: 0.469 ± 0.257
3.28AspAsp: 3.28 ± 1.173
2.343AspGlu: 2.343 ± 0.819
2.812AspPhe: 2.812 ± 1.261
1.874AspGly: 1.874 ± 1.315
1.874AspHis: 1.874 ± 1.046
1.406AspIle: 1.406 ± 0.704
2.343AspLys: 2.343 ± 0.679
3.28AspLeu: 3.28 ± 1.798
0.469AspMet: 0.469 ± 0.257
2.343AspAsn: 2.343 ± 1.635
2.812AspPro: 2.812 ± 1.461
1.406AspGln: 1.406 ± 0.771
1.874AspArg: 1.874 ± 0.694
3.749AspSer: 3.749 ± 0.762
2.812AspThr: 2.812 ± 1.023
4.217AspVal: 4.217 ± 1.154
1.874AspTrp: 1.874 ± 0.694
0.937AspTyr: 0.937 ± 0.514
0.0AspXaa: 0.0 ± 0.0
Glu
3.749GluAla: 3.749 ± 1.462
0.469GluCys: 0.469 ± 0.257
4.217GluAsp: 4.217 ± 1.698
4.686GluGlu: 4.686 ± 1.149
2.343GluPhe: 2.343 ± 0.679
5.623GluGly: 5.623 ± 1.812
0.0GluHis: 0.0 ± 0.0
1.874GluIle: 1.874 ± 1.557
4.686GluLys: 4.686 ± 1.939
5.623GluLeu: 5.623 ± 1.584
2.343GluMet: 2.343 ± 1.203
4.217GluAsn: 4.217 ± 1.019
3.28GluPro: 3.28 ± 1.173
5.155GluGln: 5.155 ± 2.826
3.749GluArg: 3.749 ± 1.211
4.686GluSer: 4.686 ± 2.058
2.343GluThr: 2.343 ± 1.185
6.092GluVal: 6.092 ± 1.931
2.812GluTrp: 2.812 ± 1.541
1.874GluTyr: 1.874 ± 0.719
0.0GluXaa: 0.0 ± 0.0
Phe
2.812PheAla: 2.812 ± 1.287
1.406PheCys: 1.406 ± 1.441
3.749PheAsp: 3.749 ± 1.895
4.686PheGlu: 4.686 ± 2.288
0.937PhePhe: 0.937 ± 0.668
1.874PheGly: 1.874 ± 0.719
2.343PheHis: 2.343 ± 0.819
2.343PheIle: 2.343 ± 1.285
3.749PheLys: 3.749 ± 1.895
5.623PheLeu: 5.623 ± 1.631
0.937PheMet: 0.937 ± 0.514
1.874PheAsn: 1.874 ± 0.719
1.406PhePro: 1.406 ± 0.704
1.874PheGln: 1.874 ± 0.719
0.469PheArg: 0.469 ± 0.257
5.623PheSer: 5.623 ± 1.327
3.749PheThr: 3.749 ± 1.175
2.343PheVal: 2.343 ± 0.819
0.937PheTrp: 0.937 ± 0.514
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
7.029GlyAla: 7.029 ± 4.075
1.874GlyCys: 1.874 ± 1.322
3.749GlyAsp: 3.749 ± 0.88
4.686GlyGlu: 4.686 ± 1.671
2.343GlyPhe: 2.343 ± 0.819
3.28GlyGly: 3.28 ± 2.285
1.874GlyHis: 1.874 ± 1.766
2.343GlyIle: 2.343 ± 0.679
3.28GlyLys: 3.28 ± 1.643
2.812GlyLeu: 2.812 ± 0.607
0.937GlyMet: 0.937 ± 0.986
1.874GlyAsn: 1.874 ± 1.322
2.812GlyPro: 2.812 ± 1.408
0.469GlyGln: 0.469 ± 0.257
1.406GlyArg: 1.406 ± 1.486
4.217GlySer: 4.217 ± 1.492
2.812GlyThr: 2.812 ± 1.136
4.686GlyVal: 4.686 ± 2.63
0.0GlyTrp: 0.0 ± 0.0
1.406GlyTyr: 1.406 ± 0.771
0.0GlyXaa: 0.0 ± 0.0
His
1.406HisAla: 1.406 ± 1.527
0.0HisCys: 0.0 ± 0.0
0.469HisAsp: 0.469 ± 0.257
1.406HisGlu: 1.406 ± 0.771
2.812HisPhe: 2.812 ± 0.978
3.28HisGly: 3.28 ± 2.357
1.874HisHis: 1.874 ± 3.815
0.469HisIle: 0.469 ± 1.337
1.406HisLys: 1.406 ± 1.092
2.812HisLeu: 2.812 ± 1.136
1.406HisMet: 1.406 ± 0.929
0.469HisAsn: 0.469 ± 0.257
0.469HisPro: 0.469 ± 0.257
0.937HisGln: 0.937 ± 0.514
2.812HisArg: 2.812 ± 1.408
3.28HisSer: 3.28 ± 2.285
2.343HisThr: 2.343 ± 1.633
3.749HisVal: 3.749 ± 1.69
0.0HisTrp: 0.0 ± 0.0
0.937HisTyr: 0.937 ± 0.668
0.0HisXaa: 0.0 ± 0.0
Ile
3.749IleAla: 3.749 ± 2.353
0.937IleCys: 0.937 ± 0.778
0.937IleAsp: 0.937 ± 0.514
1.874IleGlu: 1.874 ± 1.028
3.28IlePhe: 3.28 ± 0.96
0.469IleGly: 0.469 ± 0.921
2.812IleHis: 2.812 ± 2.421
0.469IleIle: 0.469 ± 1.313
3.749IleLys: 3.749 ± 1.462
3.28IleLeu: 3.28 ± 0.96
0.469IleMet: 0.469 ± 0.257
3.28IleAsn: 3.28 ± 1.173
1.874IlePro: 1.874 ± 0.694
2.343IleGln: 2.343 ± 0.835
2.812IleArg: 2.812 ± 1.461
3.28IleSer: 3.28 ± 1.84
3.749IleThr: 3.749 ± 2.151
1.406IleVal: 1.406 ± 1.14
0.937IleTrp: 0.937 ± 0.668
1.874IleTyr: 1.874 ± 2.387
0.0IleXaa: 0.0 ± 0.0
Lys
3.749LysAla: 3.749 ± 1.462
0.0LysCys: 0.0 ± 0.0
1.874LysAsp: 1.874 ± 1.315
3.749LysGlu: 3.749 ± 1.655
4.686LysPhe: 4.686 ± 1.173
4.217LysGly: 4.217 ± 2.312
0.937LysHis: 0.937 ± 0.514
3.749LysIle: 3.749 ± 0.762
3.28LysLys: 3.28 ± 1.798
7.498LysLeu: 7.498 ± 2.826
1.406LysMet: 1.406 ± 0.771
4.217LysAsn: 4.217 ± 2.312
6.56LysPro: 6.56 ± 2.481
3.749LysGln: 3.749 ± 2.055
1.874LysArg: 1.874 ± 1.557
1.874LysSer: 1.874 ± 1.322
5.155LysThr: 5.155 ± 2.09
4.686LysVal: 4.686 ± 1.149
0.937LysTrp: 0.937 ± 0.514
0.937LysTyr: 0.937 ± 1.203
0.0LysXaa: 0.0 ± 0.0
Leu
7.029LeuAla: 7.029 ± 2.418
0.937LeuCys: 0.937 ± 0.514
3.749LeuAsp: 3.749 ± 0.762
9.841LeuGlu: 9.841 ± 2.811
4.686LeuPhe: 4.686 ± 1.152
5.623LeuGly: 5.623 ± 1.812
1.874LeuHis: 1.874 ± 0.719
5.623LeuIle: 5.623 ± 1.694
8.903LeuLys: 8.903 ± 2.927
6.56LeuLeu: 6.56 ± 2.769
1.406LeuMet: 1.406 ± 0.631
2.343LeuAsn: 2.343 ± 1.185
7.966LeuPro: 7.966 ± 1.57
3.749LeuGln: 3.749 ± 1.175
5.155LeuArg: 5.155 ± 1.374
5.623LeuSer: 5.623 ± 1.609
7.498LeuThr: 7.498 ± 2.509
6.092LeuVal: 6.092 ± 6.377
1.406LeuTrp: 1.406 ± 0.771
1.874LeuTyr: 1.874 ± 1.028
0.469LeuXaa: 0.469 ± 0.257
Met
1.406MetAla: 1.406 ± 1.441
0.0MetCys: 0.0 ± 0.0
0.469MetAsp: 0.469 ± 0.257
1.406MetGlu: 1.406 ± 0.771
0.469MetPhe: 0.469 ± 0.257
0.469MetGly: 0.469 ± 0.257
0.937MetHis: 0.937 ± 0.668
0.469MetIle: 0.469 ± 0.257
2.343MetLys: 2.343 ± 0.835
3.28MetLeu: 3.28 ± 1.798
0.469MetMet: 0.469 ± 0.257
1.406MetAsn: 1.406 ± 0.704
0.469MetPro: 0.469 ± 1.313
0.937MetGln: 0.937 ± 0.514
2.343MetArg: 2.343 ± 0.835
3.28MetSer: 3.28 ± 1.259
0.937MetThr: 0.937 ± 0.668
0.937MetVal: 0.937 ± 0.514
0.469MetTrp: 0.469 ± 0.257
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.217AsnAla: 4.217 ± 1.514
2.812AsnCys: 2.812 ± 2.28
2.343AsnAsp: 2.343 ± 0.819
1.874AsnGlu: 1.874 ± 1.028
1.406AsnPhe: 1.406 ± 0.771
2.812AsnGly: 2.812 ± 1.541
0.937AsnHis: 0.937 ± 0.778
2.343AsnIle: 2.343 ± 1.53
1.406AsnLys: 1.406 ± 0.771
4.217AsnLeu: 4.217 ± 1.556
0.937AsnMet: 0.937 ± 0.668
0.937AsnAsn: 0.937 ± 0.668
3.28AsnPro: 3.28 ± 1.317
0.469AsnGln: 0.469 ± 0.791
0.469AsnArg: 0.469 ± 0.791
4.686AsnSer: 4.686 ± 3.06
2.812AsnThr: 2.812 ± 1.807
2.812AsnVal: 2.812 ± 0.607
0.469AsnTrp: 0.469 ± 0.257
1.874AsnTyr: 1.874 ± 1.028
0.0AsnXaa: 0.0 ± 0.0
Pro
4.217ProAla: 4.217 ± 1.371
1.406ProCys: 1.406 ± 0.771
3.749ProAsp: 3.749 ± 1.211
5.155ProGlu: 5.155 ± 1.374
2.812ProPhe: 2.812 ± 1.415
2.812ProGly: 2.812 ± 2.045
1.874ProHis: 1.874 ± 3.535
2.812ProIle: 2.812 ± 1.032
3.749ProLys: 3.749 ± 0.762
4.686ProLeu: 4.686 ± 1.149
0.469ProMet: 0.469 ± 0.257
1.874ProAsn: 1.874 ± 0.719
3.749ProPro: 3.749 ± 3.969
1.874ProGln: 1.874 ± 0.694
1.406ProArg: 1.406 ± 0.771
3.28ProSer: 3.28 ± 3.443
4.686ProThr: 4.686 ± 1.671
3.749ProVal: 3.749 ± 2.455
0.469ProTrp: 0.469 ± 0.257
0.937ProTyr: 0.937 ± 0.514
0.0ProXaa: 0.0 ± 0.0
Gln
2.812GlnAla: 2.812 ± 1.541
0.0GlnCys: 0.0 ± 0.0
1.874GlnAsp: 1.874 ± 1.028
0.937GlnGlu: 0.937 ± 0.514
2.343GlnPhe: 2.343 ± 0.835
1.406GlnGly: 1.406 ± 0.631
2.812GlnHis: 2.812 ± 1.032
1.874GlnIle: 1.874 ± 0.694
3.28GlnLys: 3.28 ± 1.798
2.343GlnLeu: 2.343 ± 1.285
1.874GlnMet: 1.874 ± 1.028
0.937GlnAsn: 0.937 ± 0.668
2.812GlnPro: 2.812 ± 1.082
2.343GlnGln: 2.343 ± 0.679
1.874GlnArg: 1.874 ± 1.134
4.686GlnSer: 4.686 ± 1.152
3.28GlnThr: 3.28 ± 1.798
2.812GlnVal: 2.812 ± 1.537
0.469GlnTrp: 0.469 ± 0.257
0.937GlnTyr: 0.937 ± 0.514
0.0GlnXaa: 0.0 ± 0.0
Arg
4.217ArgAla: 4.217 ± 1.154
0.469ArgCys: 0.469 ± 0.257
0.469ArgAsp: 0.469 ± 0.257
4.686ArgGlu: 4.686 ± 2.569
1.406ArgPhe: 1.406 ± 0.771
2.812ArgGly: 2.812 ± 1.082
0.937ArgHis: 0.937 ± 0.778
0.469ArgIle: 0.469 ± 0.791
3.28ArgLys: 3.28 ± 2.81
4.686ArgLeu: 4.686 ± 1.047
1.874ArgMet: 1.874 ± 0.694
3.28ArgAsn: 3.28 ± 1.627
0.937ArgPro: 0.937 ± 0.778
3.749ArgGln: 3.749 ± 2.091
1.874ArgArg: 1.874 ± 1.322
0.937ArgSer: 0.937 ± 0.514
2.812ArgThr: 2.812 ± 1.032
1.406ArgVal: 1.406 ± 1.513
0.469ArgTrp: 0.469 ± 0.257
2.343ArgTyr: 2.343 ± 1.273
0.0ArgXaa: 0.0 ± 0.0
Ser
6.092SerAla: 6.092 ± 1.402
0.0SerCys: 0.0 ± 0.0
3.749SerAsp: 3.749 ± 1.462
4.217SerGlu: 4.217 ± 1.154
3.749SerPhe: 3.749 ± 1.388
3.28SerGly: 3.28 ± 3.133
2.343SerHis: 2.343 ± 1.462
5.623SerIle: 5.623 ± 3.583
3.749SerLys: 3.749 ± 1.875
7.966SerLeu: 7.966 ± 2.706
0.937SerMet: 0.937 ± 0.607
4.217SerAsn: 4.217 ± 2.064
3.28SerPro: 3.28 ± 1.57
2.812SerGln: 2.812 ± 1.541
3.28SerArg: 3.28 ± 1.079
6.092SerSer: 6.092 ± 2.253
5.155SerThr: 5.155 ± 1.899
3.28SerVal: 3.28 ± 2.547
0.0SerTrp: 0.0 ± 0.0
1.406SerTyr: 1.406 ± 0.704
0.0SerXaa: 0.0 ± 0.0
Thr
7.498ThrAla: 7.498 ± 2.053
0.937ThrCys: 0.937 ± 0.514
2.812ThrAsp: 2.812 ± 1.245
5.623ThrGlu: 5.623 ± 1.764
2.343ThrPhe: 2.343 ± 1.273
3.28ThrGly: 3.28 ± 1.143
3.28ThrHis: 3.28 ± 1.173
3.28ThrIle: 3.28 ± 1.173
3.28ThrLys: 3.28 ± 0.96
7.966ThrLeu: 7.966 ± 1.68
1.874ThrMet: 1.874 ± 1.028
2.812ThrAsn: 2.812 ± 1.286
4.217ThrPro: 4.217 ± 1.605
2.343ThrGln: 2.343 ± 1.53
2.343ThrArg: 2.343 ± 2.274
4.686ThrSer: 4.686 ± 1.384
5.155ThrThr: 5.155 ± 1.85
2.812ThrVal: 2.812 ± 0.978
0.0ThrTrp: 0.0 ± 0.0
1.406ThrTyr: 1.406 ± 0.771
0.0ThrXaa: 0.0 ± 0.0
Val
4.686ValAla: 4.686 ± 1.85
0.469ValCys: 0.469 ± 0.257
2.812ValAsp: 2.812 ± 1.032
3.749ValGlu: 3.749 ± 1.895
4.217ValPhe: 4.217 ± 1.388
3.749ValGly: 3.749 ± 2.968
2.812ValHis: 2.812 ± 1.299
2.812ValIle: 2.812 ± 1.537
3.749ValLys: 3.749 ± 2.159
8.435ValLeu: 8.435 ± 3.894
0.937ValMet: 0.937 ± 0.514
0.937ValAsn: 0.937 ± 1.582
1.874ValPro: 1.874 ± 0.719
2.812ValGln: 2.812 ± 1.286
3.749ValArg: 3.749 ± 0.953
6.092ValSer: 6.092 ± 3.391
2.343ValThr: 2.343 ± 1.247
6.56ValVal: 6.56 ± 4.234
0.469ValTrp: 0.469 ± 0.257
0.469ValTyr: 0.469 ± 0.257
0.0ValXaa: 0.0 ± 0.0
Trp
2.812TrpAla: 2.812 ± 1.023
0.0TrpCys: 0.0 ± 0.0
0.937TrpAsp: 0.937 ± 0.514
0.0TrpGlu: 0.0 ± 0.0
0.937TrpPhe: 0.937 ± 0.514
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.469TrpIle: 0.469 ± 0.257
0.937TrpLys: 0.937 ± 0.514
2.343TrpLeu: 2.343 ± 1.285
0.469TrpMet: 0.469 ± 0.257
1.406TrpAsn: 1.406 ± 0.631
0.469TrpPro: 0.469 ± 0.257
0.469TrpGln: 0.469 ± 0.791
0.469TrpArg: 0.469 ± 0.257
0.0TrpSer: 0.0 ± 0.0
0.469TrpThr: 0.469 ± 0.257
0.937TrpVal: 0.937 ± 0.514
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.217TyrAla: 4.217 ± 1.605
0.0TyrCys: 0.0 ± 0.0
0.937TyrAsp: 0.937 ± 0.514
0.0TyrGlu: 0.0 ± 0.0
1.874TyrPhe: 1.874 ± 0.694
0.937TyrGly: 0.937 ± 1.194
0.937TyrHis: 0.937 ± 0.668
0.937TyrIle: 0.937 ± 0.778
1.406TyrLys: 1.406 ± 1.092
4.217TyrLeu: 4.217 ± 1.787
0.469TyrMet: 0.469 ± 0.257
0.0TyrAsn: 0.0 ± 0.0
1.406TyrPro: 1.406 ± 1.686
0.937TyrGln: 0.937 ± 0.514
0.0TyrArg: 0.0 ± 0.0
0.937TyrSer: 0.937 ± 0.668
3.28TyrThr: 3.28 ± 1.235
0.937TyrVal: 0.937 ± 0.514
0.0TyrTrp: 0.0 ± 0.0
0.937TyrTyr: 0.937 ± 1.194
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.469XaaSer: 0.469 ± 0.257
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2135 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski