Amino acid dipepetide frequency for Babaco mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.002AlaAla: 6.002 ± 2.821
0.462AlaCys: 0.462 ± 1.105
1.847AlaAsp: 1.847 ± 0.707
2.77AlaGlu: 2.77 ± 0.484
5.078AlaPhe: 5.078 ± 0.837
5.54AlaGly: 5.54 ± 1.113
1.385AlaHis: 1.385 ± 0.723
3.693AlaIle: 3.693 ± 1.914
4.617AlaLys: 4.617 ± 0.912
11.08AlaLeu: 11.08 ± 5.115
2.77AlaMet: 2.77 ± 1.247
3.693AlaAsn: 3.693 ± 1.201
2.77AlaPro: 2.77 ± 0.484
4.155AlaGln: 4.155 ± 1.4
3.693AlaArg: 3.693 ± 1.261
8.31AlaSer: 8.31 ± 4.316
2.77AlaThr: 2.77 ± 0.959
5.078AlaVal: 5.078 ± 1.689
0.462AlaTrp: 0.462 ± 0.241
2.77AlaTyr: 2.77 ± 1.447
0.0AlaXaa: 0.0 ± 0.0
Cys
0.923CysAla: 0.923 ± 1.63
0.462CysCys: 0.462 ± 0.241
0.462CysAsp: 0.462 ± 0.241
0.462CysGlu: 0.462 ± 0.241
1.847CysPhe: 1.847 ± 0.707
0.923CysGly: 0.923 ± 0.482
0.923CysHis: 0.923 ± 0.962
0.462CysIle: 0.462 ± 0.241
0.462CysLys: 0.462 ± 0.241
1.385CysLeu: 1.385 ± 0.669
0.0CysMet: 0.0 ± 0.0
0.923CysAsn: 0.923 ± 0.482
1.385CysPro: 1.385 ± 0.863
0.462CysGln: 0.462 ± 0.241
1.385CysArg: 1.385 ± 1.274
0.923CysSer: 0.923 ± 0.482
0.462CysThr: 0.462 ± 0.241
0.462CysVal: 0.462 ± 0.241
0.462CysTrp: 0.462 ± 0.241
0.923CysTyr: 0.923 ± 0.73
0.0CysXaa: 0.0 ± 0.0
Asp
3.693AspAla: 3.693 ± 1.929
0.923AspCys: 0.923 ± 0.723
2.308AspAsp: 2.308 ± 0.823
3.232AspGlu: 3.232 ± 1.175
1.385AspPhe: 1.385 ± 1.004
1.847AspGly: 1.847 ± 1.319
0.0AspHis: 0.0 ± 0.0
0.923AspIle: 0.923 ± 0.482
2.308AspLys: 2.308 ± 1.467
3.232AspLeu: 3.232 ± 0.457
0.923AspMet: 0.923 ± 0.482
1.847AspAsn: 1.847 ± 2.527
3.693AspPro: 3.693 ± 1.413
2.308AspGln: 2.308 ± 1.206
1.385AspArg: 1.385 ± 0.662
4.155AspSer: 4.155 ± 2.17
0.462AspThr: 0.462 ± 0.241
1.847AspVal: 1.847 ± 0.965
1.385AspTrp: 1.385 ± 0.723
1.385AspTyr: 1.385 ± 0.662
0.0AspXaa: 0.0 ± 0.0
Glu
4.155GluAla: 4.155 ± 1.561
0.462GluCys: 0.462 ± 0.241
1.847GluAsp: 1.847 ± 0.965
6.464GluGlu: 6.464 ± 1.973
0.923GluPhe: 0.923 ± 0.702
3.232GluGly: 3.232 ± 1.175
0.923GluHis: 0.923 ± 0.482
3.693GluIle: 3.693 ± 1.399
5.54GluLys: 5.54 ± 1.352
5.078GluLeu: 5.078 ± 1.225
0.923GluMet: 0.923 ± 0.637
3.693GluAsn: 3.693 ± 0.548
3.693GluPro: 3.693 ± 1.997
0.923GluGln: 0.923 ± 0.482
3.232GluArg: 3.232 ± 0.986
6.925GluSer: 6.925 ± 1.383
7.387GluThr: 7.387 ± 1.613
5.078GluVal: 5.078 ± 1.434
0.923GluTrp: 0.923 ± 0.482
1.385GluTyr: 1.385 ± 0.662
0.0GluXaa: 0.0 ± 0.0
Phe
0.923PheAla: 0.923 ± 1.783
1.847PheCys: 1.847 ± 1.461
4.617PheAsp: 4.617 ± 0.835
1.847PheGlu: 1.847 ± 0.965
3.232PhePhe: 3.232 ± 1.2
2.308PheGly: 2.308 ± 0.908
1.847PheHis: 1.847 ± 0.707
3.232PheIle: 3.232 ± 1.161
2.308PheLys: 2.308 ± 0.669
5.078PheLeu: 5.078 ± 1.849
1.385PheMet: 1.385 ± 0.723
2.308PheAsn: 2.308 ± 0.613
1.385PhePro: 1.385 ± 0.623
2.77PheGln: 2.77 ± 0.808
2.308PheArg: 2.308 ± 0.823
3.232PheSer: 3.232 ± 1.688
4.155PheThr: 4.155 ± 1.958
1.847PheVal: 1.847 ± 1.831
0.462PheTrp: 0.462 ± 0.241
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.693GlyAla: 3.693 ± 1.381
1.847GlyCys: 1.847 ± 0.707
3.693GlyAsp: 3.693 ± 1.651
3.693GlyGlu: 3.693 ± 1.725
2.77GlyPhe: 2.77 ± 0.985
4.617GlyGly: 4.617 ± 1.089
2.308GlyHis: 2.308 ± 1.206
3.693GlyIle: 3.693 ± 3.355
3.693GlyLys: 3.693 ± 0.548
4.617GlyLeu: 4.617 ± 1.131
0.462GlyMet: 0.462 ± 0.241
1.847GlyAsn: 1.847 ± 1.764
2.308GlyPro: 2.308 ± 0.823
2.308GlyGln: 2.308 ± 1.337
2.308GlyArg: 2.308 ± 0.823
1.847GlySer: 1.847 ± 0.796
4.155GlyThr: 4.155 ± 0.993
3.232GlyVal: 3.232 ± 1.021
0.462GlyTrp: 0.462 ± 0.241
2.308GlyTyr: 2.308 ± 1.164
0.0GlyXaa: 0.0 ± 0.0
His
2.308HisAla: 2.308 ± 0.823
0.462HisCys: 0.462 ± 0.815
0.0HisAsp: 0.0 ± 0.0
2.308HisGlu: 2.308 ± 1.206
1.385HisPhe: 1.385 ± 0.662
1.847HisGly: 1.847 ± 1.404
1.385HisHis: 1.385 ± 0.723
1.847HisIle: 1.847 ± 0.699
1.847HisLys: 1.847 ± 2.312
1.847HisLeu: 1.847 ± 1.057
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.923HisPro: 0.923 ± 0.482
0.462HisGln: 0.462 ± 0.241
3.693HisArg: 3.693 ± 1.381
4.155HisSer: 4.155 ± 1.665
2.77HisThr: 2.77 ± 1.323
0.923HisVal: 0.923 ± 0.702
0.0HisTrp: 0.0 ± 0.0
1.847HisTyr: 1.847 ± 0.707
0.0HisXaa: 0.0 ± 0.0
Ile
5.078IleAla: 5.078 ± 2.424
0.462IleCys: 0.462 ± 0.241
1.385IleAsp: 1.385 ± 0.723
4.617IleGlu: 4.617 ± 1.645
3.693IlePhe: 3.693 ± 0.956
1.385IleGly: 1.385 ± 0.863
1.847IleHis: 1.847 ± 0.825
0.923IleIle: 0.923 ± 1.46
4.617IleLys: 4.617 ± 1.258
6.925IleLeu: 6.925 ± 1.681
0.462IleMet: 0.462 ± 0.241
1.847IleAsn: 1.847 ± 0.965
4.155IlePro: 4.155 ± 1.401
3.232IleGln: 3.232 ± 0.861
3.232IleArg: 3.232 ± 1.161
5.54IleSer: 5.54 ± 2.363
3.693IleThr: 3.693 ± 3.164
1.385IleVal: 1.385 ± 1.98
0.923IleTrp: 0.923 ± 0.73
1.847IleTyr: 1.847 ± 0.825
0.0IleXaa: 0.0 ± 0.0
Lys
4.617LysAla: 4.617 ± 1.622
0.462LysCys: 0.462 ± 0.241
4.155LysAsp: 4.155 ± 2.17
3.693LysGlu: 3.693 ± 0.548
2.77LysPhe: 2.77 ± 1.247
4.155LysGly: 4.155 ± 1.401
1.847LysHis: 1.847 ± 1.404
5.078LysIle: 5.078 ± 1.325
5.078LysLys: 5.078 ± 2.652
6.925LysLeu: 6.925 ± 2.071
0.462LysMet: 0.462 ± 0.55
2.77LysAsn: 2.77 ± 0.95
5.078LysPro: 5.078 ± 1.078
2.77LysGln: 2.77 ± 0.808
1.847LysArg: 1.847 ± 0.699
8.31LysSer: 8.31 ± 1.478
5.54LysThr: 5.54 ± 2.894
1.847LysVal: 1.847 ± 0.796
0.0LysTrp: 0.0 ± 0.0
0.923LysTyr: 0.923 ± 0.482
0.0LysXaa: 0.0 ± 0.0
Leu
7.849LeuAla: 7.849 ± 4.282
1.385LeuCys: 1.385 ± 0.723
4.617LeuAsp: 4.617 ± 2.002
5.078LeuGlu: 5.078 ± 2.046
4.155LeuPhe: 4.155 ± 0.632
5.078LeuGly: 5.078 ± 2.388
3.693LeuHis: 3.693 ± 0.802
4.155LeuIle: 4.155 ± 0.712
7.387LeuLys: 7.387 ± 3.015
6.464LeuLeu: 6.464 ± 3.366
1.847LeuMet: 1.847 ± 0.707
1.847LeuAsn: 1.847 ± 0.985
8.772LeuPro: 8.772 ± 2.272
5.078LeuGln: 5.078 ± 2.046
3.693LeuArg: 3.693 ± 1.467
5.54LeuSer: 5.54 ± 0.997
7.387LeuThr: 7.387 ± 3.915
8.772LeuVal: 8.772 ± 3.026
0.923LeuTrp: 0.923 ± 0.482
3.693LeuTyr: 3.693 ± 1.381
0.0LeuXaa: 0.0 ± 0.0
Met
1.847MetAla: 1.847 ± 1.108
0.462MetCys: 0.462 ± 0.241
0.923MetAsp: 0.923 ± 0.73
1.385MetGlu: 1.385 ± 0.723
0.462MetPhe: 0.462 ± 0.241
1.385MetGly: 1.385 ± 0.723
0.462MetHis: 0.462 ± 0.241
0.923MetIle: 0.923 ± 0.482
0.923MetLys: 0.923 ± 0.482
1.847MetLeu: 1.847 ± 0.965
0.0MetMet: 0.0 ± 0.0
0.462MetAsn: 0.462 ± 0.815
1.847MetPro: 1.847 ± 0.965
0.462MetGln: 0.462 ± 0.891
1.847MetArg: 1.847 ± 0.965
1.385MetSer: 1.385 ± 1.496
0.462MetThr: 0.462 ± 0.241
0.462MetVal: 0.462 ± 0.241
0.0MetTrp: 0.0 ± 0.0
0.462MetTyr: 0.462 ± 0.241
0.0MetXaa: 0.0 ± 0.0
Asn
2.77AsnAla: 2.77 ± 0.808
0.923AsnCys: 0.923 ± 0.482
1.847AsnAsp: 1.847 ± 0.965
1.847AsnGlu: 1.847 ± 0.707
2.308AsnPhe: 2.308 ± 1.337
1.385AsnGly: 1.385 ± 0.623
1.385AsnHis: 1.385 ± 0.662
2.77AsnIle: 2.77 ± 0.805
3.232AsnLys: 3.232 ± 0.861
5.078AsnLeu: 5.078 ± 3.953
0.0AsnMet: 0.0 ± 0.0
0.923AsnAsn: 0.923 ± 0.73
3.232AsnPro: 3.232 ± 0.457
2.308AsnGln: 2.308 ± 1.337
0.462AsnArg: 0.462 ± 1.105
5.078AsnSer: 5.078 ± 3.406
2.308AsnThr: 2.308 ± 0.613
0.923AsnVal: 0.923 ± 0.482
1.385AsnTrp: 1.385 ± 1.004
0.923AsnTyr: 0.923 ± 0.482
0.0AsnXaa: 0.0 ± 0.0
Pro
5.078ProAla: 5.078 ± 1.131
0.923ProCys: 0.923 ± 0.962
1.847ProAsp: 1.847 ± 0.707
3.232ProGlu: 3.232 ± 0.861
2.308ProPhe: 2.308 ± 1.467
3.232ProGly: 3.232 ± 0.969
0.923ProHis: 0.923 ± 1.63
4.617ProIle: 4.617 ± 2.002
6.002ProLys: 6.002 ± 3.135
5.078ProLeu: 5.078 ± 2.655
0.462ProMet: 0.462 ± 0.241
4.617ProAsn: 4.617 ± 2.102
5.078ProPro: 5.078 ± 1.131
2.308ProGln: 2.308 ± 0.908
0.923ProArg: 0.923 ± 2.21
5.54ProSer: 5.54 ± 2.952
4.155ProThr: 4.155 ± 0.712
3.693ProVal: 3.693 ± 0.951
0.462ProTrp: 0.462 ± 0.241
1.385ProTyr: 1.385 ± 0.723
0.0ProXaa: 0.0 ± 0.0
Gln
5.078GlnAla: 5.078 ± 0.928
0.462GlnCys: 0.462 ± 0.241
0.923GlnAsp: 0.923 ± 0.723
4.617GlnGlu: 4.617 ± 1.819
1.847GlnPhe: 1.847 ± 0.6
3.232GlnGly: 3.232 ± 0.457
1.385GlnHis: 1.385 ± 0.662
3.693GlnIle: 3.693 ± 1.201
1.385GlnLys: 1.385 ± 0.662
3.232GlnLeu: 3.232 ± 1.688
0.923GlnMet: 0.923 ± 0.842
0.923GlnAsn: 0.923 ± 0.482
0.923GlnPro: 0.923 ± 0.73
2.308GlnGln: 2.308 ± 1.206
0.462GlnArg: 0.462 ± 0.241
2.77GlnSer: 2.77 ± 1.447
2.308GlnThr: 2.308 ± 1.337
2.77GlnVal: 2.77 ± 0.484
0.462GlnTrp: 0.462 ± 0.241
1.385GlnTyr: 1.385 ± 0.662
0.0GlnXaa: 0.0 ± 0.0
Arg
5.54ArgAla: 5.54 ± 1.61
0.462ArgCys: 0.462 ± 0.241
2.308ArgAsp: 2.308 ± 0.857
3.232ArgGlu: 3.232 ± 0.868
2.308ArgPhe: 2.308 ± 0.613
3.232ArgGly: 3.232 ± 0.766
1.385ArgHis: 1.385 ± 1.129
2.308ArgIle: 2.308 ± 0.669
2.308ArgLys: 2.308 ± 0.669
3.693ArgLeu: 3.693 ± 0.956
0.923ArgMet: 0.923 ± 0.482
0.923ArgAsn: 0.923 ± 0.482
1.385ArgPro: 1.385 ± 1.502
2.308ArgGln: 2.308 ± 0.823
1.385ArgArg: 1.385 ± 0.723
3.232ArgSer: 3.232 ± 2.043
1.847ArgThr: 1.847 ± 1.676
1.847ArgVal: 1.847 ± 0.985
0.462ArgTrp: 0.462 ± 0.241
1.385ArgTyr: 1.385 ± 0.723
0.0ArgXaa: 0.0 ± 0.0
Ser
4.617SerAla: 4.617 ± 3.13
0.923SerCys: 0.923 ± 0.482
1.847SerAsp: 1.847 ± 0.825
7.387SerGlu: 7.387 ± 0.541
3.232SerPhe: 3.232 ± 0.986
5.54SerGly: 5.54 ± 1.904
2.77SerHis: 2.77 ± 1.323
6.002SerIle: 6.002 ± 1.946
5.078SerLys: 5.078 ± 1.868
8.772SerLeu: 8.772 ± 1.177
1.385SerMet: 1.385 ± 0.723
4.617SerAsn: 4.617 ± 2.259
6.002SerPro: 6.002 ± 2.73
1.847SerGln: 1.847 ± 0.825
3.232SerArg: 3.232 ± 0.868
8.772SerSer: 8.772 ± 1.971
5.54SerThr: 5.54 ± 2.417
3.693SerVal: 3.693 ± 0.814
1.385SerTrp: 1.385 ± 0.723
2.308SerTyr: 2.308 ± 1.811
0.0SerXaa: 0.0 ± 0.0
Thr
6.464ThrAla: 6.464 ± 1.417
2.308ThrCys: 2.308 ± 0.884
0.923ThrAsp: 0.923 ± 1.224
3.232ThrGlu: 3.232 ± 0.986
3.693ThrPhe: 3.693 ± 1.261
2.77ThrGly: 2.77 ± 1.357
4.155ThrHis: 4.155 ± 1.515
1.847ThrIle: 1.847 ± 0.6
4.155ThrLys: 4.155 ± 1.487
7.849ThrLeu: 7.849 ± 1.572
2.308ThrMet: 2.308 ± 0.669
3.232ThrAsn: 3.232 ± 0.457
4.617ThrPro: 4.617 ± 1.121
0.0ThrGln: 0.0 ± 0.0
1.385ThrArg: 1.385 ± 1.502
3.693ThrSer: 3.693 ± 1.592
2.77ThrThr: 2.77 ± 1.311
3.232ThrVal: 3.232 ± 0.457
0.462ThrTrp: 0.462 ± 1.105
2.77ThrTyr: 2.77 ± 1.447
0.0ThrXaa: 0.0 ± 0.0
Val
4.617ValAla: 4.617 ± 2.462
0.0ValCys: 0.0 ± 0.0
1.385ValAsp: 1.385 ± 0.662
4.155ValGlu: 4.155 ± 1.0
2.308ValPhe: 2.308 ± 0.804
2.308ValGly: 2.308 ± 1.534
0.923ValHis: 0.923 ± 0.702
3.693ValIle: 3.693 ± 1.018
3.232ValLys: 3.232 ± 0.986
5.54ValLeu: 5.54 ± 2.857
0.923ValMet: 0.923 ± 0.482
2.77ValAsn: 2.77 ± 1.357
3.693ValPro: 3.693 ± 2.612
3.693ValGln: 3.693 ± 1.929
4.617ValArg: 4.617 ± 2.759
2.308ValSer: 2.308 ± 1.373
2.308ValThr: 2.308 ± 1.206
2.77ValVal: 2.77 ± 0.985
0.0ValTrp: 0.0 ± 0.0
0.923ValTyr: 0.923 ± 0.73
0.0ValXaa: 0.0 ± 0.0
Trp
1.847TrpAla: 1.847 ± 0.965
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.385TrpGlu: 1.385 ± 0.623
0.0TrpPhe: 0.0 ± 0.0
0.923TrpGly: 0.923 ± 0.962
0.0TrpHis: 0.0 ± 0.0
0.923TrpIle: 0.923 ± 0.482
0.923TrpLys: 0.923 ± 0.482
0.923TrpLeu: 0.923 ± 0.482
0.462TrpMet: 0.462 ± 0.241
0.462TrpAsn: 0.462 ± 0.891
0.462TrpPro: 0.462 ± 0.241
0.923TrpGln: 0.923 ± 0.702
0.0TrpArg: 0.0 ± 0.0
0.462TrpSer: 0.462 ± 0.241
0.0TrpThr: 0.0 ± 0.0
0.923TrpVal: 0.923 ± 0.482
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.847TyrAla: 1.847 ± 0.965
0.462TyrCys: 0.462 ± 0.241
2.308TyrAsp: 2.308 ± 0.669
1.847TyrGlu: 1.847 ± 0.965
1.385TyrPhe: 1.385 ± 0.723
0.923TyrGly: 0.923 ± 0.962
0.462TyrHis: 0.462 ± 0.241
2.77TyrIle: 2.77 ± 2.106
3.693TyrLys: 3.693 ± 0.951
2.77TyrLeu: 2.77 ± 1.447
0.923TyrMet: 0.923 ± 0.449
1.385TyrAsn: 1.385 ± 0.863
0.462TyrPro: 0.462 ± 0.241
0.462TyrGln: 0.462 ± 0.241
1.385TyrArg: 1.385 ± 0.863
2.77TyrSer: 2.77 ± 1.323
1.385TyrThr: 1.385 ± 0.623
1.385TyrVal: 1.385 ± 0.723
0.0TyrTrp: 0.0 ± 0.0
0.923TyrTyr: 0.923 ± 0.962
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2167 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski