Amino acid dipeptide frequency for Rhynchosia yellow mosaic India virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
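The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the mapping of every non-standard letter to X (Xaa), and the choice to count pairs within each protein only are assumptions made for this example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted within each protein and never
    across two proteins; any non-standard letter is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency as over- or underrepresented versus the expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

For example, classify(3.625) returns "over" and classify(0.604) returns "under", which would correspond to the red and blue cells under the uniform expectation.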

All values are presented in per mille (‰); to obtain fractions, multiply them by 10^-3.
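As a small worked example of the unit conversion, the following Python sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and exist only for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille / 1000.0


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


ala_ala = 3.625  # AlaAla value taken from the table, in per mille
print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
print(permille_to_percent(ala_ala))   # same value expressed in percent
```

The percentage form is the one to compare against the 0.25% uniform expectation quoted in the text.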

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.625AlaAla: 3.625 ± 0.905
1.813AlaCys: 1.813 ± 1.224
0.604AlaAsp: 0.604 ± 0.549
4.23AlaGlu: 4.23 ± 1.794
3.021AlaPhe: 3.021 ± 1.127
1.208AlaGly: 1.208 ± 0.601
0.604AlaHis: 0.604 ± 0.532
3.625AlaIle: 3.625 ± 1.742
4.834AlaLys: 4.834 ± 1.26
4.834AlaLeu: 4.834 ± 1.353
0.604AlaMet: 0.604 ± 0.532
3.021AlaAsn: 3.021 ± 1.441
1.208AlaPro: 1.208 ± 0.62
3.625AlaGln: 3.625 ± 1.006
3.021AlaArg: 3.021 ± 1.129
4.834AlaSer: 4.834 ± 1.464
4.23AlaThr: 4.23 ± 1.413
0.604AlaVal: 0.604 ± 0.682
1.208AlaTrp: 1.208 ± 0.73
0.604AlaTyr: 0.604 ± 0.498
0.0AlaXaa: 0.0 ± 0.0
Cys
1.208CysAla: 1.208 ± 1.307
0.0CysCys: 0.0 ± 0.0
1.208CysAsp: 1.208 ± 0.566
2.417CysGlu: 2.417 ± 1.07
0.604CysPhe: 0.604 ± 0.663
1.208CysGly: 1.208 ± 0.732
0.0CysHis: 0.0 ± 0.0
1.208CysIle: 1.208 ± 0.927
1.813CysLys: 1.813 ± 1.043
1.208CysLeu: 1.208 ± 0.807
1.208CysMet: 1.208 ± 0.961
2.417CysAsn: 2.417 ± 1.076
1.208CysPro: 1.208 ± 1.052
0.604CysGln: 0.604 ± 0.886
1.813CysArg: 1.813 ± 0.815
1.208CysSer: 1.208 ± 0.89
1.813CysThr: 1.813 ± 0.891
1.208CysVal: 1.208 ± 0.611
0.0CysTrp: 0.0 ± 0.0
1.208CysTyr: 1.208 ± 0.566
0.0CysXaa: 0.0 ± 0.0
Asp
1.208AspAla: 1.208 ± 0.995
0.604AspCys: 0.604 ± 0.654
3.021AspAsp: 3.021 ± 1.141
3.021AspGlu: 3.021 ± 0.969
3.021AspPhe: 3.021 ± 1.033
4.23AspGly: 4.23 ± 1.45
1.813AspHis: 1.813 ± 0.955
3.021AspIle: 3.021 ± 1.17
2.417AspLys: 2.417 ± 0.897
4.834AspLeu: 4.834 ± 1.809
0.0AspMet: 0.0 ± 0.0
1.813AspAsn: 1.813 ± 0.718
3.625AspPro: 3.625 ± 1.102
1.813AspGln: 1.813 ± 1.027
2.417AspArg: 2.417 ± 1.49
5.438AspSer: 5.438 ± 0.913
3.625AspThr: 3.625 ± 1.588
4.23AspVal: 4.23 ± 1.412
1.208AspTrp: 1.208 ± 0.732
0.604AspTyr: 0.604 ± 0.549
0.0AspXaa: 0.0 ± 0.0
Glu
4.834GluAla: 4.834 ± 2.015
0.604GluCys: 0.604 ± 0.454
1.208GluAsp: 1.208 ± 0.601
3.625GluGlu: 3.625 ± 1.049
3.625GluPhe: 3.625 ± 1.961
3.021GluGly: 3.021 ± 0.979
1.208GluHis: 1.208 ± 0.89
2.417GluIle: 2.417 ± 1.444
3.021GluLys: 3.021 ± 1.488
6.042GluLeu: 6.042 ± 2.27
0.604GluMet: 0.604 ± 0.682
4.23GluAsn: 4.23 ± 1.873
1.813GluPro: 1.813 ± 0.991
3.021GluGln: 3.021 ± 1.049
2.417GluArg: 2.417 ± 1.015
4.23GluSer: 4.23 ± 1.14
3.021GluThr: 3.021 ± 1.197
0.604GluVal: 0.604 ± 0.663
0.604GluTrp: 0.604 ± 0.654
1.208GluTyr: 1.208 ± 0.908
0.0GluXaa: 0.0 ± 0.0
Phe
0.604PheAla: 0.604 ± 0.454
1.208PheCys: 1.208 ± 0.815
4.834PheAsp: 4.834 ± 1.452
4.23PheGlu: 4.23 ± 1.16
2.417PhePhe: 2.417 ± 1.073
2.417PheGly: 2.417 ± 1.49
1.208PheHis: 1.208 ± 0.995
1.813PheIle: 1.813 ± 1.081
3.625PheLys: 3.625 ± 1.045
5.438PheLeu: 5.438 ± 2.733
0.604PheMet: 0.604 ± 0.498
2.417PheAsn: 2.417 ± 1.165
2.417PhePro: 2.417 ± 1.562
2.417PheGln: 2.417 ± 1.002
2.417PheArg: 2.417 ± 1.426
3.021PheSer: 3.021 ± 1.217
2.417PheThr: 2.417 ± 1.076
2.417PheVal: 2.417 ± 0.952
0.604PheTrp: 0.604 ± 0.454
2.417PheTyr: 2.417 ± 1.534
0.0PheXaa: 0.0 ± 0.0
Gly
2.417GlyAla: 2.417 ± 1.276
2.417GlyCys: 2.417 ± 1.003
4.23GlyAsp: 4.23 ± 1.506
1.813GlyGlu: 1.813 ± 0.8
2.417GlyPhe: 2.417 ± 1.849
3.021GlyGly: 3.021 ± 1.25
1.208GlyHis: 1.208 ± 0.73
3.625GlyIle: 3.625 ± 0.905
6.647GlyLys: 6.647 ± 2.106
0.0GlyLeu: 0.0 ± 0.0
1.813GlyMet: 1.813 ± 1.566
3.021GlyAsn: 3.021 ± 1.224
1.813GlyPro: 1.813 ± 0.618
2.417GlyGln: 2.417 ± 1.266
3.625GlyArg: 3.625 ± 1.431
0.604GlySer: 0.604 ± 0.663
3.625GlyThr: 3.625 ± 1.147
0.604GlyVal: 0.604 ± 0.454
0.0GlyTrp: 0.0 ± 0.0
0.604GlyTyr: 0.604 ± 0.682
0.0GlyXaa: 0.0 ± 0.0
His
1.813HisAla: 1.813 ± 0.891
0.604HisCys: 0.604 ± 0.654
1.208HisAsp: 1.208 ± 0.611
0.604HisGlu: 0.604 ± 0.498
1.813HisPhe: 1.813 ± 1.295
2.417HisGly: 2.417 ± 0.999
0.604HisHis: 0.604 ± 0.654
1.813HisIle: 1.813 ± 1.027
0.0HisLys: 0.0 ± 0.0
1.813HisLeu: 1.813 ± 1.006
0.604HisMet: 0.604 ± 0.549
2.417HisAsn: 2.417 ± 1.225
0.604HisPro: 0.604 ± 0.498
1.813HisGln: 1.813 ± 0.902
2.417HisArg: 2.417 ± 1.347
0.604HisSer: 0.604 ± 0.454
1.813HisThr: 1.813 ± 1.595
2.417HisVal: 2.417 ± 0.802
0.604HisTrp: 0.604 ± 0.682
1.208HisTyr: 1.208 ± 0.601
0.0HisXaa: 0.0 ± 0.0
Ile
3.625IleAla: 3.625 ± 1.239
1.208IleCys: 1.208 ± 0.62
3.021IleAsp: 3.021 ± 1.066
4.834IleGlu: 4.834 ± 2.31
2.417IlePhe: 2.417 ± 1.069
1.208IleGly: 1.208 ± 0.908
0.604IleHis: 0.604 ± 0.549
1.208IleIle: 1.208 ± 0.908
4.834IleLys: 4.834 ± 1.632
3.625IleLeu: 3.625 ± 1.274
3.625IleMet: 3.625 ± 1.297
3.021IleAsn: 3.021 ± 0.973
1.813IlePro: 1.813 ± 1.031
2.417IleGln: 2.417 ± 1.042
4.834IleArg: 4.834 ± 0.93
9.668IleSer: 9.668 ± 2.122
3.625IleThr: 3.625 ± 1.331
1.208IleVal: 1.208 ± 0.93
0.0IleTrp: 0.0 ± 0.0
1.813IleTyr: 1.813 ± 0.907
0.0IleXaa: 0.0 ± 0.0
Lys
3.021LysAla: 3.021 ± 2.016
1.813LysCys: 1.813 ± 1.111
4.834LysAsp: 4.834 ± 1.351
6.042LysGlu: 6.042 ± 2.023
2.417LysPhe: 2.417 ± 1.225
0.604LysGly: 0.604 ± 0.498
1.813LysHis: 1.813 ± 0.942
4.23LysIle: 4.23 ± 0.867
4.834LysLys: 4.834 ± 1.918
6.042LysLeu: 6.042 ± 1.369
2.417LysMet: 2.417 ± 1.132
6.647LysAsn: 6.647 ± 1.791
4.23LysPro: 4.23 ± 1.037
1.813LysGln: 1.813 ± 0.753
3.021LysArg: 3.021 ± 1.285
5.438LysSer: 5.438 ± 1.287
1.813LysThr: 1.813 ± 1.281
4.23LysVal: 4.23 ± 1.386
0.0LysTrp: 0.0 ± 0.0
2.417LysTyr: 2.417 ± 0.985
0.0LysXaa: 0.0 ± 0.0
Leu
1.813LeuAla: 1.813 ± 1.046
3.625LeuCys: 3.625 ± 0.829
4.23LeuAsp: 4.23 ± 1.105
1.813LeuGlu: 1.813 ± 0.792
0.604LeuPhe: 0.604 ± 0.549
4.834LeuGly: 4.834 ± 1.936
3.625LeuHis: 3.625 ± 1.544
3.625LeuIle: 3.625 ± 2.012
7.251LeuLys: 7.251 ± 2.26
8.459LeuLeu: 8.459 ± 2.087
1.813LeuMet: 1.813 ± 1.448
2.417LeuAsn: 2.417 ± 0.89
2.417LeuPro: 2.417 ± 1.07
4.23LeuGln: 4.23 ± 1.137
4.834LeuArg: 4.834 ± 3.665
6.647LeuSer: 6.647 ± 2.129
3.021LeuThr: 3.021 ± 0.827
3.021LeuVal: 3.021 ± 1.029
1.208LeuTrp: 1.208 ± 0.638
2.417LeuTyr: 2.417 ± 1.108
0.0LeuXaa: 0.0 ± 0.0
Met
3.021MetAla: 3.021 ± 1.019
0.604MetCys: 0.604 ± 0.549
3.021MetAsp: 3.021 ± 1.231
2.417MetGlu: 2.417 ± 0.65
3.021MetPhe: 3.021 ± 0.974
1.208MetGly: 1.208 ± 0.601
0.604MetHis: 0.604 ± 0.654
3.021MetIle: 3.021 ± 1.63
1.813MetLys: 1.813 ± 0.77
1.208MetLeu: 1.208 ± 0.803
0.0MetMet: 0.0 ± 0.0
0.604MetAsn: 0.604 ± 0.549
1.813MetPro: 1.813 ± 1.388
1.813MetGln: 1.813 ± 1.125
2.417MetArg: 2.417 ± 0.983
1.208MetSer: 1.208 ± 0.827
0.0MetThr: 0.0 ± 0.0
1.813MetVal: 1.813 ± 0.827
1.208MetTrp: 1.208 ± 0.927
0.604MetTyr: 0.604 ± 0.532
0.0MetXaa: 0.0 ± 0.0
Asn
5.438AsnAla: 5.438 ± 1.629
2.417AsnCys: 2.417 ± 1.061
3.625AsnAsp: 3.625 ± 0.83
1.813AsnGlu: 1.813 ± 0.966
1.208AsnPhe: 1.208 ± 0.62
1.208AsnGly: 1.208 ± 0.927
4.23AsnHis: 4.23 ± 2.34
3.021AsnIle: 3.021 ± 1.608
3.021AsnLys: 3.021 ± 1.327
3.021AsnLeu: 3.021 ± 1.197
2.417AsnMet: 2.417 ± 1.035
2.417AsnAsn: 2.417 ± 0.89
3.625AsnPro: 3.625 ± 0.841
1.813AsnGln: 1.813 ± 1.006
2.417AsnArg: 2.417 ± 1.078
6.647AsnSer: 6.647 ± 2.745
4.23AsnThr: 4.23 ± 2.082
5.438AsnVal: 5.438 ± 1.866
0.604AsnTrp: 0.604 ± 0.549
5.438AsnTyr: 5.438 ± 1.81
0.0AsnXaa: 0.0 ± 0.0
Pro
1.208ProAla: 1.208 ± 1.063
0.604ProCys: 0.604 ± 0.532
1.813ProAsp: 1.813 ± 0.907
1.813ProGlu: 1.813 ± 0.942
2.417ProPhe: 2.417 ± 0.571
2.417ProGly: 2.417 ± 1.002
2.417ProHis: 2.417 ± 1.469
4.23ProIle: 4.23 ± 2.277
4.23ProLys: 4.23 ± 1.456
4.834ProLeu: 4.834 ± 1.724
3.021ProMet: 3.021 ± 1.892
3.625ProAsn: 3.625 ± 1.892
2.417ProPro: 2.417 ± 0.76
0.604ProGln: 0.604 ± 0.654
4.23ProArg: 4.23 ± 1.038
4.834ProSer: 4.834 ± 2.344
1.813ProThr: 1.813 ± 1.006
1.813ProVal: 1.813 ± 0.991
0.604ProTrp: 0.604 ± 0.454
1.208ProTyr: 1.208 ± 0.638
0.0ProXaa: 0.0 ± 0.0
Gln
1.813GlnAla: 1.813 ± 0.939
1.208GlnCys: 1.208 ± 0.93
1.813GlnAsp: 1.813 ± 0.839
3.021GlnGlu: 3.021 ± 1.101
2.417GlnPhe: 2.417 ± 0.952
1.813GlnGly: 1.813 ± 0.718
2.417GlnHis: 2.417 ± 1.352
2.417GlnIle: 2.417 ± 1.076
1.208GlnLys: 1.208 ± 0.908
2.417GlnLeu: 2.417 ± 1.076
1.208GlnMet: 1.208 ± 0.941
0.604GlnAsn: 0.604 ± 0.654
2.417GlnPro: 2.417 ± 1.069
1.813GlnGln: 1.813 ± 1.961
2.417GlnArg: 2.417 ± 1.002
3.021GlnSer: 3.021 ± 1.38
1.813GlnThr: 1.813 ± 0.74
4.834GlnVal: 4.834 ± 2.304
0.0GlnTrp: 0.0 ± 0.0
1.813GlnTyr: 1.813 ± 0.978
0.0GlnXaa: 0.0 ± 0.0
Arg
1.208ArgAla: 1.208 ± 0.732
2.417ArgCys: 2.417 ± 1.042
3.021ArgAsp: 3.021 ± 1.376
1.208ArgGlu: 1.208 ± 0.71
7.251ArgPhe: 7.251 ± 2.899
4.23ArgGly: 4.23 ± 1.352
3.021ArgHis: 3.021 ± 1.744
3.021ArgIle: 3.021 ± 1.275
4.23ArgLys: 4.23 ± 1.516
5.438ArgLeu: 5.438 ± 1.669
0.0ArgMet: 0.0 ± 0.0
4.834ArgAsn: 4.834 ± 0.979
4.23ArgPro: 4.23 ± 1.056
1.813ArgGln: 1.813 ± 0.922
10.272ArgArg: 10.272 ± 4.349
4.834ArgSer: 4.834 ± 1.173
2.417ArgThr: 2.417 ± 0.786
3.021ArgVal: 3.021 ± 1.349
0.604ArgTrp: 0.604 ± 0.532
2.417ArgTyr: 2.417 ± 1.064
0.0ArgXaa: 0.0 ± 0.0
Ser
4.834SerAla: 4.834 ± 1.126
0.0SerCys: 0.0 ± 0.0
4.23SerAsp: 4.23 ± 1.957
1.813SerGlu: 1.813 ± 0.577
4.23SerPhe: 4.23 ± 1.439
2.417SerGly: 2.417 ± 2.195
0.0SerHis: 0.0 ± 0.0
4.834SerIle: 4.834 ± 1.747
6.042SerLys: 6.042 ± 2.006
6.647SerLeu: 6.647 ± 2.685
3.021SerMet: 3.021 ± 1.608
7.855SerAsn: 7.855 ± 1.314
4.834SerPro: 4.834 ± 1.347
3.021SerGln: 3.021 ± 1.874
6.042SerArg: 6.042 ± 1.888
14.502SerSer: 14.502 ± 3.496
6.647SerThr: 6.647 ± 1.453
4.23SerVal: 4.23 ± 1.7
0.604SerTrp: 0.604 ± 0.498
3.625SerTyr: 3.625 ± 1.503
0.0SerXaa: 0.0 ± 0.0
Thr
3.021ThrAla: 3.021 ± 1.211
1.208ThrCys: 1.208 ± 1.772
0.604ThrAsp: 0.604 ± 0.454
2.417ThrGlu: 2.417 ± 0.998
2.417ThrPhe: 2.417 ± 1.385
4.23ThrGly: 4.23 ± 1.481
1.813ThrHis: 1.813 ± 1.144
3.625ThrIle: 3.625 ± 1.093
0.604ThrLys: 0.604 ± 0.454
2.417ThrLeu: 2.417 ± 0.786
2.417ThrMet: 2.417 ± 1.156
4.834ThrAsn: 4.834 ± 1.501
4.23ThrPro: 4.23 ± 1.166
1.208ThrGln: 1.208 ± 0.995
2.417ThrArg: 2.417 ± 1.108
3.625ThrSer: 3.625 ± 1.045
4.23ThrThr: 4.23 ± 1.743
4.834ThrVal: 4.834 ± 1.363
1.813ThrTrp: 1.813 ± 0.74
2.417ThrTyr: 2.417 ± 0.992
0.0ThrXaa: 0.0 ± 0.0
Val
1.813ValAla: 1.813 ± 0.77
0.0ValCys: 0.0 ± 0.0
3.021ValAsp: 3.021 ± 1.174
3.021ValGlu: 3.021 ± 1.355
0.604ValPhe: 0.604 ± 0.532
3.021ValGly: 3.021 ± 1.349
0.0ValHis: 0.0 ± 0.0
4.23ValIle: 4.23 ± 1.259
4.834ValLys: 4.834 ± 1.677
1.813ValLeu: 1.813 ± 0.991
1.208ValMet: 1.208 ± 0.862
6.042ValAsn: 6.042 ± 1.867
3.625ValPro: 3.625 ± 1.125
3.021ValGln: 3.021 ± 1.363
1.208ValArg: 1.208 ± 0.62
4.23ValSer: 4.23 ± 1.891
1.813ValThr: 1.813 ± 1.051
6.042ValVal: 6.042 ± 1.898
0.604ValTrp: 0.604 ± 0.532
3.625ValTyr: 3.625 ± 1.528
0.0ValXaa: 0.0 ± 0.0
Trp
1.813TrpAla: 1.813 ± 1.493
0.0TrpCys: 0.0 ± 0.0
0.604TrpAsp: 0.604 ± 0.886
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.604TrpLys: 0.604 ± 0.663
0.0TrpLeu: 0.0 ± 0.0
1.208TrpMet: 1.208 ± 0.862
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.208TrpGln: 1.208 ± 0.601
1.813TrpArg: 1.813 ± 0.907
2.417TrpSer: 2.417 ± 0.983
1.208TrpThr: 1.208 ± 0.611
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.604TrpTyr: 0.604 ± 0.498
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.021TyrAla: 3.021 ± 0.94
1.208TyrCys: 1.208 ± 0.827
1.813TyrAsp: 1.813 ± 1.595
1.208TyrGlu: 1.208 ± 0.62
3.021TyrPhe: 3.021 ± 1.374
1.208TyrGly: 1.208 ± 0.638
0.0TyrHis: 0.0 ± 0.0
3.625TyrIle: 3.625 ± 1.421
1.813TyrLys: 1.813 ± 0.942
2.417TyrLeu: 2.417 ± 1.073
3.021TyrMet: 3.021 ± 1.314
1.813TyrAsn: 1.813 ± 0.718
1.813TyrPro: 1.813 ± 0.922
0.0TyrGln: 0.0 ± 0.0
5.438TyrArg: 5.438 ± 2.0
2.417TyrSer: 2.417 ± 0.917
1.208TyrThr: 1.208 ± 0.601
1.208TyrVal: 1.208 ± 1.098
0.0TyrTrp: 0.0 ± 0.0
0.604TyrTyr: 0.604 ± 0.654
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1656 amino acids)

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
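A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's implementation: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the sample standard deviation of those values is reported as the error. The function names, the fixed seed, and the use of the standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. 'SS') in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement, so whole
    proteins (not single residues) are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    samples = []
    for _ in range(rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_permille(resampled, pair))
    return statistics.stdev(samples)
```

A dipeptide that never occurs in any protein has a frequency of zero in every resample, so its estimated error is also zero, which matches the "0.0 ± 0.0" entries in the table.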

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski