Amino acid dipeptide frequency for Sowbane mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (20 × 20), or 441 (21 × 21) if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids. Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). Because this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
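As an illustration of the calculation described above, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that any non-standard letter is mapped to X (Xaa), and that "expected" means the uniform 2.5‰ baseline; the function names `dipeptide_per_mille` and `classify` are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids
ALPHABET = AMINO_ACIDS + "X"           # X stands for Xaa (assumption: one-letter code)
UNIFORM_EXPECTED = 1000.0 / 400        # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Pairs are counted within each protein only (never across protein boundaries).
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X.
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value, expected=UNIFORM_EXPECTED):
    """Label a per-mille value relative to the uniform expectation."""
    if value > expected:
        return "over"    # would be shown in red
    if value < expected:
        return "under"   # would be shown in blue
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAC", "CA"])  # toy sequences, not real data
    print(round(freqs["AA"], 3), classify(freqs["AA"]))
```

The toy input is only there to make the sketch runnable; real use would pass the protein sequences of the proteome.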

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.244AlaAla: 4.244 ± 0.516
1.592AlaCys: 1.592 ± 0.743
2.122AlaAsp: 2.122 ± 0.659
4.775AlaGlu: 4.775 ± 1.627
1.592AlaPhe: 1.592 ± 0.61
6.366AlaGly: 6.366 ± 1.296
2.122AlaHis: 2.122 ± 0.559
2.122AlaIle: 2.122 ± 0.996
3.714AlaLys: 3.714 ± 1.187
7.427AlaLeu: 7.427 ± 0.973
1.061AlaMet: 1.061 ± 0.57
0.531AlaAsn: 0.531 ± 0.578
3.183AlaPro: 3.183 ± 1.427
2.653AlaGln: 2.653 ± 1.27
4.244AlaArg: 4.244 ± 0.244
7.958AlaSer: 7.958 ± 1.283
2.653AlaThr: 2.653 ± 2.402
7.427AlaVal: 7.427 ± 1.27
1.061AlaTrp: 1.061 ± 0.279
1.061AlaTyr: 1.061 ± 0.665
0.0AlaXaa: 0.0 ± 0.0
Cys
1.592CysAla: 1.592 ± 0.61
3.183CysCys: 3.183 ± 1.381
4.244CysAsp: 4.244 ± 0.998
1.061CysGlu: 1.061 ± 0.577
1.592CysPhe: 1.592 ± 0.998
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
2.122CysIle: 2.122 ± 0.501
1.592CysLys: 1.592 ± 0.998
2.122CysLeu: 2.122 ± 0.659
0.531CysMet: 0.531 ± 0.333
0.0CysAsn: 0.0 ± 0.0
1.061CysPro: 1.061 ± 0.57
1.061CysGln: 1.061 ± 0.279
1.592CysArg: 1.592 ± 0.448
3.714CysSer: 3.714 ± 0.992
2.653CysThr: 2.653 ± 0.593
0.531CysVal: 0.531 ± 0.333
0.531CysTrp: 0.531 ± 0.641
1.061CysTyr: 1.061 ± 0.577
0.0CysXaa: 0.0 ± 0.0
Asp
2.122AspAla: 2.122 ± 0.559
4.244AspCys: 4.244 ± 1.282
5.836AspAsp: 5.836 ± 1.359
4.775AspGlu: 4.775 ± 1.09
2.122AspPhe: 2.122 ± 0.659
4.244AspGly: 4.244 ± 0.244
0.0AspHis: 0.0 ± 0.0
1.592AspIle: 1.592 ± 0.733
2.653AspLys: 2.653 ± 1.274
2.122AspLeu: 2.122 ± 0.386
0.531AspMet: 0.531 ± 0.578
2.653AspAsn: 2.653 ± 0.474
2.653AspPro: 2.653 ± 0.58
2.653AspGln: 2.653 ± 0.699
1.061AspArg: 1.061 ± 0.279
2.653AspSer: 2.653 ± 1.274
1.061AspThr: 1.061 ± 0.279
3.183AspVal: 3.183 ± 0.933
2.122AspTrp: 2.122 ± 0.659
3.183AspTyr: 3.183 ± 0.57
0.0AspXaa: 0.0 ± 0.0
Glu
4.244GluAla: 4.244 ± 0.968
1.592GluCys: 1.592 ± 0.61
4.244GluAsp: 4.244 ± 1.118
5.305GluGlu: 5.305 ± 1.4
0.531GluPhe: 0.531 ± 0.641
2.653GluGly: 2.653 ± 0.972
0.0GluHis: 0.0 ± 0.0
5.836GluIle: 5.836 ± 1.086
3.183GluLys: 3.183 ± 1.427
4.244GluLeu: 4.244 ± 2.217
1.592GluMet: 1.592 ± 0.448
0.531GluAsn: 0.531 ± 0.333
3.714GluPro: 3.714 ± 0.505
1.061GluGln: 1.061 ± 0.577
4.244GluArg: 4.244 ± 1.318
2.122GluSer: 2.122 ± 0.501
3.714GluThr: 3.714 ± 1.024
5.305GluVal: 5.305 ± 1.533
0.0GluTrp: 0.0 ± 0.0
2.122GluTyr: 2.122 ± 0.559
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
2.122PheCys: 2.122 ± 0.501
2.653PheAsp: 2.653 ± 0.58
1.061PheGlu: 1.061 ± 0.866
1.061PhePhe: 1.061 ± 0.279
4.775PheGly: 4.775 ± 0.755
0.0PheHis: 0.0 ± 0.0
1.061PheIle: 1.061 ± 0.279
2.653PheLys: 2.653 ± 0.593
0.531PheLeu: 0.531 ± 0.333
0.531PheMet: 0.531 ± 0.333
0.531PheAsn: 0.531 ± 0.578
1.592PhePro: 1.592 ± 1.174
1.592PheGln: 1.592 ± 0.69
1.592PheArg: 1.592 ± 0.998
2.122PheSer: 2.122 ± 1.721
1.592PheThr: 1.592 ± 0.69
4.244PheVal: 4.244 ± 0.813
1.061PheTrp: 1.061 ± 0.279
0.531PheTyr: 0.531 ± 0.333
0.0PheXaa: 0.0 ± 0.0
Gly
5.305GlyAla: 5.305 ± 0.692
2.653GlyCys: 2.653 ± 0.509
4.775GlyAsp: 4.775 ± 0.591
0.531GlyGlu: 0.531 ± 0.481
2.122GlyPhe: 2.122 ± 0.659
4.244GlyGly: 4.244 ± 0.516
1.592GlyHis: 1.592 ± 0.733
4.244GlyIle: 4.244 ± 1.228
5.305GlyLys: 5.305 ± 1.249
4.775GlyLeu: 4.775 ± 1.145
1.592GlyMet: 1.592 ± 0.382
2.653GlyAsn: 2.653 ± 0.906
3.714GlyPro: 3.714 ± 1.187
1.592GlyGln: 1.592 ± 0.713
4.775GlyArg: 4.775 ± 0.908
6.897GlySer: 6.897 ± 1.517
2.653GlyThr: 2.653 ± 0.474
6.366GlyVal: 6.366 ± 1.336
0.0GlyTrp: 0.0 ± 0.0
4.775GlyTyr: 4.775 ± 0.591
0.0GlyXaa: 0.0 ± 0.0
His
0.531HisAla: 0.531 ± 0.578
0.531HisCys: 0.531 ± 0.333
0.0HisAsp: 0.0 ± 0.0
0.531HisGlu: 0.531 ± 0.578
0.0HisPhe: 0.0 ± 0.0
1.061HisGly: 1.061 ± 0.665
2.653HisHis: 2.653 ± 0.509
1.592HisIle: 1.592 ± 0.382
1.592HisLys: 1.592 ± 0.382
2.122HisLeu: 2.122 ± 0.659
0.531HisMet: 0.531 ± 0.389
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
1.061HisGln: 1.061 ± 0.279
1.592HisArg: 1.592 ± 0.61
1.061HisSer: 1.061 ± 0.57
1.061HisThr: 1.061 ± 0.665
1.061HisVal: 1.061 ± 0.279
0.531HisTrp: 0.531 ± 0.641
1.061HisTyr: 1.061 ± 0.577
0.0HisXaa: 0.0 ± 0.0
Ile
5.305IleAla: 5.305 ± 0.19
0.0IleCys: 0.0 ± 0.0
1.061IleAsp: 1.061 ± 0.57
4.244IleGlu: 4.244 ± 0.813
0.531IlePhe: 0.531 ± 0.333
4.244IleGly: 4.244 ± 0.684
1.061IleHis: 1.061 ± 0.279
1.592IleIle: 1.592 ± 0.382
0.531IleLys: 0.531 ± 0.333
2.653IleLeu: 2.653 ± 0.474
0.0IleMet: 0.0 ± 0.0
2.653IleAsn: 2.653 ± 0.58
2.122IlePro: 2.122 ± 0.386
1.592IleGln: 1.592 ± 0.382
3.714IleArg: 3.714 ± 0.827
5.836IleSer: 5.836 ± 0.956
2.122IleThr: 2.122 ± 0.735
4.775IleVal: 4.775 ± 2.232
0.531IleTrp: 0.531 ± 0.578
1.592IleTyr: 1.592 ± 0.448
0.0IleXaa: 0.0 ± 0.0
Lys
4.244LysAla: 4.244 ± 0.772
0.531LysCys: 0.531 ± 0.333
2.653LysAsp: 2.653 ± 0.972
1.061LysGlu: 1.061 ± 0.845
1.592LysPhe: 1.592 ± 0.743
2.653LysGly: 2.653 ± 1.159
1.592LysHis: 1.592 ± 0.69
5.305LysIle: 5.305 ± 1.161
3.714LysLys: 3.714 ± 1.728
3.183LysLeu: 3.183 ± 0.763
0.0LysMet: 0.0 ± 0.484
1.061LysAsn: 1.061 ± 0.279
4.244LysPro: 4.244 ± 0.684
1.592LysGln: 1.592 ± 0.448
2.653LysArg: 2.653 ± 0.972
6.366LysSer: 6.366 ± 1.422
3.183LysThr: 3.183 ± 1.091
2.653LysVal: 2.653 ± 0.699
2.653LysTrp: 2.653 ± 0.906
2.122LysTyr: 2.122 ± 0.386
0.0LysXaa: 0.0 ± 0.0
Leu
5.836LeuAla: 5.836 ± 2.366
1.061LeuCys: 1.061 ± 0.665
5.305LeuAsp: 5.305 ± 0.808
1.592LeuGlu: 1.592 ± 1.217
3.714LeuPhe: 3.714 ± 1.621
7.427LeuGly: 7.427 ± 1.655
2.122LeuHis: 2.122 ± 0.659
4.244LeuIle: 4.244 ± 1.318
2.122LeuLys: 2.122 ± 0.659
12.732LeuLeu: 12.732 ± 2.234
2.653LeuMet: 2.653 ± 0.58
4.244LeuAsn: 4.244 ± 1.228
2.653LeuPro: 2.653 ± 1.159
3.714LeuGln: 3.714 ± 1.445
5.836LeuArg: 5.836 ± 0.412
8.488LeuSer: 8.488 ± 1.781
3.183LeuThr: 3.183 ± 0.763
7.958LeuVal: 7.958 ± 0.974
2.653LeuTrp: 2.653 ± 0.593
2.122LeuTyr: 2.122 ± 0.707
0.0LeuXaa: 0.0 ± 0.0
Met
2.122MetAla: 2.122 ± 0.386
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.061MetGlu: 1.061 ± 0.279
0.531MetPhe: 0.531 ± 0.578
1.592MetGly: 1.592 ± 0.733
1.061MetHis: 1.061 ± 0.279
0.0MetIle: 0.0 ± 0.0
2.122MetLys: 2.122 ± 0.501
1.592MetLeu: 1.592 ± 0.382
1.061MetMet: 1.061 ± 0.279
1.061MetAsn: 1.061 ± 0.57
0.531MetPro: 0.531 ± 0.578
0.0MetGln: 0.0 ± 0.0
0.531MetArg: 0.531 ± 0.333
1.592MetSer: 1.592 ± 0.448
1.061MetThr: 1.061 ± 0.279
0.0MetVal: 0.0 ± 0.0
1.592MetTrp: 1.592 ± 0.382
0.531MetTyr: 0.531 ± 0.578
0.0MetXaa: 0.0 ± 0.0
Asn
0.531AsnAla: 0.531 ± 0.578
1.061AsnCys: 1.061 ± 0.577
0.531AsnAsp: 0.531 ± 0.578
5.305AsnGlu: 5.305 ± 1.161
2.653AsnPhe: 2.653 ± 1.114
2.122AsnGly: 2.122 ± 0.735
1.061AsnHis: 1.061 ± 0.279
1.592AsnIle: 1.592 ± 0.448
4.775AsnLys: 4.775 ± 1.345
4.775AsnLeu: 4.775 ± 1.186
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
1.061AsnPro: 1.061 ± 0.279
3.183AsnGln: 3.183 ± 0.489
2.653AsnArg: 2.653 ± 0.593
6.897AsnSer: 6.897 ± 1.189
0.0AsnThr: 0.0 ± 0.0
1.061AsnVal: 1.061 ± 0.279
0.531AsnTrp: 0.531 ± 0.333
1.061AsnTyr: 1.061 ± 0.57
0.0AsnXaa: 0.0 ± 0.0
Pro
7.427ProAla: 7.427 ± 2.44
1.592ProCys: 1.592 ± 0.448
2.122ProAsp: 2.122 ± 1.22
2.653ProGlu: 2.653 ± 0.906
1.592ProPhe: 1.592 ± 0.382
3.714ProGly: 3.714 ± 0.827
0.531ProHis: 0.531 ± 0.333
0.531ProIle: 0.531 ± 0.481
2.122ProLys: 2.122 ± 0.559
4.244ProLeu: 4.244 ± 1.069
0.531ProMet: 0.531 ± 0.333
4.244ProAsn: 4.244 ± 0.968
2.653ProPro: 2.653 ± 0.474
1.592ProGln: 1.592 ± 0.382
3.714ProArg: 3.714 ± 0.636
4.775ProSer: 4.775 ± 1.118
2.122ProThr: 2.122 ± 0.707
6.366ProVal: 6.366 ± 0.6
0.531ProTrp: 0.531 ± 0.333
2.122ProTyr: 2.122 ± 0.559
0.0ProXaa: 0.0 ± 0.0
Gln
3.714GlnAla: 3.714 ± 1.024
1.061GlnCys: 1.061 ± 0.665
2.653GlnAsp: 2.653 ± 0.972
2.122GlnGlu: 2.122 ± 0.659
0.531GlnPhe: 0.531 ± 0.333
3.183GlnGly: 3.183 ± 1.134
0.0GlnHis: 0.0 ± 0.0
1.592GlnIle: 1.592 ± 1.174
1.061GlnLys: 1.061 ± 0.963
2.653GlnLeu: 2.653 ± 0.699
0.531GlnMet: 0.531 ± 0.595
0.531GlnAsn: 0.531 ± 0.333
2.653GlnPro: 2.653 ± 1.114
1.592GlnGln: 1.592 ± 0.382
2.653GlnArg: 2.653 ± 0.58
2.653GlnSer: 2.653 ± 1.917
2.653GlnThr: 2.653 ± 1.73
3.714GlnVal: 3.714 ± 0.349
0.531GlnTrp: 0.531 ± 0.333
1.061GlnTyr: 1.061 ± 1.156
0.0GlnXaa: 0.0 ± 0.0
Arg
3.714ArgAla: 3.714 ± 0.636
1.592ArgCys: 1.592 ± 0.998
0.531ArgAsp: 0.531 ± 0.333
2.122ArgGlu: 2.122 ± 0.659
3.714ArgPhe: 3.714 ± 0.619
4.775ArgGly: 4.775 ± 0.376
1.061ArgHis: 1.061 ± 0.866
1.592ArgIle: 1.592 ± 0.382
3.714ArgLys: 3.714 ± 1.066
7.958ArgLeu: 7.958 ± 1.041
0.531ArgMet: 0.531 ± 0.333
3.714ArgAsn: 3.714 ± 1.149
3.183ArgPro: 3.183 ± 0.36
1.061ArgGln: 1.061 ± 0.279
1.592ArgArg: 1.592 ± 0.61
6.897ArgSer: 6.897 ± 0.303
2.653ArgThr: 2.653 ± 0.58
1.592ArgVal: 1.592 ± 0.382
0.0ArgTrp: 0.0 ± 0.0
3.183ArgTyr: 3.183 ± 1.294
0.0ArgXaa: 0.0 ± 0.0
Ser
3.714SerAla: 3.714 ± 1.066
2.653SerCys: 2.653 ± 1.086
2.122SerAsp: 2.122 ± 0.559
3.183SerGlu: 3.183 ± 0.57
4.244SerPhe: 4.244 ± 1.282
7.958SerGly: 7.958 ± 2.718
0.531SerHis: 0.531 ± 0.333
1.061SerIle: 1.061 ± 0.577
6.897SerLys: 6.897 ± 1.636
9.549SerLeu: 9.549 ± 2.012
2.122SerMet: 2.122 ± 0.559
5.836SerAsn: 5.836 ± 1.11
8.488SerPro: 8.488 ± 1.915
5.836SerGln: 5.836 ± 1.588
4.244SerArg: 4.244 ± 0.516
11.141SerSer: 11.141 ± 2.124
6.897SerThr: 6.897 ± 1.051
5.305SerVal: 5.305 ± 1.595
2.122SerTrp: 2.122 ± 0.985
1.592SerTyr: 1.592 ± 0.382
0.0SerXaa: 0.0 ± 0.0
Thr
7.958ThrAla: 7.958 ± 1.041
1.061ThrCys: 1.061 ± 0.665
1.061ThrAsp: 1.061 ± 0.57
3.714ThrGlu: 3.714 ± 1.103
0.531ThrPhe: 0.531 ± 0.333
2.122ThrGly: 2.122 ± 0.559
0.531ThrHis: 0.531 ± 0.333
4.244ThrIle: 4.244 ± 1.552
0.531ThrLys: 0.531 ± 0.578
3.714ThrLeu: 3.714 ± 1.103
1.592ThrMet: 1.592 ± 0.448
2.122ThrAsn: 2.122 ± 0.559
3.714ThrPro: 3.714 ± 1.676
0.531ThrGln: 0.531 ± 0.481
1.592ThrArg: 1.592 ± 0.382
3.714ThrSer: 3.714 ± 0.349
5.836ThrThr: 5.836 ± 1.047
4.775ThrVal: 4.775 ± 0.718
0.0ThrTrp: 0.0 ± 0.0
2.122ThrTyr: 2.122 ± 0.707
0.0ThrXaa: 0.0 ± 0.0
Val
3.183ValAla: 3.183 ± 1.336
1.592ValCys: 1.592 ± 0.69
3.183ValAsp: 3.183 ± 0.637
6.897ValGlu: 6.897 ± 1.701
2.653ValPhe: 2.653 ± 1.863
4.775ValGly: 4.775 ± 0.591
2.122ValHis: 2.122 ± 0.501
2.122ValIle: 2.122 ± 0.559
4.775ValLys: 4.775 ± 1.077
5.836ValLeu: 5.836 ± 1.11
1.592ValMet: 1.592 ± 1.099
4.775ValAsn: 4.775 ± 1.186
5.305ValPro: 5.305 ± 0.947
4.244ValGln: 4.244 ± 0.93
3.714ValArg: 3.714 ± 2.372
4.244ValSer: 4.244 ± 1.497
4.775ValThr: 4.775 ± 0.376
6.366ValVal: 6.366 ± 1.03
2.653ValTrp: 2.653 ± 0.474
1.592ValTyr: 1.592 ± 0.448
0.0ValXaa: 0.0 ± 0.0
Trp
1.592TrpAla: 1.592 ± 0.733
0.531TrpCys: 0.531 ± 0.333
0.531TrpAsp: 0.531 ± 0.333
1.061TrpGlu: 1.061 ± 0.665
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.531TrpHis: 0.531 ± 0.578
1.061TrpIle: 1.061 ± 0.279
0.531TrpLys: 0.531 ± 0.578
1.592TrpLeu: 1.592 ± 0.448
0.531TrpMet: 0.531 ± 0.302
1.592TrpAsn: 1.592 ± 0.61
1.592TrpPro: 1.592 ± 0.69
0.0TrpGln: 0.0 ± 0.0
1.061TrpArg: 1.061 ± 0.279
4.775TrpSer: 4.775 ± 0.591
1.061TrpThr: 1.061 ± 0.577
1.061TrpVal: 1.061 ± 0.279
2.122TrpTrp: 2.122 ± 0.559
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.531TyrAla: 0.531 ± 0.333
1.592TyrCys: 1.592 ± 0.733
5.836TyrAsp: 5.836 ± 0.773
3.183TyrGlu: 3.183 ± 1.044
0.0TyrPhe: 0.0 ± 0.0
2.653TyrGly: 2.653 ± 0.906
0.0TyrHis: 0.0 ± 0.0
2.653TyrIle: 2.653 ± 1.273
0.0TyrLys: 0.0 ± 0.0
5.836TyrLeu: 5.836 ± 0.774
0.0TyrMet: 0.0 ± 0.0
2.122TyrAsn: 2.122 ± 0.559
1.061TyrPro: 1.061 ± 0.665
0.531TyrGln: 0.531 ± 0.333
2.122TyrArg: 2.122 ± 0.386
1.592TyrSer: 1.592 ± 0.382
0.531TyrThr: 0.531 ± 0.578
2.653TyrVal: 2.653 ± 0.509
0.0TyrTrp: 0.0 ± 0.0
1.061TyrTyr: 1.061 ± 0.279
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1886 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
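A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates. The page does not say which spread measure is used, and the helper names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Each round draws len(proteins) proteins with replacement and recomputes
    the per-mille frequency; the population standard deviation of the
    replicate values is returned (an assumption, see the text above).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "CCCC", "ACAC"]  # toy sequences, not real data
    print(bootstrap_error(toy, "AA"))
```

With only 4 proteins, as in this proteome, the resampled sets differ little from one another, so error estimates obtained this way should be read with caution.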

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski