Amino acid dipepetide frequency for Southern bean mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.198AlaAla: 7.198 ± 1.052
1.028AlaCys: 1.028 ± 0.65
3.085AlaAsp: 3.085 ± 0.749
4.113AlaGlu: 4.113 ± 2.005
1.542AlaPhe: 1.542 ± 0.815
4.627AlaGly: 4.627 ± 0.804
0.514AlaHis: 0.514 ± 0.904
5.141AlaIle: 5.141 ± 1.352
4.627AlaLys: 4.627 ± 1.403
4.627AlaLeu: 4.627 ± 0.836
1.542AlaMet: 1.542 ± 1.327
2.571AlaAsn: 2.571 ± 1.917
4.113AlaPro: 4.113 ± 0.981
3.599AlaGln: 3.599 ± 0.55
4.113AlaArg: 4.113 ± 0.981
8.74AlaSer: 8.74 ± 2.218
5.141AlaThr: 5.141 ± 1.693
4.627AlaVal: 4.627 ± 1.727
3.085AlaTrp: 3.085 ± 0.698
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
2.057CysAla: 2.057 ± 0.679
0.514CysCys: 0.514 ± 0.325
1.542CysAsp: 1.542 ± 0.514
1.028CysGlu: 1.028 ± 1.807
0.514CysPhe: 0.514 ± 0.325
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.028CysIle: 1.028 ± 0.853
2.057CysLys: 2.057 ± 1.299
2.057CysLeu: 2.057 ± 0.77
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
3.599CysPro: 3.599 ± 0.781
1.028CysGln: 1.028 ± 0.382
2.571CysArg: 2.571 ± 1.514
2.571CysSer: 2.571 ± 0.501
0.514CysThr: 0.514 ± 0.325
1.028CysVal: 1.028 ± 0.382
0.514CysTrp: 0.514 ± 0.325
2.571CysTyr: 2.571 ± 0.539
0.0CysXaa: 0.0 ± 0.0
Asp
3.085AspAla: 3.085 ± 0.698
1.028AspCys: 1.028 ± 0.382
7.198AspAsp: 7.198 ± 2.415
1.542AspGlu: 1.542 ± 0.514
4.113AspPhe: 4.113 ± 1.401
3.599AspGly: 3.599 ± 0.55
1.028AspHis: 1.028 ± 0.382
2.571AspIle: 2.571 ± 0.846
2.571AspLys: 2.571 ± 1.065
4.113AspLeu: 4.113 ± 1.541
0.514AspMet: 0.514 ± 0.727
1.028AspAsn: 1.028 ± 0.382
1.542AspPro: 1.542 ± 0.514
1.028AspGln: 1.028 ± 0.853
0.514AspArg: 0.514 ± 0.325
2.571AspSer: 2.571 ± 0.846
2.571AspThr: 2.571 ± 1.048
2.571AspVal: 2.571 ± 0.804
2.057AspTrp: 2.057 ± 0.77
0.514AspTyr: 0.514 ± 0.325
0.0AspXaa: 0.0 ± 0.0
Glu
4.113GluAla: 4.113 ± 1.347
0.514GluCys: 0.514 ± 0.489
4.627GluAsp: 4.627 ± 1.542
3.085GluGlu: 3.085 ± 1.028
2.571GluPhe: 2.571 ± 0.846
3.085GluGly: 3.085 ± 1.949
1.028GluHis: 1.028 ± 0.65
4.113GluIle: 4.113 ± 1.359
4.113GluLys: 4.113 ± 1.817
6.684GluLeu: 6.684 ± 1.441
0.514GluMet: 0.514 ± 0.325
1.028GluAsn: 1.028 ± 0.853
4.113GluPro: 4.113 ± 0.779
0.514GluGln: 0.514 ± 0.325
3.599GluArg: 3.599 ± 1.99
3.599GluSer: 3.599 ± 2.101
5.141GluThr: 5.141 ± 2.098
3.599GluVal: 3.599 ± 1.269
0.514GluTrp: 0.514 ± 0.325
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.057PheAla: 2.057 ± 0.765
1.542PheCys: 1.542 ± 0.922
2.571PheAsp: 2.571 ± 1.065
2.571PheGlu: 2.571 ± 0.846
0.514PhePhe: 0.514 ± 0.325
2.057PheGly: 2.057 ± 0.77
1.028PheHis: 1.028 ± 1.807
1.028PheIle: 1.028 ± 0.382
1.028PheLys: 1.028 ± 0.636
3.085PheLeu: 3.085 ± 1.467
0.514PheMet: 0.514 ± 0.325
1.028PheAsn: 1.028 ± 0.853
0.514PhePro: 0.514 ± 0.325
1.028PheGln: 1.028 ± 0.636
1.542PheArg: 1.542 ± 0.922
3.085PheSer: 3.085 ± 0.742
3.085PheThr: 3.085 ± 1.533
5.656PheVal: 5.656 ± 1.404
0.0PheTrp: 0.0 ± 0.0
2.571PheTyr: 2.571 ± 0.742
0.0PheXaa: 0.0 ± 0.0
Gly
3.085GlyAla: 3.085 ± 0.983
1.028GlyCys: 1.028 ± 0.382
3.599GlyAsp: 3.599 ± 1.208
3.085GlyGlu: 3.085 ± 2.56
4.627GlyPhe: 4.627 ± 1.298
6.684GlyGly: 6.684 ± 0.679
1.542GlyHis: 1.542 ± 0.922
1.542GlyIle: 1.542 ± 0.514
7.712GlyLys: 7.712 ± 1.853
2.057GlyLeu: 2.057 ± 0.39
2.057GlyMet: 2.057 ± 0.77
1.028GlyAsn: 1.028 ± 0.853
2.571GlyPro: 2.571 ± 0.846
1.542GlyGln: 1.542 ± 0.701
4.113GlyArg: 4.113 ± 0.791
10.283GlySer: 10.283 ± 1.557
4.113GlyThr: 4.113 ± 1.045
7.198GlyVal: 7.198 ± 1.312
2.057GlyTrp: 2.057 ± 0.77
3.085GlyTyr: 3.085 ± 0.749
0.0GlyXaa: 0.0 ± 0.0
His
0.514HisAla: 0.514 ± 0.325
0.514HisCys: 0.514 ± 0.904
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.514HisPhe: 0.514 ± 0.904
1.542HisGly: 1.542 ± 0.514
1.028HisHis: 1.028 ± 1.807
0.514HisIle: 0.514 ± 0.325
1.028HisLys: 1.028 ± 0.382
0.514HisLeu: 0.514 ± 0.325
0.514HisMet: 0.514 ± 0.727
0.514HisAsn: 0.514 ± 0.325
1.542HisPro: 1.542 ± 0.733
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
4.627HisSer: 4.627 ± 1.593
0.514HisThr: 0.514 ± 0.325
3.599HisVal: 3.599 ± 1.208
0.514HisTrp: 0.514 ± 0.325
0.514HisTyr: 0.514 ± 0.325
0.0HisXaa: 0.0 ± 0.0
Ile
3.599IleAla: 3.599 ± 2.58
2.057IleCys: 2.057 ± 1.601
2.057IleAsp: 2.057 ± 1.601
5.656IleGlu: 5.656 ± 1.133
0.0IlePhe: 0.0 ± 0.0
2.571IleGly: 2.571 ± 0.846
1.028IleHis: 1.028 ± 0.636
1.028IleIle: 1.028 ± 1.807
2.057IleLys: 2.057 ± 0.765
3.085IleLeu: 3.085 ± 1.776
0.514IleMet: 0.514 ± 0.325
2.057IleAsn: 2.057 ± 0.39
3.599IlePro: 3.599 ± 1.208
1.028IleGln: 1.028 ± 1.282
2.571IleArg: 2.571 ± 1.065
2.571IleSer: 2.571 ± 1.065
1.542IleThr: 1.542 ± 0.514
3.085IleVal: 3.085 ± 1.374
0.0IleTrp: 0.0 ± 0.0
1.028IleTyr: 1.028 ± 0.382
0.0IleXaa: 0.0 ± 0.0
Lys
6.684LysAla: 6.684 ± 1.073
0.0LysCys: 0.0 ± 0.0
3.085LysAsp: 3.085 ± 1.373
2.057LysGlu: 2.057 ± 0.765
1.028LysPhe: 1.028 ± 0.65
1.542LysGly: 1.542 ± 0.733
1.542LysHis: 1.542 ± 0.514
2.571LysIle: 2.571 ± 1.176
2.057LysLys: 2.057 ± 0.775
5.141LysLeu: 5.141 ± 1.071
2.057LysMet: 2.057 ± 0.765
0.0LysAsn: 0.0 ± 0.0
3.085LysPro: 3.085 ± 1.147
3.599LysGln: 3.599 ± 1.415
2.057LysArg: 2.057 ± 0.39
5.141LysSer: 5.141 ± 1.19
3.085LysThr: 3.085 ± 0.437
3.599LysVal: 3.599 ± 1.208
1.542LysTrp: 1.542 ± 1.151
3.085LysTyr: 3.085 ± 0.742
0.0LysXaa: 0.0 ± 0.0
Leu
6.17LeuAla: 6.17 ± 1.169
2.057LeuCys: 2.057 ± 0.888
2.571LeuAsp: 2.571 ± 1.263
5.656LeuGlu: 5.656 ± 0.487
4.113LeuPhe: 4.113 ± 0.996
8.226LeuGly: 8.226 ± 1.268
1.028LeuHis: 1.028 ± 0.65
3.599LeuIle: 3.599 ± 0.55
2.057LeuLys: 2.057 ± 0.765
8.74LeuLeu: 8.74 ± 2.894
0.514LeuMet: 0.514 ± 0.325
3.085LeuAsn: 3.085 ± 0.953
6.17LeuPro: 6.17 ± 0.612
2.571LeuGln: 2.571 ± 1.176
6.17LeuArg: 6.17 ± 2.874
10.797LeuSer: 10.797 ± 1.274
3.085LeuThr: 3.085 ± 1.374
7.712LeuVal: 7.712 ± 0.785
1.542LeuTrp: 1.542 ± 0.514
3.599LeuTyr: 3.599 ± 0.55
0.0LeuXaa: 0.0 ± 0.0
Met
2.057MetAla: 2.057 ± 2.044
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
3.599MetGly: 3.599 ± 1.023
1.028MetHis: 1.028 ± 0.382
0.0MetIle: 0.0 ± 0.0
1.028MetLys: 1.028 ± 0.382
2.057MetLeu: 2.057 ± 0.39
1.028MetMet: 1.028 ± 0.382
0.514MetAsn: 0.514 ± 0.325
1.028MetPro: 1.028 ± 0.636
1.028MetGln: 1.028 ± 0.382
2.057MetArg: 2.057 ± 0.77
0.514MetSer: 0.514 ± 0.904
1.028MetThr: 1.028 ± 0.382
1.542MetVal: 1.542 ± 0.514
0.0MetTrp: 0.0 ± 0.0
1.028MetTyr: 1.028 ± 0.382
0.0MetXaa: 0.0 ± 0.0
Asn
2.571AsnAla: 2.571 ± 1.557
0.514AsnCys: 0.514 ± 0.489
0.0AsnAsp: 0.0 ± 0.0
1.542AsnGlu: 1.542 ± 0.514
1.028AsnPhe: 1.028 ± 0.853
2.057AsnGly: 2.057 ± 0.976
1.028AsnHis: 1.028 ± 0.382
0.514AsnIle: 0.514 ± 0.727
1.028AsnLys: 1.028 ± 0.65
3.085AsnLeu: 3.085 ± 0.742
0.514AsnMet: 0.514 ± 0.654
1.542AsnAsn: 1.542 ± 0.514
2.057AsnPro: 2.057 ± 0.765
1.028AsnGln: 1.028 ± 0.636
2.571AsnArg: 2.571 ± 0.539
2.571AsnSer: 2.571 ± 1.065
1.028AsnThr: 1.028 ± 0.636
1.542AsnVal: 1.542 ± 0.733
0.514AsnTrp: 0.514 ± 0.727
1.542AsnTyr: 1.542 ± 0.815
0.0AsnXaa: 0.0 ± 0.0
Pro
4.113ProAla: 4.113 ± 0.779
1.028ProCys: 1.028 ± 0.382
2.057ProAsp: 2.057 ± 0.77
4.113ProGlu: 4.113 ± 1.541
4.113ProPhe: 4.113 ± 0.981
6.684ProGly: 6.684 ± 2.189
1.028ProHis: 1.028 ± 0.65
2.571ProIle: 2.571 ± 0.539
3.085ProLys: 3.085 ± 1.147
5.141ProLeu: 5.141 ± 1.549
0.0ProMet: 0.0 ± 0.0
0.514ProAsn: 0.514 ± 0.489
7.198ProPro: 7.198 ± 4.202
3.085ProGln: 3.085 ± 0.744
2.057ProArg: 2.057 ± 1.103
8.74ProSer: 8.74 ± 1.034
3.599ProThr: 3.599 ± 0.781
4.113ProVal: 4.113 ± 1.614
0.514ProTrp: 0.514 ± 0.325
1.542ProTyr: 1.542 ± 0.514
0.0ProXaa: 0.0 ± 0.0
Gln
2.571GlnAla: 2.571 ± 0.804
0.514GlnCys: 0.514 ± 0.904
0.514GlnAsp: 0.514 ± 0.325
4.113GlnGlu: 4.113 ± 1.529
1.028GlnPhe: 1.028 ± 0.382
2.571GlnGly: 2.571 ± 1.029
0.0GlnHis: 0.0 ± 0.0
0.514GlnIle: 0.514 ± 0.904
0.514GlnLys: 0.514 ± 0.325
5.141GlnLeu: 5.141 ± 1.709
1.028GlnMet: 1.028 ± 0.382
1.542GlnAsn: 1.542 ± 0.922
2.057GlnPro: 2.057 ± 0.775
1.028GlnGln: 1.028 ± 0.636
1.028GlnArg: 1.028 ± 0.382
4.627GlnSer: 4.627 ± 1.296
1.542GlnThr: 1.542 ± 0.815
2.571GlnVal: 2.571 ± 0.539
0.0GlnTrp: 0.0 ± 0.0
0.514GlnTyr: 0.514 ± 0.727
0.0GlnXaa: 0.0 ± 0.0
Arg
1.028ArgAla: 1.028 ± 0.382
2.057ArgCys: 2.057 ± 0.888
0.0ArgAsp: 0.0 ± 0.0
3.599ArgGlu: 3.599 ± 0.816
2.571ArgPhe: 2.571 ± 1.747
5.656ArgGly: 5.656 ± 0.803
1.028ArgHis: 1.028 ± 0.65
2.057ArgIle: 2.057 ± 1.088
2.571ArgLys: 2.571 ± 0.742
7.712ArgLeu: 7.712 ± 2.043
1.542ArgMet: 1.542 ± 0.514
2.571ArgAsn: 2.571 ± 0.539
2.057ArgPro: 2.057 ± 0.775
1.542ArgGln: 1.542 ± 1.151
4.113ArgArg: 4.113 ± 3.558
3.599ArgSer: 3.599 ± 0.781
2.571ArgThr: 2.571 ± 0.539
5.141ArgVal: 5.141 ± 2.159
0.0ArgTrp: 0.0 ± 0.0
2.571ArgTyr: 2.571 ± 0.804
0.0ArgXaa: 0.0 ± 0.0
Ser
6.684SerAla: 6.684 ± 1.823
4.113SerCys: 4.113 ± 0.723
4.113SerAsp: 4.113 ± 0.421
3.599SerGlu: 3.599 ± 1.25
4.113SerPhe: 4.113 ± 1.529
8.74SerGly: 8.74 ± 1.583
1.542SerHis: 1.542 ± 0.514
4.627SerIle: 4.627 ± 1.867
5.656SerLys: 5.656 ± 0.605
12.339SerLeu: 12.339 ± 2.289
2.057SerMet: 2.057 ± 0.826
2.571SerAsn: 2.571 ± 0.539
6.684SerPro: 6.684 ± 2.189
2.571SerGln: 2.571 ± 0.846
5.141SerArg: 5.141 ± 1.071
9.254SerSer: 9.254 ± 1.25
5.656SerThr: 5.656 ± 2.894
5.656SerVal: 5.656 ± 1.232
3.085SerTrp: 3.085 ± 1.028
3.085SerTyr: 3.085 ± 0.742
0.0SerXaa: 0.0 ± 0.0
Thr
7.198ThrAla: 7.198 ± 2.519
1.542ThrCys: 1.542 ± 0.514
1.542ThrAsp: 1.542 ± 0.514
2.057ThrGlu: 2.057 ± 1.969
0.0ThrPhe: 0.0 ± 0.0
2.057ThrGly: 2.057 ± 0.976
1.542ThrHis: 1.542 ± 0.514
1.542ThrIle: 1.542 ± 0.733
2.571ThrLys: 2.571 ± 1.19
6.17ThrLeu: 6.17 ± 1.239
1.028ThrMet: 1.028 ± 0.679
1.542ThrAsn: 1.542 ± 0.701
5.656ThrPro: 5.656 ± 1.458
2.057ThrGln: 2.057 ± 0.775
5.141ThrArg: 5.141 ± 0.311
5.656ThrSer: 5.656 ± 1.596
2.571ThrThr: 2.571 ± 1.917
3.085ThrVal: 3.085 ± 1.374
0.514ThrTrp: 0.514 ± 0.727
2.057ThrTyr: 2.057 ± 0.976
0.0ThrXaa: 0.0 ± 0.0
Val
6.684ValAla: 6.684 ± 1.168
3.085ValCys: 3.085 ± 0.698
4.627ValAsp: 4.627 ± 1.418
5.656ValGlu: 5.656 ± 1.26
1.542ValPhe: 1.542 ± 0.922
5.141ValGly: 5.141 ± 2.197
0.0ValHis: 0.0 ± 0.0
3.599ValIle: 3.599 ± 0.852
5.141ValLys: 5.141 ± 1.156
2.571ValLeu: 2.571 ± 0.501
1.542ValMet: 1.542 ± 0.514
3.085ValAsn: 3.085 ± 1.027
4.113ValPro: 4.113 ± 1.401
3.599ValGln: 3.599 ± 0.51
3.085ValArg: 3.085 ± 1.374
5.141ValSer: 5.141 ± 2.946
5.141ValThr: 5.141 ± 1.079
6.17ValVal: 6.17 ± 1.104
3.599ValTrp: 3.599 ± 0.781
2.057ValTyr: 2.057 ± 0.77
0.0ValXaa: 0.0 ± 0.0
Trp
1.542TrpAla: 1.542 ± 0.514
0.514TrpCys: 0.514 ± 0.325
1.028TrpAsp: 1.028 ± 0.382
1.028TrpGlu: 1.028 ± 0.65
0.514TrpPhe: 0.514 ± 0.325
0.514TrpGly: 0.514 ± 0.325
0.0TrpHis: 0.0 ± 0.0
2.057TrpIle: 2.057 ± 0.765
1.028TrpLys: 1.028 ± 0.382
2.057TrpLeu: 2.057 ± 0.39
1.028TrpMet: 1.028 ± 0.382
1.542TrpAsn: 1.542 ± 0.733
2.057TrpPro: 2.057 ± 1.088
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
4.113TrpSer: 4.113 ± 0.779
1.028TrpThr: 1.028 ± 0.382
0.514TrpVal: 0.514 ± 0.325
0.0TrpTrp: 0.0 ± 0.0
0.514TrpTyr: 0.514 ± 0.727
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.028TyrAla: 1.028 ± 0.382
1.542TyrCys: 1.542 ± 0.974
1.542TyrAsp: 1.542 ± 0.514
2.057TyrGlu: 2.057 ± 0.77
2.057TyrPhe: 2.057 ± 1.601
2.057TyrGly: 2.057 ± 0.77
1.542TyrHis: 1.542 ± 0.733
0.514TyrIle: 0.514 ± 0.727
1.028TyrLys: 1.028 ± 0.65
3.085TyrLeu: 3.085 ± 0.437
0.514TyrMet: 0.514 ± 0.325
0.514TyrAsn: 0.514 ± 0.325
1.542TyrPro: 1.542 ± 0.701
1.542TyrGln: 1.542 ± 0.815
1.542TyrArg: 1.542 ± 0.733
3.599TyrSer: 3.599 ± 0.781
2.571TyrThr: 2.571 ± 1.557
2.571TyrVal: 2.571 ± 1.048
1.028TyrTrp: 1.028 ± 0.382
1.028TyrTyr: 1.028 ± 0.382
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1946 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski