Amino acid dipeptide frequency for Southern bean mosaic virus (isolate Bean/United States/Arkansas) (SBMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25%. As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 7.692‰ = 0.007692).
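The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function names (`dipeptide_per_mille`, `classify`), the use of one-letter residue codes, and the choice to count overlapping adjacent pairs within each protein are all assumptions, and the input sequences are toy strings rather than SBMV proteins.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Assumption: dipeptides are overlapping adjacent residue pairs counted
    within each protein; any non-standard symbol is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        # Overlapping pairs at positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


# Toy input (hypothetical sequences, for illustration only).
toy = ["MAAKL", "AAKX"]
freqs = dipeptide_per_mille(toy)

# Converting a tabulated per mille value back to a fraction.
fraction = 7.692 * 1e-3
```

With the toy input, seven dipeptides are counted in total and `AA` occurs twice. Applied to values from the table, `classify(7.692)` gives `"over"` and `classify(0.513)` gives `"under"`.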

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.692 ± 1.069
AlaCys: 1.026 ± 0.641
AlaAsp: 2.564 ± 0.513
AlaGlu: 4.103 ± 1.926
AlaPhe: 1.538 ± 0.654
AlaGly: 4.103 ± 0.759
AlaHis: 0.513 ± 0.747
AlaIle: 4.615 ± 1.439
AlaLys: 4.103 ± 0.735
AlaLeu: 4.615 ± 0.844
AlaMet: 1.026 ± 0.629
AlaAsn: 3.59 ± 1.591
AlaPro: 4.103 ± 0.901
AlaGln: 4.103 ± 0.735
AlaArg: 2.564 ± 1.183
AlaSer: 9.231 ± 2.145
AlaThr: 5.641 ± 0.901
AlaVal: 3.59 ± 0.803
AlaTrp: 3.077 ± 0.666
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.051 ± 0.593
CysCys: 0.513 ± 0.32
CysAsp: 2.564 ± 1.271
CysGlu: 1.026 ± 0.707
CysPhe: 0.513 ± 0.32
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.026 ± 0.707
CysLys: 1.538 ± 0.961
CysLeu: 2.564 ± 0.513
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 3.59 ± 0.694
CysGln: 1.026 ± 0.346
CysArg: 2.564 ± 1.271
CysSer: 2.564 ± 0.693
CysThr: 0.513 ± 0.32
CysVal: 1.026 ± 0.346
CysTrp: 0.513 ± 0.32
CysTyr: 3.077 ± 0.67
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.077 ± 0.666
AspCys: 1.026 ± 0.346
AspAsp: 7.179 ± 2.125
AspGlu: 2.051 ± 0.704
AspPhe: 5.128 ± 1.805
AspGly: 3.59 ± 0.484
AspHis: 1.026 ± 0.346
AspIle: 2.564 ± 0.739
AspLys: 2.564 ± 0.693
AspLeu: 3.59 ± 1.164
AspMet: 0.513 ± 0.681
AspAsn: 1.026 ± 0.346
AspPro: 1.538 ± 0.453
AspGln: 1.026 ± 0.707
AspArg: 0.513 ± 0.32
AspSer: 1.538 ± 0.453
AspThr: 1.538 ± 1.271
AspVal: 3.077 ± 0.636
AspTrp: 2.051 ± 0.704
AspTyr: 1.026 ± 0.641
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.615 ± 1.671
GluCys: 0.513 ± 0.49
GluAsp: 4.615 ± 1.358
GluGlu: 3.59 ± 0.745
GluPhe: 2.564 ± 0.739
GluGly: 2.564 ± 1.602
GluHis: 1.026 ± 0.641
GluIle: 3.59 ± 0.745
GluLys: 3.59 ± 1.533
GluLeu: 7.179 ± 1.037
GluMet: 0.513 ± 0.32
GluAsn: 0.513 ± 0.747
GluPro: 4.103 ± 0.759
GluGln: 0.513 ± 0.32
GluArg: 2.564 ± 1.602
GluSer: 4.615 ± 2.357
GluThr: 6.154 ± 2.618
GluVal: 4.615 ± 1.18
GluTrp: 0.513 ± 0.32
GluTyr: 0.513 ± 0.747
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.077 ± 1.039
PheCys: 1.538 ± 0.804
PheAsp: 3.59 ± 1.139
PheGlu: 3.077 ± 0.666
PhePhe: 0.513 ± 0.32
PheGly: 2.564 ± 0.693
PheHis: 0.513 ± 0.747
PheIle: 1.026 ± 0.346
PheLys: 1.538 ± 0.73
PheLeu: 2.564 ± 0.733
PheMet: 0.513 ± 0.32
PheAsn: 1.026 ± 0.707
PhePro: 0.513 ± 0.32
PheGln: 1.026 ± 0.629
PheArg: 1.538 ± 0.804
PheSer: 2.564 ± 0.693
PheThr: 2.564 ± 0.959
PheVal: 4.103 ± 1.186
PheTrp: 0.0 ± 0.0
PheTyr: 3.077 ± 1.309
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.051 ± 0.704
GlyCys: 1.026 ± 0.346
GlyAsp: 4.615 ± 0.932
GlyGlu: 2.564 ± 1.48
GlyPhe: 4.615 ± 1.064
GlyGly: 5.641 ± 0.946
GlyHis: 2.051 ± 0.999
GlyIle: 2.051 ± 0.692
GlyLys: 7.692 ± 1.585
GlyLeu: 1.538 ± 0.73
GlyMet: 2.051 ± 0.704
GlyAsn: 0.513 ± 0.32
GlyPro: 3.077 ± 1.039
GlyGln: 1.538 ± 1.271
GlyArg: 4.103 ± 0.735
GlySer: 9.744 ± 1.326
GlyThr: 4.615 ± 1.565
GlyVal: 8.205 ± 1.487
GlyTrp: 2.051 ± 0.704
GlyTyr: 2.564 ± 0.513
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.513 ± 0.32
HisCys: 0.513 ± 0.747
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.513 ± 0.747
HisGly: 1.538 ± 0.453
HisHis: 1.026 ± 1.495
HisIle: 0.513 ± 0.32
HisLys: 2.051 ± 0.704
HisLeu: 0.513 ± 0.32
HisMet: 0.513 ± 0.681
HisAsn: 0.0 ± 0.0
HisPro: 1.026 ± 0.346
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 4.103 ± 0.967
HisThr: 1.538 ± 0.453
HisVal: 3.59 ± 1.062
HisTrp: 0.513 ± 0.32
HisTyr: 0.513 ± 0.32
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.077 ± 2.542
IleCys: 1.538 ± 0.654
IleAsp: 1.538 ± 0.654
IleGlu: 5.641 ± 0.946
IlePhe: 0.513 ± 0.747
IleGly: 3.077 ± 0.666
IleHis: 1.538 ± 0.48
IleIle: 1.026 ± 1.085
IleLys: 2.051 ± 0.692
IleLeu: 2.564 ± 1.314
IleMet: 1.026 ± 0.333
IleAsn: 2.051 ± 0.379
IlePro: 4.615 ± 1.396
IleGln: 1.026 ± 1.085
IleArg: 1.538 ± 0.961
IleSer: 2.564 ± 0.995
IleThr: 2.051 ± 1.126
IleVal: 2.564 ± 0.668
IleTrp: 0.0 ± 0.0
IleTyr: 1.026 ± 0.346
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.128 ± 1.331
LysCys: 0.0 ± 0.0
LysAsp: 3.077 ± 1.301
LysGlu: 1.538 ± 0.786
LysPhe: 1.026 ± 0.641
LysGly: 1.026 ± 0.346
LysHis: 1.538 ± 0.453
LysIle: 2.564 ± 1.111
LysLys: 2.051 ± 0.738
LysLeu: 4.615 ± 1.396
LysMet: 2.051 ± 0.692
LysAsn: 0.0 ± 0.0
LysPro: 3.077 ± 1.039
LysGln: 3.59 ± 1.364
LysArg: 3.077 ± 0.364
LysSer: 4.615 ± 1.215
LysThr: 3.077 ± 0.364
LysVal: 3.59 ± 1.062
LysTrp: 1.538 ± 0.974
LysTyr: 4.103 ± 0.353
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.641 ± 1.108
LeuCys: 2.051 ± 0.935
LeuAsp: 4.103 ± 1.748
LeuGlu: 5.128 ± 0.291
LeuPhe: 5.128 ± 1.456
LeuGly: 7.692 ± 1.965
LeuHis: 0.513 ± 0.32
LeuIle: 5.128 ± 0.658
LeuLys: 2.051 ± 0.692
LeuLeu: 10.256 ± 1.625
LeuMet: 0.513 ± 0.32
LeuAsn: 3.077 ± 0.899
LeuPro: 6.154 ± 0.583
LeuGln: 3.077 ± 1.097
LeuArg: 6.667 ± 1.967
LeuSer: 11.282 ± 1.201
LeuThr: 3.59 ± 1.849
LeuVal: 5.641 ± 0.742
LeuTrp: 1.538 ± 0.453
LeuTyr: 2.051 ± 0.379
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.051 ± 1.94
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 2.051 ± 1.258
MetHis: 1.026 ± 0.346
MetIle: 0.0 ± 0.0
MetLys: 1.026 ± 0.346
MetLeu: 2.051 ± 0.379
MetMet: 1.026 ± 0.346
MetAsn: 0.513 ± 0.32
MetPro: 1.026 ± 0.629
MetGln: 1.026 ± 0.346
MetArg: 2.051 ± 0.704
MetSer: 0.513 ± 0.747
MetThr: 1.026 ± 0.346
MetVal: 1.538 ± 0.48
MetTrp: 0.0 ± 0.0
MetTyr: 1.026 ± 0.346
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.051 ± 1.126
AsnCys: 1.026 ± 0.843
AsnAsp: 0.0 ± 0.0
AsnGlu: 1.538 ± 0.453
AsnPhe: 0.0 ± 0.0
AsnGly: 2.051 ± 0.833
AsnHis: 1.026 ± 0.346
AsnIle: 0.513 ± 0.681
AsnLys: 0.513 ± 0.32
AsnLeu: 4.103 ± 0.735
AsnMet: 0.513 ± 0.606
AsnAsn: 1.538 ± 0.48
AsnPro: 2.051 ± 0.692
AsnGln: 2.051 ± 0.935
AsnArg: 2.564 ± 0.486
AsnSer: 2.051 ± 0.704
AsnThr: 1.026 ± 0.629
AsnVal: 2.051 ± 0.833
AsnTrp: 0.513 ± 0.681
AsnTyr: 1.538 ± 0.786
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.128 ± 0.658
ProCys: 1.026 ± 0.346
ProAsp: 0.513 ± 0.32
ProGlu: 4.615 ± 1.358
ProPhe: 3.59 ± 0.694
ProGly: 6.667 ± 1.912
ProHis: 1.538 ± 0.961
ProIle: 2.564 ± 0.486
ProLys: 3.077 ± 1.039
ProLeu: 5.641 ± 1.903
ProMet: 0.0 ± 0.0
ProAsn: 0.513 ± 0.49
ProPro: 5.641 ± 2.677
ProGln: 3.077 ± 0.67
ProArg: 2.564 ± 1.051
ProSer: 8.718 ± 1.671
ProThr: 3.59 ± 0.694
ProVal: 3.59 ± 1.334
ProTrp: 0.513 ± 0.32
ProTyr: 2.051 ± 0.379
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.564 ± 0.668
GlnCys: 1.026 ± 1.495
GlnAsp: 0.513 ± 0.32
GlnGlu: 5.128 ± 1.63
GlnPhe: 1.026 ± 0.346
GlnGly: 1.538 ± 0.48
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 0.0 ± 0.0
GlnLeu: 5.641 ± 2.277
GlnMet: 0.513 ± 0.49
GlnAsn: 3.077 ± 0.903
GlnPro: 2.564 ± 1.158
GlnGln: 1.538 ± 0.73
GlnArg: 0.513 ± 0.49
GlnSer: 4.615 ± 0.721
GlnThr: 1.538 ± 0.786
GlnVal: 3.077 ± 0.67
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.513 ± 0.681
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.538 ± 0.654
ArgCys: 2.051 ± 0.935
ArgAsp: 0.513 ± 0.32
ArgGlu: 3.077 ± 0.767
ArgPhe: 2.564 ± 1.48
ArgGly: 8.205 ± 0.286
ArgHis: 1.026 ± 0.641
ArgIle: 2.051 ± 0.999
ArgLys: 2.051 ± 0.692
ArgLeu: 7.692 ± 2.214
ArgMet: 0.513 ± 0.32
ArgAsn: 1.538 ± 0.48
ArgPro: 2.051 ± 0.738
ArgGln: 1.538 ± 0.974
ArgArg: 5.128 ± 3.241
ArgSer: 3.59 ± 0.694
ArgThr: 2.564 ± 0.486
ArgVal: 4.615 ± 1.25
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.538 ± 0.73
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.667 ± 1.63
SerCys: 4.103 ± 0.6
SerAsp: 3.077 ± 0.905
SerGlu: 3.59 ± 1.12
SerPhe: 4.103 ± 1.385
SerGly: 8.718 ± 1.573
SerHis: 1.538 ± 0.453
SerIle: 2.564 ± 0.668
SerLys: 5.641 ± 0.616
SerLeu: 12.308 ± 2.149
SerMet: 1.538 ± 0.669
SerAsn: 2.564 ± 0.486
SerPro: 6.667 ± 1.912
SerGln: 2.564 ± 1.111
SerArg: 5.128 ± 0.291
SerSer: 9.744 ± 0.479
SerThr: 6.154 ± 3.378
SerVal: 7.179 ± 1.037
SerTrp: 3.077 ± 0.905
SerTyr: 2.564 ± 0.668
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.179 ± 2.603
ThrCys: 1.538 ± 0.453
ThrAsp: 0.513 ± 0.681
ThrGlu: 2.051 ± 1.618
ThrPhe: 0.0 ± 0.0
ThrGly: 4.103 ± 1.163
ThrHis: 1.538 ± 0.48
ThrIle: 2.564 ± 0.668
ThrLys: 3.077 ± 1.419
ThrLeu: 6.154 ± 1.14
ThrMet: 1.538 ± 1.295
ThrAsn: 2.051 ± 0.935
ThrPro: 5.641 ± 1.301
ThrGln: 2.564 ± 1.183
ThrArg: 4.615 ± 0.402
ThrSer: 4.103 ± 2.127
ThrThr: 3.59 ± 2.351
ThrVal: 3.077 ± 0.67
ThrTrp: 0.513 ± 0.681
ThrTyr: 1.538 ± 0.48
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.154 ± 1.239
ValCys: 3.59 ± 1.207
ValAsp: 5.128 ± 1.386
ValGlu: 5.641 ± 1.193
ValPhe: 1.026 ± 0.641
ValGly: 4.615 ± 0.788
ValHis: 0.513 ± 0.32
ValIle: 4.103 ± 1.163
ValLys: 4.103 ± 1.176
ValLeu: 2.564 ± 0.513
ValMet: 1.538 ± 0.48
ValAsn: 4.103 ± 1.444
ValPro: 3.077 ± 1.366
ValGln: 2.564 ± 0.733
ValArg: 5.641 ± 1.264
ValSer: 5.128 ± 2.784
ValThr: 4.103 ± 0.901
ValVal: 4.615 ± 1.396
ValTrp: 3.59 ± 0.694
ValTyr: 2.051 ± 0.704
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.538 ± 0.453
TrpCys: 0.513 ± 0.32
TrpAsp: 0.513 ± 0.747
TrpGlu: 2.051 ± 0.704
TrpPhe: 0.513 ± 0.32
TrpGly: 0.513 ± 0.32
TrpHis: 0.0 ± 0.0
TrpIle: 2.051 ± 0.692
TrpLys: 1.026 ± 0.346
TrpLeu: 2.051 ± 0.379
TrpMet: 1.026 ± 0.346
TrpAsn: 1.026 ± 0.346
TrpPro: 2.051 ± 0.999
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 5.128 ± 0.658
TrpThr: 0.0 ± 0.0
TrpVal: 0.513 ± 0.32
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.513 ± 0.681
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 2.051 ± 0.999
TyrAsp: 1.026 ± 0.629
TyrGlu: 2.051 ± 0.704
TyrPhe: 2.051 ± 1.361
TyrGly: 2.051 ± 0.704
TyrHis: 1.026 ± 0.346
TyrIle: 0.513 ± 0.681
TyrLys: 1.538 ± 0.453
TyrLeu: 3.077 ± 0.364
TyrMet: 0.513 ± 0.32
TyrAsn: 0.0 ± 0.0
TyrPro: 2.051 ± 0.961
TyrGln: 1.538 ± 0.786
TyrArg: 0.513 ± 0.747
TyrSer: 3.59 ± 0.694
TyrThr: 3.077 ± 2.002
TyrVal: 3.59 ± 0.803
TyrTrp: 1.026 ± 0.346
TyrTyr: 1.538 ± 0.654
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1951 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
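As a rough illustration of protein-level bootstrapping, the sketch below resamples whole proteins with replacement, recomputes the frequency of one dipeptide for each resample, and reports the standard deviation over the resamples. The details are assumptions made for the example rather than the database's documented procedure: the function names, the use of 100 replicates to match "×100", the population standard deviation as the error measure, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) pooled over the given proteins."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of the dipeptide frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteins (hypothetical one-letter sequences, not SBMV data).
toy = ["MAAKL", "AAKX", "GGGG", "LLAA"]
error_aa = bootstrap_error(toy, "AA")
error_ww = bootstrap_error(toy, "WW")  # dipeptide absent from every protein
```

Under this scheme a dipeptide that occurs in none of the proteins has an error of exactly zero, which is consistent with the `0.0 ± 0.0` entries in the table. With only 4 proteins in the proteome, each resample contains few distinct proteins, so large errors relative to the value are to be expected.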

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski