Amino acid dipeptide frequency for Dioscorea bacilliform virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
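The idea can be illustrated with a minimal Python sketch. It assumes that overlapping residue pairs are counted within each protein and normalised by the total number of pairs; this is an assumption for illustration, not necessarily the procedure used to build the table. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes of the table.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for one-letter protein sequences.

    Assumption: overlapping pairs are counted inside each sequence only
    (no pair spans two proteins) and normalised by the total pair count.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral sequences.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

Here the two toy sequences contain six overlapping pairs in total, two of which are KK, so `freqs["KK"]` is one third of 1000 ‰.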

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
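As a small worked example of the arithmetic above, the following Python sketch expresses the uniform expectation of 1/400 in per mille and compares table values against it. The names `EXPECTED_PER_MILLE`, `representation` and `per_mille_to_fraction` are illustrative, and the simple greater-than or less-than comparison is an assumption; the exact rule used to colour the table is not stated here.

```python
# Uniform expectation for 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def representation(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a per-mille frequency against the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


def per_mille_to_fraction(freq_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return freq_per_mille * 1e-3


# Two values taken from the table.
print(representation(13.075))  # GluGlu -> over
print(representation(0.451))   # AlaCys -> under
```

If Xaa were included as a 21st residue, the expectation would instead be 1000/441 ‰, which can be passed through the `expected` parameter.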

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.861 ± 1.608 | 0.451 ± 0.216 | 2.254 ± 1.078 | 6.312 ± 4.109 | 3.156 ± 1.509 | 3.156 ± 1.362 | 0.451 ± 0.216 | 4.058 ± 1.314 | 0.902 ± 0.431 | 5.41 ± 2.809 | 1.803 ± 0.862 | 0.902 ± 1.716 | 2.254 ± 1.078 | 4.509 ± 1.313 | 2.254 ± 1.078 | 2.705 ± 1.435 | 1.803 ± 1.443 | 4.058 ± 1.245 | 0.451 ± 0.216 | 1.803 ± 0.862 | 0.0 ± 0.0 |
| Cys | 0.902 ± 0.431 | 0.902 ± 0.431 | 0.0 ± 0.0 | 0.451 ± 0.216 | 0.902 ± 0.431 | 1.803 ± 1.657 | 0.0 ± 0.0 | 0.451 ± 0.216 | 2.705 ± 1.294 | 1.353 ± 1.57 | 0.451 ± 0.216 | 1.353 ± 0.647 | 1.353 ± 0.647 | 1.353 ± 0.647 | 0.451 ± 0.216 | 0.451 ± 0.216 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.353 ± 1.57 | 0.0 ± 0.0 |
| Asp | 4.058 ± 1.941 | 0.902 ± 0.431 | 6.312 ± 3.019 | 5.861 ± 1.608 | 2.254 ± 1.078 | 2.705 ± 1.294 | 0.451 ± 0.216 | 2.254 ± 1.337 | 1.803 ± 3.9 | 7.665 ± 5.125 | 0.0 ± 0.0 | 4.509 ± 1.313 | 4.058 ± 1.314 | 1.353 ± 1.57 | 3.156 ± 1.218 | 1.803 ± 0.862 | 2.254 ± 1.078 | 0.0 ± 0.0 | 0.902 ± 0.431 | 3.156 ± 1.509 | 0.0 ± 0.0 |
| Glu | 3.607 ± 1.321 | 0.902 ± 0.431 | 9.468 ± 1.28 | 13.075 ± 2.734 | 2.705 ± 1.294 | 5.861 ± 2.803 | 2.254 ± 1.078 | 4.058 ± 1.941 | 7.214 ± 4.692 | 5.861 ± 2.613 | 0.902 ± 0.431 | 3.156 ± 3.824 | 4.058 ± 1.314 | 4.058 ± 3.413 | 4.509 ± 3.21 | 4.058 ± 1.314 | 2.254 ± 3.282 | 7.214 ± 2.425 | 1.353 ± 0.647 | 2.254 ± 1.078 | 0.0 ± 0.0 |
| Phe | 1.353 ± 0.647 | 0.902 ± 0.431 | 1.803 ± 0.862 | 2.254 ± 1.078 | 0.902 ± 0.431 | 0.902 ± 0.431 | 0.902 ± 0.431 | 3.607 ± 1.725 | 2.254 ± 1.337 | 2.705 ± 1.294 | 0.451 ± 0.216 | 2.254 ± 1.078 | 0.451 ± 0.216 | 1.353 ± 0.647 | 2.705 ± 1.294 | 3.156 ± 1.509 | 4.058 ± 1.941 | 1.803 ± 1.657 | 0.451 ± 0.216 | 0.902 ± 0.431 | 0.0 ± 0.0 |
| Gly | 4.509 ± 1.342 | 0.902 ± 0.431 | 3.156 ± 1.362 | 5.41 ± 2.587 | 1.803 ± 1.657 | 2.254 ± 1.078 | 1.353 ± 0.647 | 3.607 ± 1.725 | 4.058 ± 1.314 | 3.156 ± 1.509 | 1.353 ± 0.647 | 2.254 ± 1.078 | 1.353 ± 0.647 | 1.803 ± 0.862 | 3.156 ± 1.509 | 2.705 ± 1.435 | 4.058 ± 1.941 | 4.058 ± 1.314 | 1.353 ± 0.647 | 2.254 ± 1.078 | 0.0 ± 0.0 |
| His | 0.451 ± 0.216 | 0.451 ± 0.216 | 0.902 ± 0.431 | 1.803 ± 1.657 | 1.803 ± 1.443 | 0.451 ± 0.216 | 0.451 ± 0.216 | 1.353 ± 0.647 | 1.353 ± 1.57 | 2.254 ± 1.337 | 0.902 ± 0.431 | 0.451 ± 0.216 | 1.353 ± 0.647 | 1.803 ± 0.862 | 1.353 ± 0.647 | 0.902 ± 0.431 | 0.451 ± 0.216 | 1.353 ± 0.647 | 0.451 ± 0.216 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ile | 4.959 ± 2.591 | 1.803 ± 1.443 | 4.509 ± 1.369 | 5.41 ± 2.587 | 1.803 ± 0.862 | 5.41 ± 2.587 | 3.156 ± 1.218 | 6.763 ± 4.604 | 5.861 ± 0.725 | 6.312 ± 2.436 | 0.902 ± 0.431 | 1.353 ± 0.647 | 5.861 ± 1.608 | 4.959 ± 1.154 | 3.607 ± 1.321 | 4.959 ± 1.403 | 1.803 ± 0.862 | 1.803 ± 1.443 | 0.451 ± 1.875 | 1.353 ± 0.647 | 0.0 ± 0.0 |
| Lys | 4.058 ± 1.245 | 1.353 ± 0.647 | 3.156 ± 1.362 | 5.41 ± 4.823 | 2.705 ± 1.294 | 6.312 ± 4.719 | 2.254 ± 1.337 | 8.115 ± 3.741 | 8.115 ± 8.799 | 9.468 ± 4.643 | 2.254 ± 1.078 | 3.607 ± 2.885 | 4.058 ± 3.186 | 1.353 ± 4.954 | 3.156 ± 1.509 | 4.509 ± 2.156 | 2.254 ± 1.078 | 5.41 ± 5.417 | 0.902 ± 0.431 | 0.902 ± 1.716 | 0.0 ± 0.0 |
| Leu | 3.607 ± 6.863 | 0.902 ± 0.431 | 3.607 ± 1.213 | 7.214 ± 0.124 | 1.803 ± 0.862 | 4.509 ± 1.342 | 1.353 ± 1.796 | 3.607 ± 1.213 | 11.722 ± 7.241 | 5.41 ± 2.869 | 0.451 ± 0.216 | 2.705 ± 1.294 | 4.959 ± 2.372 | 6.312 ± 3.019 | 4.058 ± 3.186 | 5.41 ± 0.939 | 6.763 ± 10.859 | 8.566 ± 7.266 | 0.451 ± 0.216 | 1.803 ± 0.862 | 0.0 ± 0.0 |
| Met | 0.902 ± 0.431 | 0.451 ± 0.216 | 1.353 ± 0.647 | 1.353 ± 0.647 | 0.902 ± 0.431 | 0.902 ± 0.431 | 0.451 ± 0.216 | 0.902 ± 0.431 | 2.254 ± 1.078 | 0.451 ± 0.216 | 0.451 ± 0.216 | 0.902 ± 0.431 | 0.451 ± 0.216 | 2.705 ± 1.294 | 0.902 ± 0.431 | 2.254 ± 2.445 | 0.902 ± 0.431 | 1.353 ± 0.647 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asn | 1.353 ± 0.647 | 0.451 ± 0.216 | 2.254 ± 1.535 | 4.509 ± 1.313 | 1.803 ± 0.862 | 2.705 ± 1.294 | 0.902 ± 0.431 | 4.959 ± 3.009 | 4.509 ± 2.156 | 3.607 ± 3.618 | 0.451 ± 0.216 | 3.607 ± 1.799 | 3.607 ± 1.213 | 1.353 ± 0.647 | 1.803 ± 1.443 | 1.353 ± 0.647 | 2.705 ± 1.435 | 3.607 ± 1.725 | 0.902 ± 0.431 | 1.353 ± 0.647 | 0.0 ± 0.0 |
| Pro | 4.058 ± 1.314 | 0.451 ± 0.216 | 2.705 ± 1.294 | 2.705 ± 1.294 | 3.156 ± 1.509 | 2.705 ± 1.294 | 0.902 ± 0.431 | 1.353 ± 1.796 | 4.959 ± 3.009 | 3.607 ± 1.799 | 2.254 ± 1.004 | 4.058 ± 1.941 | 4.058 ± 1.314 | 1.803 ± 0.862 | 2.254 ± 1.078 | 5.41 ± 1.494 | 1.803 ± 0.862 | 1.803 ± 0.862 | 0.0 ± 0.0 | 1.353 ± 0.647 | 0.0 ± 0.0 |
| Gln | 1.353 ± 0.647 | 0.451 ± 0.216 | 0.902 ± 0.431 | 5.41 ± 0.939 | 1.803 ± 0.862 | 3.156 ± 1.509 | 1.353 ± 0.647 | 4.058 ± 2.774 | 0.902 ± 0.431 | 4.058 ± 3.186 | 2.254 ± 1.071 | 3.607 ± 1.321 | 1.803 ± 1.443 | 3.607 ± 1.213 | 2.254 ± 1.078 | 1.803 ± 0.862 | 3.607 ± 1.213 | 3.607 ± 1.213 | 0.451 ± 0.216 | 1.803 ± 0.862 | 0.0 ± 0.0 |
| Arg | 3.607 ± 1.725 | 0.902 ± 0.431 | 1.803 ± 0.862 | 3.607 ± 1.799 | 2.254 ± 1.078 | 1.803 ± 0.862 | 0.451 ± 0.216 | 4.509 ± 1.313 | 3.607 ± 3.313 | 6.763 ± 0.305 | 1.353 ± 0.647 | 2.705 ± 1.294 | 1.803 ± 1.443 | 1.803 ± 0.862 | 3.156 ± 1.509 | 3.607 ± 1.725 | 2.705 ± 1.294 | 3.607 ± 1.725 | 0.902 ± 0.431 | 0.902 ± 0.431 | 0.0 ± 0.0 |
| Ser | 1.803 ± 0.862 | 0.902 ± 0.431 | 3.156 ± 1.218 | 4.058 ± 1.314 | 1.353 ± 0.647 | 4.509 ± 2.156 | 0.902 ± 0.431 | 4.509 ± 2.156 | 6.312 ± 5.049 | 5.861 ± 2.789 | 0.902 ± 1.173 | 1.803 ± 1.443 | 2.705 ± 1.294 | 2.254 ± 1.535 | 3.156 ± 1.509 | 4.509 ± 4.891 | 6.312 ± 1.829 | 3.607 ± 3.313 | 1.353 ± 0.647 | 3.156 ± 3.449 | 0.0 ± 0.0 |
| Thr | 2.705 ± 1.294 | 1.353 ± 0.647 | 2.705 ± 1.261 | 4.509 ± 1.369 | 1.353 ± 0.647 | 2.254 ± 1.078 | 0.451 ± 0.216 | 5.861 ± 0.725 | 3.607 ± 1.213 | 4.959 ± 1.154 | 0.451 ± 0.216 | 2.705 ± 3.593 | 1.803 ± 0.862 | 1.803 ± 0.862 | 0.902 ± 0.431 | 4.959 ± 2.591 | 5.41 ± 1.494 | 3.607 ± 1.213 | 0.902 ± 1.716 | 2.254 ± 1.078 | 0.0 ± 0.0 |
| Val | 3.156 ± 2.015 | 1.353 ± 1.57 | 2.705 ± 1.435 | 4.959 ± 8.198 | 1.803 ± 0.862 | 1.353 ± 0.647 | 1.803 ± 1.443 | 3.607 ± 1.321 | 4.058 ± 3.413 | 2.705 ± 1.435 | 1.353 ± 0.647 | 2.705 ± 1.294 | 3.607 ± 1.725 | 3.156 ± 1.509 | 5.41 ± 2.587 | 5.861 ± 7.517 | 4.959 ± 2.372 | 0.902 ± 1.716 | 0.0 ± 0.0 | 2.705 ± 1.294 | 0.0 ± 0.0 |
| Trp | 0.451 ± 0.216 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.803 ± 1.443 | 0.0 ± 0.0 | 0.902 ± 0.431 | 0.0 ± 0.0 | 1.353 ± 0.647 | 1.353 ± 0.647 | 0.902 ± 0.431 | 0.0 ± 0.0 | 0.451 ± 0.216 | 0.0 ± 0.0 | 0.451 ± 0.216 | 1.803 ± 0.862 | 0.451 ± 0.216 | 0.451 ± 0.216 | 0.902 ± 0.431 | 0.0 ± 0.0 | 0.451 ± 1.875 | 0.0 ± 0.0 |
| Tyr | 1.803 ± 1.443 | 0.451 ± 2.115 | 2.705 ± 1.261 | 2.705 ± 1.294 | 0.902 ± 0.431 | 0.902 ± 0.431 | 0.451 ± 0.216 | 4.058 ± 1.941 | 1.803 ± 0.862 | 2.705 ± 1.261 | 0.451 ± 0.216 | 2.705 ± 1.294 | 1.803 ± 0.862 | 0.451 ± 0.216 | 1.803 ± 0.862 | 2.705 ± 1.294 | 0.0 ± 0.0 | 0.902 ± 1.95 | 0.451 ± 0.216 | 0.902 ± 0.431 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3 proteins (2219 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
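Protein-level bootstrapping can be sketched as follows in Python: whole proteins are drawn with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported as the error. This is a minimal sketch under stated assumptions, not the database's actual code: the function names, the use of the sample standard deviation as the error, the fixed seed and the toy sequences are all illustrative.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Per-mille frequency of one dipeptide over a set of sequences."""
    counts = Counter(
        seq[i:i + 2] for seq in sequences for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


proteins = ["MKKLAEE", "AKKEEL", "GEEKK"]  # toy sequences, not real data
error = bootstrap_error(proteins, "KK")
```

With only a handful of proteins, as in this proteome, each resample contains many repeated proteins, so the estimated errors can be large relative to the frequencies themselves. A dipeptide that never occurs yields a frequency of zero in every resample and hence an error of zero.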

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski