Amino acid dipepetide frequency for Colocasia bobone disease-associated virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.128AlaAla: 4.128 ± 2.862
1.101AlaCys: 1.101 ± 0.919
3.577AlaAsp: 3.577 ± 0.49
3.302AlaGlu: 3.302 ± 1.934
2.201AlaPhe: 2.201 ± 1.256
3.853AlaGly: 3.853 ± 0.924
0.275AlaHis: 0.275 ± 0.16
2.752AlaIle: 2.752 ± 0.675
5.228AlaLys: 5.228 ± 1.534
5.504AlaLeu: 5.504 ± 0.82
1.651AlaMet: 1.651 ± 1.057
1.651AlaAsn: 1.651 ± 0.916
2.477AlaPro: 2.477 ± 0.334
1.101AlaGln: 1.101 ± 0.673
2.201AlaArg: 2.201 ± 0.7
4.953AlaSer: 4.953 ± 1.163
3.577AlaThr: 3.577 ± 1.726
3.853AlaVal: 3.853 ± 1.086
0.55AlaTrp: 0.55 ± 0.487
1.101AlaTyr: 1.101 ± 0.639
0.0AlaXaa: 0.0 ± 0.0
Cys
0.275CysAla: 0.275 ± 0.16
0.0CysCys: 0.0 ± 0.0
1.101CysAsp: 1.101 ± 0.429
0.275CysGlu: 0.275 ± 0.346
1.101CysPhe: 1.101 ± 0.569
0.55CysGly: 0.55 ± 0.451
0.55CysHis: 0.55 ± 0.3
1.926CysIle: 1.926 ± 0.559
0.55CysLys: 0.55 ± 0.487
3.027CysLeu: 3.027 ± 0.878
0.55CysMet: 0.55 ± 0.319
0.55CysAsn: 0.55 ± 0.691
1.926CysPro: 1.926 ± 0.559
0.55CysGln: 0.55 ± 0.336
0.826CysArg: 0.826 ± 0.627
1.101CysSer: 1.101 ± 0.734
0.55CysThr: 0.55 ± 0.691
0.275CysVal: 0.275 ± 0.501
0.0CysTrp: 0.0 ± 0.0
0.826CysTyr: 0.826 ± 0.735
0.275CysXaa: 0.275 ± 0.346
Asp
3.302AspAla: 3.302 ± 1.652
1.101AspCys: 1.101 ± 0.411
3.302AspAsp: 3.302 ± 0.935
3.302AspGlu: 3.302 ± 0.839
3.027AspPhe: 3.027 ± 1.45
2.477AspGly: 2.477 ± 0.514
1.101AspHis: 1.101 ± 0.411
4.128AspIle: 4.128 ± 1.057
2.477AspLys: 2.477 ± 0.384
5.779AspLeu: 5.779 ± 1.988
1.926AspMet: 1.926 ± 1.448
2.201AspAsn: 2.201 ± 0.72
4.128AspPro: 4.128 ± 1.109
1.376AspGln: 1.376 ± 0.368
2.201AspArg: 2.201 ± 1.115
3.302AspSer: 3.302 ± 1.182
3.302AspThr: 3.302 ± 0.784
3.027AspVal: 3.027 ± 0.773
0.55AspTrp: 0.55 ± 0.319
2.477AspTyr: 2.477 ± 1.051
0.0AspXaa: 0.0 ± 0.0
Glu
4.403GluAla: 4.403 ± 2.32
1.101GluCys: 1.101 ± 0.761
3.302GluAsp: 3.302 ± 0.915
2.752GluGlu: 2.752 ± 0.926
2.752GluPhe: 2.752 ± 0.767
2.477GluGly: 2.477 ± 0.773
0.826GluHis: 0.826 ± 0.503
6.879GluIle: 6.879 ± 1.322
4.678GluLys: 4.678 ± 1.177
3.577GluLeu: 3.577 ± 0.704
1.376GluMet: 1.376 ± 0.537
2.201GluAsn: 2.201 ± 0.887
1.376GluPro: 1.376 ± 0.799
1.101GluGln: 1.101 ± 1.027
4.403GluArg: 4.403 ± 1.573
3.027GluSer: 3.027 ± 0.744
3.302GluThr: 3.302 ± 0.845
4.953GluVal: 4.953 ± 0.787
1.101GluTrp: 1.101 ± 0.593
2.752GluTyr: 2.752 ± 1.363
0.0GluXaa: 0.0 ± 0.0
Phe
1.926PheAla: 1.926 ± 0.657
0.55PheCys: 0.55 ± 0.3
3.577PheAsp: 3.577 ± 0.958
2.201PheGlu: 2.201 ± 0.914
1.101PhePhe: 1.101 ± 0.461
3.027PheGly: 3.027 ± 0.469
1.101PheHis: 1.101 ± 0.443
1.926PheIle: 1.926 ± 0.631
0.275PheLys: 0.275 ± 0.532
4.678PheLeu: 4.678 ± 1.423
0.826PheMet: 0.826 ± 0.476
1.651PheAsn: 1.651 ± 0.544
3.577PhePro: 3.577 ± 0.946
2.201PheGln: 2.201 ± 0.7
2.201PheArg: 2.201 ± 0.502
4.403PheSer: 4.403 ± 1.714
3.302PheThr: 3.302 ± 1.068
3.302PheVal: 3.302 ± 1.2
0.275PheTrp: 0.275 ± 0.16
1.376PheTyr: 1.376 ± 0.799
0.0PheXaa: 0.0 ± 0.0
Gly
1.926GlyAla: 1.926 ± 0.841
0.55GlyCys: 0.55 ± 0.3
3.577GlyAsp: 3.577 ± 1.201
3.577GlyGlu: 3.577 ± 0.807
2.201GlyPhe: 2.201 ± 0.537
2.752GlyGly: 2.752 ± 0.473
1.651GlyHis: 1.651 ± 0.694
3.577GlyIle: 3.577 ± 1.66
2.477GlyLys: 2.477 ± 0.849
6.879GlyLeu: 6.879 ± 1.852
2.477GlyMet: 2.477 ± 0.592
1.376GlyAsn: 1.376 ± 0.464
1.926GlyPro: 1.926 ± 0.881
0.275GlyGln: 0.275 ± 0.16
3.577GlyArg: 3.577 ± 1.062
5.779GlySer: 5.779 ± 1.082
1.651GlyThr: 1.651 ± 1.37
5.779GlyVal: 5.779 ± 1.02
1.376GlyTrp: 1.376 ± 0.799
1.651GlyTyr: 1.651 ± 0.638
0.0GlyXaa: 0.0 ± 0.0
His
0.826HisAla: 0.826 ± 0.391
0.55HisCys: 0.55 ± 0.319
0.826HisAsp: 0.826 ± 0.334
1.376HisGlu: 1.376 ± 0.603
1.101HisPhe: 1.101 ± 0.457
1.376HisGly: 1.376 ± 0.519
0.826HisHis: 0.826 ± 0.479
1.376HisIle: 1.376 ± 0.483
0.826HisLys: 0.826 ± 0.735
1.926HisLeu: 1.926 ± 0.754
0.275HisMet: 0.275 ± 0.532
0.826HisAsn: 0.826 ± 0.334
1.651HisPro: 1.651 ± 0.354
0.55HisGln: 0.55 ± 0.451
0.275HisArg: 0.275 ± 0.16
1.651HisSer: 1.651 ± 0.667
1.376HisThr: 1.376 ± 0.587
1.101HisVal: 1.101 ± 0.684
0.275HisTrp: 0.275 ± 0.16
0.826HisTyr: 0.826 ± 0.334
0.0HisXaa: 0.0 ± 0.0
Ile
3.302IleAla: 3.302 ± 0.725
0.0IleCys: 0.0 ± 0.0
3.577IleAsp: 3.577 ± 1.414
5.504IleGlu: 5.504 ± 1.534
3.577IlePhe: 3.577 ± 1.022
5.504IleGly: 5.504 ± 1.475
2.201IleHis: 2.201 ± 0.585
4.678IleIle: 4.678 ± 1.183
5.504IleLys: 5.504 ± 1.235
4.403IleLeu: 4.403 ± 1.264
1.926IleMet: 1.926 ± 0.359
3.853IleAsn: 3.853 ± 1.162
3.302IlePro: 3.302 ± 0.934
1.651IleGln: 1.651 ± 0.392
4.953IleArg: 4.953 ± 1.057
8.255IleSer: 8.255 ± 2.248
4.128IleThr: 4.128 ± 1.439
3.027IleVal: 3.027 ± 0.943
1.101IleTrp: 1.101 ± 0.429
0.826IleTyr: 0.826 ± 0.479
0.0IleXaa: 0.0 ± 0.0
Lys
3.577LysAla: 3.577 ± 1.788
1.376LysCys: 1.376 ± 1.062
2.752LysAsp: 2.752 ± 0.939
6.054LysGlu: 6.054 ± 1.506
2.477LysPhe: 2.477 ± 0.607
2.201LysGly: 2.201 ± 0.977
0.55LysHis: 0.55 ± 0.319
4.128LysIle: 4.128 ± 2.087
3.577LysLys: 3.577 ± 1.993
8.531LysLeu: 8.531 ± 1.728
1.926LysMet: 1.926 ± 0.632
2.477LysAsn: 2.477 ± 0.968
3.027LysPro: 3.027 ± 1.777
1.101LysGln: 1.101 ± 0.6
4.128LysArg: 4.128 ± 1.348
4.128LysSer: 4.128 ± 1.068
5.504LysThr: 5.504 ± 0.897
3.577LysVal: 3.577 ± 1.036
1.376LysTrp: 1.376 ± 0.577
0.826LysTyr: 0.826 ± 0.334
0.0LysXaa: 0.0 ± 0.0
Leu
3.027LeuAla: 3.027 ± 0.586
1.926LeuCys: 1.926 ± 0.841
5.228LeuAsp: 5.228 ± 0.914
4.953LeuGlu: 4.953 ± 1.038
4.403LeuPhe: 4.403 ± 0.81
6.879LeuGly: 6.879 ± 1.537
0.826LeuHis: 0.826 ± 0.479
5.504LeuIle: 5.504 ± 1.91
9.081LeuLys: 9.081 ± 1.606
8.806LeuLeu: 8.806 ± 1.358
3.577LeuMet: 3.577 ± 1.342
5.228LeuAsn: 5.228 ± 0.764
4.678LeuPro: 4.678 ± 0.685
3.302LeuGln: 3.302 ± 0.818
4.403LeuArg: 4.403 ± 1.297
11.282LeuSer: 11.282 ± 1.961
6.054LeuThr: 6.054 ± 1.926
4.678LeuVal: 4.678 ± 0.559
0.826LeuTrp: 0.826 ± 0.479
3.853LeuTyr: 3.853 ± 1.193
0.0LeuXaa: 0.0 ± 0.0
Met
3.027MetAla: 3.027 ± 0.844
0.55MetCys: 0.55 ± 0.451
0.275MetAsp: 0.275 ± 0.16
1.376MetGlu: 1.376 ± 0.589
1.651MetPhe: 1.651 ± 0.518
1.376MetGly: 1.376 ± 0.577
0.275MetHis: 0.275 ± 0.377
4.128MetIle: 4.128 ± 1.118
3.027MetLys: 3.027 ± 0.981
1.651MetLeu: 1.651 ± 0.544
0.275MetMet: 0.275 ± 0.501
1.101MetAsn: 1.101 ± 0.497
0.826MetPro: 0.826 ± 0.511
0.55MetGln: 0.55 ± 0.623
2.477MetArg: 2.477 ± 0.575
4.128MetSer: 4.128 ± 1.101
2.477MetThr: 2.477 ± 0.978
1.376MetVal: 1.376 ± 0.687
0.0MetTrp: 0.0 ± 0.0
0.826MetTyr: 0.826 ± 0.418
0.0MetXaa: 0.0 ± 0.0
Asn
1.926AsnAla: 1.926 ± 0.814
1.651AsnCys: 1.651 ± 0.898
2.752AsnAsp: 2.752 ± 0.537
1.376AsnGlu: 1.376 ± 0.589
0.826AsnPhe: 0.826 ± 0.391
0.0AsnGly: 0.0 ± 0.0
1.101AsnHis: 1.101 ± 1.188
3.302AsnIle: 3.302 ± 0.975
1.651AsnLys: 1.651 ± 0.981
4.953AsnLeu: 4.953 ± 1.648
1.651AsnMet: 1.651 ± 0.694
1.101AsnAsn: 1.101 ± 0.472
1.926AsnPro: 1.926 ± 0.953
1.926AsnGln: 1.926 ± 0.694
1.926AsnArg: 1.926 ± 0.838
3.302AsnSer: 3.302 ± 0.968
1.926AsnThr: 1.926 ± 0.572
2.201AsnVal: 2.201 ± 0.742
0.826AsnTrp: 0.826 ± 0.479
1.101AsnTyr: 1.101 ± 0.488
0.0AsnXaa: 0.0 ± 0.0
Pro
3.577ProAla: 3.577 ± 1.419
0.55ProCys: 0.55 ± 0.445
1.651ProAsp: 1.651 ± 0.723
2.752ProGlu: 2.752 ± 0.927
1.376ProPhe: 1.376 ± 0.589
2.752ProGly: 2.752 ± 0.818
0.826ProHis: 0.826 ± 0.479
2.752ProIle: 2.752 ± 1.081
3.577ProLys: 3.577 ± 0.756
5.228ProLeu: 5.228 ± 0.877
1.376ProMet: 1.376 ± 0.449
2.201ProAsn: 2.201 ± 0.941
2.752ProPro: 2.752 ± 0.639
1.926ProGln: 1.926 ± 0.969
1.651ProArg: 1.651 ± 0.694
5.504ProSer: 5.504 ± 1.248
4.128ProThr: 4.128 ± 1.488
4.128ProVal: 4.128 ± 0.944
0.55ProTrp: 0.55 ± 0.3
1.651ProTyr: 1.651 ± 0.498
0.0ProXaa: 0.0 ± 0.0
Gln
3.027GlnAla: 3.027 ± 0.469
0.55GlnCys: 0.55 ± 0.445
1.926GlnAsp: 1.926 ± 0.753
1.101GlnGlu: 1.101 ± 0.902
0.55GlnPhe: 0.55 ± 0.445
2.477GlnGly: 2.477 ± 0.873
1.101GlnHis: 1.101 ± 0.493
3.027GlnIle: 3.027 ± 1.195
0.826GlnLys: 0.826 ± 0.368
1.101GlnLeu: 1.101 ± 0.457
0.826GlnMet: 0.826 ± 0.479
0.55GlnAsn: 0.55 ± 0.336
0.55GlnPro: 0.55 ± 0.607
0.826GlnGln: 0.826 ± 0.444
1.651GlnArg: 1.651 ± 0.958
1.926GlnSer: 1.926 ± 0.629
3.302GlnThr: 3.302 ± 1.272
1.376GlnVal: 1.376 ± 0.642
1.376GlnTrp: 1.376 ± 0.554
0.55GlnTyr: 0.55 ± 0.319
0.0GlnXaa: 0.0 ± 0.0
Arg
1.926ArgAla: 1.926 ± 0.548
0.826ArgCys: 0.826 ± 0.334
4.128ArgAsp: 4.128 ± 1.789
2.752ArgGlu: 2.752 ± 1.257
2.752ArgPhe: 2.752 ± 0.699
3.027ArgGly: 3.027 ± 0.754
1.101ArgHis: 1.101 ± 0.429
1.926ArgIle: 1.926 ± 0.799
3.302ArgLys: 3.302 ± 1.033
5.228ArgLeu: 5.228 ± 1.845
1.926ArgMet: 1.926 ± 0.826
2.477ArgAsn: 2.477 ± 0.596
3.027ArgPro: 3.027 ± 1.116
1.376ArgGln: 1.376 ± 0.519
3.302ArgArg: 3.302 ± 0.691
3.302ArgSer: 3.302 ± 1.199
3.577ArgThr: 3.577 ± 0.995
3.853ArgVal: 3.853 ± 0.721
0.826ArgTrp: 0.826 ± 0.334
3.853ArgTyr: 3.853 ± 0.617
0.0ArgXaa: 0.0 ± 0.0
Ser
4.678SerAla: 4.678 ± 1.172
1.926SerCys: 1.926 ± 1.345
6.054SerAsp: 6.054 ± 3.004
4.953SerGlu: 4.953 ± 0.575
3.027SerPhe: 3.027 ± 1.346
4.403SerGly: 4.403 ± 1.453
2.201SerHis: 2.201 ± 0.82
7.705SerIle: 7.705 ± 2.264
4.678SerLys: 4.678 ± 0.513
10.182SerLeu: 10.182 ± 2.479
2.477SerMet: 2.477 ± 1.145
1.101SerAsn: 1.101 ± 0.457
4.403SerPro: 4.403 ± 1.255
3.302SerGln: 3.302 ± 0.437
6.329SerArg: 6.329 ± 1.423
6.329SerSer: 6.329 ± 2.319
4.678SerThr: 4.678 ± 0.586
4.403SerVal: 4.403 ± 1.199
1.101SerTrp: 1.101 ± 0.429
3.577SerTyr: 3.577 ± 0.907
0.0SerXaa: 0.0 ± 0.0
Thr
4.128ThrAla: 4.128 ± 1.549
0.826ThrCys: 0.826 ± 0.799
1.101ThrAsp: 1.101 ± 0.451
5.504ThrGlu: 5.504 ± 2.125
2.752ThrPhe: 2.752 ± 0.66
3.577ThrGly: 3.577 ± 0.733
0.826ThrHis: 0.826 ± 0.513
4.678ThrIle: 4.678 ± 1.206
6.054ThrLys: 6.054 ± 3.864
4.953ThrLeu: 4.953 ± 1.241
1.376ThrMet: 1.376 ± 0.584
2.201ThrAsn: 2.201 ± 0.537
3.853ThrPro: 3.853 ± 1.048
1.651ThrGln: 1.651 ± 0.713
1.926ThrArg: 1.926 ± 0.694
7.155ThrSer: 7.155 ± 1.644
3.302ThrThr: 3.302 ± 1.528
4.403ThrVal: 4.403 ± 0.734
1.101ThrTrp: 1.101 ± 0.657
1.376ThrTyr: 1.376 ± 0.614
0.275ThrXaa: 0.275 ± 0.16
Val
3.302ValAla: 3.302 ± 0.884
1.651ValCys: 1.651 ± 1.43
4.403ValAsp: 4.403 ± 1.157
3.853ValGlu: 3.853 ± 1.648
4.678ValPhe: 4.678 ± 1.269
3.853ValGly: 3.853 ± 0.73
0.55ValHis: 0.55 ± 0.319
3.027ValIle: 3.027 ± 0.754
3.027ValLys: 3.027 ± 0.969
5.779ValLeu: 5.779 ± 1.323
2.752ValMet: 2.752 ± 0.393
2.752ValAsn: 2.752 ± 1.387
3.027ValPro: 3.027 ± 1.148
1.926ValGln: 1.926 ± 1.036
3.027ValArg: 3.027 ± 0.414
3.853ValSer: 3.853 ± 0.979
4.403ValThr: 4.403 ± 0.89
3.853ValVal: 3.853 ± 1.224
0.55ValTrp: 0.55 ± 0.396
2.201ValTyr: 2.201 ± 0.743
0.0ValXaa: 0.0 ± 0.0
Trp
0.275TrpAla: 0.275 ± 0.346
0.0TrpCys: 0.0 ± 0.0
0.826TrpAsp: 0.826 ± 0.334
1.101TrpGlu: 1.101 ± 0.6
0.826TrpPhe: 0.826 ± 0.479
0.275TrpGly: 0.275 ± 0.16
0.275TrpHis: 0.275 ± 0.16
1.651TrpIle: 1.651 ± 0.554
1.101TrpLys: 1.101 ± 0.457
1.651TrpLeu: 1.651 ± 0.783
0.55TrpMet: 0.55 ± 0.319
0.826TrpAsn: 0.826 ± 0.479
0.0TrpPro: 0.0 ± 0.0
0.275TrpGln: 0.275 ± 0.16
1.101TrpArg: 1.101 ± 0.639
0.826TrpSer: 0.826 ± 0.479
0.826TrpThr: 0.826 ± 0.418
1.101TrpVal: 1.101 ± 0.429
0.0TrpTrp: 0.0 ± 0.0
0.275TrpTyr: 0.275 ± 0.439
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.477TyrAla: 2.477 ± 0.827
0.0TyrCys: 0.0 ± 0.0
0.55TyrAsp: 0.55 ± 0.3
0.275TyrGlu: 0.275 ± 0.16
1.376TyrPhe: 1.376 ± 0.772
2.201TyrGly: 2.201 ± 0.977
1.651TyrHis: 1.651 ± 0.635
2.201TyrIle: 2.201 ± 0.849
1.376TyrLys: 1.376 ± 0.464
4.128TyrLeu: 4.128 ± 1.214
1.376TyrMet: 1.376 ± 1.079
1.101TyrAsn: 1.101 ± 0.347
2.477TyrPro: 2.477 ± 0.603
1.651TyrGln: 1.651 ± 0.456
1.926TyrArg: 1.926 ± 0.572
3.302TyrSer: 3.302 ± 0.502
1.651TyrThr: 1.651 ± 0.498
2.201TyrVal: 2.201 ± 0.72
0.0TyrTrp: 0.0 ± 0.0
1.926TyrTyr: 1.926 ± 0.907
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.55XaaLeu: 0.55 ± 0.3
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3635 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski