Amino acid dipeptide frequency for Cacao virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
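As a minimal sketch of how such a frequency could be computed, the Python function below counts overlapping residue pairs within each protein and reports them in per mille. The function name dipeptide_frequencies, the toy sequences, and the choice to count overlapping pairs without crossing protein boundaries are assumptions made for illustration; the exact procedure used for this table is not described on this page.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids (one-letter codes)


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Residues outside the 20 standard letters are mapped to 'X' (Xaa).
    Pairs are counted within each sequence only (an assumption).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Cacao virus proteome.
freqs = dipeptide_frequencies(["MKLLSA", "LLSB"])
```

The frequencies returned for a non-empty input sum to 1000 per mille, which is a quick sanity check on any such table.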

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
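The arithmetic above can be checked with a short Python sketch. It computes the uniform expectation (1/400, or 1/441 with Xaa), converts per mille values to fractions, and labels a value as over- or underrepresented relative to that expectation. The helper names and the plain greater-than/less-than rule are illustrative assumptions; the page does not state whether the colouring also takes the error estimate into account. The three sample values are taken from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value * 1e-3


def classify(value, alphabet_size=20):
    """Label a per mille value relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Sample values from the table (per mille).
sample = {"SerLeu": 10.23, "AlaAla": 4.348, "HisHis": 0.256}
labels = {pair: classify(v) for pair, v in sample.items()}
```

With 20 amino acids the expectation is 2.5‰; with Xaa included (alphabet_size=21) it drops to roughly 2.27‰.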

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.348 ± 1.57
AlaCys: 1.279 ± 0.676
AlaAsp: 1.279 ± 0.676
AlaGlu: 2.558 ± 0.465
AlaPhe: 1.535 ± 0.425
AlaGly: 3.325 ± 1.488
AlaHis: 1.79 ± 0.732
AlaIle: 2.558 ± 0.985
AlaLys: 2.558 ± 0.687
AlaLeu: 6.394 ± 1.277
AlaMet: 0.767 ± 0.237
AlaAsn: 1.023 ± 0.329
AlaPro: 1.535 ± 0.711
AlaGln: 0.767 ± 0.694
AlaArg: 3.069 ± 0.384
AlaSer: 3.069 ± 0.265
AlaThr: 2.558 ± 0.928
AlaVal: 2.558 ± 0.995
AlaTrp: 0.256 ± 0.231
AlaTyr: 1.79 ± 0.466
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.535 ± 0.475
CysCys: 0.512 ± 0.334
CysAsp: 0.256 ± 0.167
CysGlu: 2.813 ± 1.244
CysPhe: 1.535 ± 0.425
CysGly: 1.279 ± 0.816
CysHis: 1.023 ± 0.329
CysIle: 1.535 ± 0.494
CysLys: 2.558 ± 1.034
CysLeu: 3.325 ± 1.414
CysMet: 1.535 ± 0.475
CysAsn: 1.023 ± 0.329
CysPro: 1.279 ± 0.405
CysGln: 1.535 ± 0.582
CysArg: 0.767 ± 0.365
CysSer: 4.092 ± 1.09
CysThr: 1.535 ± 0.711
CysVal: 1.535 ± 0.86
CysTrp: 0.256 ± 0.55
CysTyr: 1.023 ± 0.53
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.581 ± 2.673
AspCys: 1.79 ± 0.606
AspAsp: 3.325 ± 1.155
AspGlu: 4.859 ± 1.103
AspPhe: 2.813 ± 0.933
AspGly: 2.558 ± 0.823
AspHis: 0.512 ± 0.165
AspIle: 4.859 ± 1.258
AspLys: 2.302 ± 0.905
AspLeu: 4.092 ± 1.646
AspMet: 3.581 ± 1.702
AspAsn: 3.581 ± 0.549
AspPro: 2.046 ± 0.741
AspGln: 2.046 ± 0.464
AspArg: 2.302 ± 0.388
AspSer: 5.115 ± 1.387
AspThr: 2.302 ± 1.014
AspVal: 1.535 ± 0.473
AspTrp: 1.023 ± 0.513
AspTyr: 2.813 ± 1.314
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.581 ± 2.107
GluCys: 2.302 ± 0.378
GluAsp: 5.627 ± 1.499
GluGlu: 5.627 ± 1.999
GluPhe: 4.092 ± 1.183
GluGly: 3.325 ± 0.974
GluHis: 0.512 ± 0.165
GluIle: 5.627 ± 1.236
GluLys: 2.558 ± 0.435
GluLeu: 5.371 ± 1.469
GluMet: 0.767 ± 0.365
GluAsn: 1.535 ± 0.475
GluPro: 1.535 ± 0.353
GluGln: 2.302 ± 0.835
GluArg: 3.836 ± 0.637
GluSer: 4.092 ± 0.757
GluThr: 3.325 ± 1.155
GluVal: 5.371 ± 2.017
GluTrp: 0.256 ± 0.167
GluTyr: 3.581 ± 0.647
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.558 ± 1.651
PheCys: 1.023 ± 0.588
PheAsp: 4.092 ± 0.698
PheGlu: 2.046 ± 0.898
PhePhe: 1.535 ± 0.473
PheGly: 1.79 ± 0.606
PheHis: 1.023 ± 0.4
PheIle: 1.279 ± 0.373
PheLys: 3.581 ± 0.693
PheLeu: 5.115 ± 0.751
PheMet: 2.302 ± 0.683
PheAsn: 2.046 ± 0.438
PhePro: 1.023 ± 0.668
PheGln: 0.512 ± 0.165
PheArg: 1.79 ± 0.675
PheSer: 4.859 ± 1.582
PheThr: 2.813 ± 1.222
PheVal: 2.046 ± 0.752
PheTrp: 0.767 ± 0.46
PheTyr: 0.767 ± 0.46
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.302 ± 0.905
GlyCys: 2.046 ± 0.605
GlyAsp: 5.115 ± 1.065
GlyGlu: 1.79 ± 0.631
GlyPhe: 3.325 ± 0.502
GlyGly: 3.836 ± 1.187
GlyHis: 0.767 ± 0.501
GlyIle: 4.092 ± 0.776
GlyLys: 3.325 ± 1.912
GlyLeu: 5.371 ± 1.626
GlyMet: 1.535 ± 0.642
GlyAsn: 2.302 ± 0.984
GlyPro: 2.302 ± 0.513
GlyGln: 1.023 ± 0.588
GlyArg: 1.279 ± 0.774
GlySer: 6.394 ± 1.564
GlyThr: 1.535 ± 0.475
GlyVal: 4.348 ± 0.748
GlyTrp: 1.535 ± 0.425
GlyTyr: 1.535 ± 0.692
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.767 ± 0.365
HisCys: 1.279 ± 0.497
HisAsp: 1.279 ± 0.531
HisGlu: 1.279 ± 0.405
HisPhe: 1.79 ± 0.606
HisGly: 1.79 ± 0.525
HisHis: 0.256 ± 0.231
HisIle: 1.279 ± 0.835
HisLys: 0.767 ± 0.502
HisLeu: 1.023 ± 0.588
HisMet: 0.0 ± 0.0
HisAsn: 0.767 ± 0.665
HisPro: 2.046 ± 0.697
HisGln: 0.767 ± 0.501
HisArg: 1.023 ± 0.329
HisSer: 2.813 ± 0.576
HisThr: 0.767 ± 0.237
HisVal: 1.279 ± 0.373
HisTrp: 0.0 ± 0.0
HisTyr: 1.023 ± 0.329
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.836 ± 0.844
IleCys: 1.535 ± 0.86
IleAsp: 3.325 ± 1.038
IleGlu: 4.604 ± 1.81
IlePhe: 2.046 ± 0.658
IleGly: 4.348 ± 0.879
IleHis: 2.046 ± 0.438
IleIle: 3.325 ± 0.673
IleLys: 4.092 ± 1.623
IleLeu: 6.394 ± 1.37
IleMet: 2.302 ± 0.788
IleAsn: 4.092 ± 0.723
IlePro: 2.558 ± 0.694
IleGln: 1.535 ± 1.005
IleArg: 4.092 ± 0.757
IleSer: 8.44 ± 2.015
IleThr: 3.836 ± 1.187
IleVal: 4.604 ± 0.349
IleTrp: 0.767 ± 0.501
IleTyr: 2.302 ± 0.712
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.836 ± 0.844
LysCys: 2.302 ± 0.805
LysAsp: 2.813 ± 0.355
LysGlu: 4.348 ± 1.447
LysPhe: 2.302 ± 0.513
LysGly: 3.581 ± 0.891
LysHis: 1.279 ± 0.405
LysIle: 4.092 ± 1.504
LysLys: 4.092 ± 1.313
LysLeu: 5.371 ± 1.254
LysMet: 3.325 ± 0.931
LysAsn: 3.325 ± 1.692
LysPro: 3.069 ± 0.265
LysGln: 2.046 ± 0.741
LysArg: 2.046 ± 1.725
LysSer: 6.394 ± 2.329
LysThr: 5.371 ± 1.235
LysVal: 3.581 ± 1.073
LysTrp: 1.279 ± 0.531
LysTyr: 1.535 ± 0.473
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.813 ± 0.91
LeuCys: 2.813 ± 0.474
LeuAsp: 4.604 ± 0.491
LeuGlu: 6.138 ± 2.284
LeuPhe: 4.092 ± 1.504
LeuGly: 4.859 ± 0.523
LeuHis: 1.79 ± 0.606
LeuIle: 7.673 ± 0.866
LeuLys: 6.65 ± 0.85
LeuLeu: 8.696 ± 1.075
LeuMet: 1.79 ± 0.361
LeuAsn: 4.604 ± 1.043
LeuPro: 2.813 ± 1.031
LeuGln: 3.325 ± 0.81
LeuArg: 6.394 ± 0.427
LeuSer: 9.207 ± 2.657
LeuThr: 5.115 ± 1.22
LeuVal: 4.348 ± 2.295
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.302 ± 0.52
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.279 ± 0.373
MetCys: 0.256 ± 0.167
MetAsp: 1.279 ± 0.725
MetGlu: 2.302 ± 0.785
MetPhe: 2.046 ± 0.603
MetGly: 1.279 ± 0.531
MetHis: 0.767 ± 0.622
MetIle: 3.069 ± 0.467
MetLys: 2.813 ± 0.943
MetLeu: 1.279 ± 0.405
MetMet: 2.046 ± 1.074
MetAsn: 2.302 ± 0.71
MetPro: 1.023 ± 0.82
MetGln: 1.023 ± 0.376
MetArg: 2.558 ± 0.694
MetSer: 2.813 ± 1.132
MetThr: 2.558 ± 1.547
MetVal: 1.535 ± 0.494
MetTrp: 0.0 ± 0.0
MetTyr: 1.023 ± 0.376
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.302 ± 1.31
AsnCys: 2.046 ± 1.177
AsnAsp: 2.046 ± 0.872
AsnGlu: 3.069 ± 0.711
AsnPhe: 1.279 ± 0.652
AsnGly: 3.325 ± 1.076
AsnHis: 1.535 ± 0.353
AsnIle: 2.302 ± 0.513
AsnLys: 3.581 ± 0.851
AsnLeu: 6.65 ± 1.09
AsnMet: 2.046 ± 0.894
AsnAsn: 2.558 ± 1.331
AsnPro: 2.302 ± 1.098
AsnGln: 1.023 ± 0.329
AsnArg: 3.069 ± 1.191
AsnSer: 3.581 ± 0.549
AsnThr: 2.558 ± 0.551
AsnVal: 1.279 ± 0.531
AsnTrp: 0.767 ± 0.365
AsnTyr: 2.302 ± 0.52
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.767 ± 0.237
ProCys: 0.767 ± 0.576
ProAsp: 1.535 ± 1.001
ProGlu: 3.325 ± 1.546
ProPhe: 2.558 ± 0.836
ProGly: 3.581 ± 1.392
ProHis: 1.023 ± 0.53
ProIle: 2.302 ± 0.782
ProLys: 1.535 ± 0.543
ProLeu: 2.046 ± 0.605
ProMet: 0.767 ± 0.622
ProAsn: 1.535 ± 0.353
ProPro: 0.512 ± 0.165
ProGln: 1.79 ± 0.675
ProArg: 1.79 ± 0.732
ProSer: 4.092 ± 1.394
ProThr: 1.535 ± 0.711
ProVal: 2.302 ± 1.024
ProTrp: 0.767 ± 0.502
ProTyr: 1.023 ± 0.376
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.767 ± 0.74
GlnCys: 1.535 ± 0.73
GlnAsp: 2.046 ± 1.357
GlnGlu: 1.535 ± 0.494
GlnPhe: 1.279 ± 0.865
GlnGly: 2.046 ± 0.601
GlnHis: 1.279 ± 0.835
GlnIle: 4.092 ± 0.731
GlnLys: 2.813 ± 0.842
GlnLeu: 2.302 ± 0.871
GlnMet: 1.023 ± 0.513
GlnAsn: 1.535 ± 0.475
GlnPro: 1.279 ± 0.725
GlnGln: 1.279 ± 0.501
GlnArg: 1.023 ± 0.871
GlnSer: 1.535 ± 0.353
GlnThr: 2.046 ± 0.603
GlnVal: 3.069 ± 0.854
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.512 ± 0.462
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.535 ± 0.891
ArgCys: 2.046 ± 0.438
ArgAsp: 3.836 ± 0.735
ArgGlu: 3.836 ± 1.005
ArgPhe: 0.767 ± 0.365
ArgGly: 2.302 ± 0.984
ArgHis: 0.512 ± 0.165
ArgIle: 5.115 ± 0.392
ArgLys: 2.558 ± 1.002
ArgLeu: 5.115 ± 1.505
ArgMet: 2.302 ± 0.501
ArgAsn: 2.813 ± 0.908
ArgPro: 1.535 ± 0.73
ArgGln: 1.023 ± 0.513
ArgArg: 2.046 ± 1.105
ArgSer: 6.138 ± 2.586
ArgThr: 1.279 ± 0.373
ArgVal: 2.558 ± 1.034
ArgTrp: 0.767 ± 0.237
ArgTyr: 1.79 ± 0.631
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.581 ± 1.05
SerCys: 2.302 ± 1.737
SerAsp: 7.417 ± 1.29
SerGlu: 5.115 ± 1.319
SerPhe: 4.348 ± 2.31
SerGly: 3.836 ± 1.051
SerHis: 2.046 ± 0.603
SerIle: 5.371 ± 1.577
SerLys: 9.207 ± 1.311
SerLeu: 10.23 ± 0.688
SerMet: 2.813 ± 0.861
SerAsn: 5.627 ± 1.212
SerPro: 3.069 ± 1.128
SerGln: 4.604 ± 0.658
SerArg: 3.836 ± 0.738
SerSer: 9.719 ± 0.994
SerThr: 5.627 ± 0.71
SerVal: 5.627 ± 1.583
SerTrp: 1.279 ± 0.652
SerTyr: 3.836 ± 2.226
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.79 ± 0.412
ThrCys: 1.79 ± 0.466
ThrAsp: 2.813 ± 0.576
ThrGlu: 4.348 ± 0.738
ThrPhe: 1.023 ± 0.82
ThrGly: 3.836 ± 1.399
ThrHis: 0.767 ± 0.501
ThrIle: 4.604 ± 0.214
ThrLys: 4.348 ± 0.879
ThrLeu: 6.394 ± 1.576
ThrMet: 1.279 ± 0.373
ThrAsn: 2.302 ± 0.378
ThrPro: 1.279 ± 0.373
ThrGln: 3.069 ± 0.863
ThrArg: 2.558 ± 1.013
ThrSer: 4.859 ± 0.305
ThrThr: 2.558 ± 0.841
ThrVal: 2.302 ± 1.0
ThrTrp: 0.256 ± 0.685
ThrTyr: 1.023 ± 0.376
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.069 ± 0.586
ValCys: 2.046 ± 0.605
ValAsp: 2.302 ± 0.501
ValGlu: 3.836 ± 1.402
ValPhe: 3.325 ± 0.974
ValGly: 2.302 ± 0.556
ValHis: 2.046 ± 0.658
ValIle: 3.836 ± 1.574
ValLys: 4.092 ± 1.28
ValLeu: 1.79 ± 0.846
ValMet: 1.023 ± 0.558
ValAsn: 2.046 ± 1.211
ValPro: 1.79 ± 0.631
ValGln: 2.046 ± 0.63
ValArg: 4.859 ± 0.982
ValSer: 7.928 ± 1.152
ValThr: 3.581 ± 0.182
ValVal: 2.558 ± 0.551
ValTrp: 1.023 ± 0.329
ValTyr: 1.023 ± 0.668
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.512 ± 0.334
TrpAsp: 0.512 ± 0.334
TrpGlu: 0.512 ± 0.334
TrpPhe: 0.512 ± 0.334
TrpGly: 1.279 ± 1.017
TrpHis: 0.256 ± 0.167
TrpIle: 0.512 ± 0.165
TrpLys: 0.512 ± 0.632
TrpLeu: 0.767 ± 0.365
TrpMet: 0.512 ± 0.334
TrpAsn: 0.512 ± 0.165
TrpPro: 0.512 ± 0.5
TrpGln: 0.0 ± 0.0
TrpArg: 0.256 ± 0.167
TrpSer: 1.535 ± 0.353
TrpThr: 1.279 ± 0.497
TrpVal: 1.279 ± 0.497
TrpTrp: 0.256 ± 0.167
TrpTyr: 0.256 ± 0.167
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.767 ± 0.501
TyrCys: 0.767 ± 0.237
TyrAsp: 1.535 ± 1.478
TyrGlu: 1.279 ± 0.676
TyrPhe: 0.767 ± 0.501
TyrGly: 1.279 ± 0.531
TyrHis: 0.767 ± 0.237
TyrIle: 2.558 ± 1.034
TyrLys: 2.302 ± 0.388
TyrLeu: 2.302 ± 0.562
TyrMet: 1.279 ± 0.774
TyrAsn: 4.092 ± 0.753
TyrPro: 2.046 ± 1.408
TyrGln: 1.279 ± 1.187
TyrArg: 1.279 ± 0.652
TyrSer: 2.813 ± 1.27
TyrThr: 1.023 ± 0.329
TyrVal: 2.813 ± 0.474
TyrTrp: 0.512 ± 0.165
TyrTyr: 1.279 ± 0.418
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3911 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
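A protein-level bootstrap of this kind could be sketched in Python as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicate values is reported. The function names, the toy sequences, the fixed seed, and the use of the population standard deviation as the error are assumptions for illustration; the note above does not specify which spread statistic was used.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (overlapping pairs) in per mille."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        values.append(pair_per_mille(resample, pair))
    return statistics.pstdev(values)


# Toy proteins in one-letter code, not the real proteome.
proteins = ["MKLLSA", "LLSLLG", "MAAAK", "GSLSL"]
err = bootstrap_error(proteins, "LL")
```

With only 4 proteins, as in this proteome, each replicate is drawn from very few units, so errors estimated this way can be large relative to the values themselves.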

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski