Amino acid dipepetide frequency for Achimota virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.337AlaAla: 4.337 ± 0.767
1.38AlaCys: 1.38 ± 0.442
2.957AlaAsp: 2.957 ± 0.923
2.76AlaGlu: 2.76 ± 0.733
1.577AlaPhe: 1.577 ± 0.749
3.548AlaGly: 3.548 ± 0.785
1.577AlaHis: 1.577 ± 0.448
4.14AlaIle: 4.14 ± 0.878
2.76AlaLys: 2.76 ± 0.715
7.096AlaLeu: 7.096 ± 1.155
0.986AlaMet: 0.986 ± 0.7
3.745AlaAsn: 3.745 ± 0.781
2.76AlaPro: 2.76 ± 0.907
2.76AlaGln: 2.76 ± 0.608
3.745AlaArg: 3.745 ± 1.018
4.928AlaSer: 4.928 ± 0.943
3.548AlaThr: 3.548 ± 0.794
2.957AlaVal: 2.957 ± 0.882
0.788AlaTrp: 0.788 ± 0.434
1.774AlaTyr: 1.774 ± 0.406
0.0AlaXaa: 0.0 ± 0.0
Cys
1.38CysAla: 1.38 ± 0.4
0.394CysCys: 0.394 ± 0.215
0.788CysAsp: 0.788 ± 0.257
1.183CysGlu: 1.183 ± 0.524
0.591CysPhe: 0.591 ± 0.462
1.183CysGly: 1.183 ± 0.678
0.197CysHis: 0.197 ± 0.263
1.38CysIle: 1.38 ± 0.294
0.788CysLys: 0.788 ± 0.519
1.971CysLeu: 1.971 ± 0.762
0.591CysMet: 0.591 ± 0.309
0.591CysAsn: 0.591 ± 0.501
1.183CysPro: 1.183 ± 0.648
0.394CysGln: 0.394 ± 0.256
0.986CysArg: 0.986 ± 0.332
2.76CysSer: 2.76 ± 0.851
1.38CysThr: 1.38 ± 0.684
2.168CysVal: 2.168 ± 0.553
0.0CysTrp: 0.0 ± 0.0
0.986CysTyr: 0.986 ± 0.314
0.0CysXaa: 0.0 ± 0.0
Asp
3.351AspAla: 3.351 ± 0.675
0.788AspCys: 0.788 ± 0.5
4.14AspAsp: 4.14 ± 0.902
3.745AspGlu: 3.745 ± 1.59
1.577AspPhe: 1.577 ± 0.627
1.38AspGly: 1.38 ± 0.832
0.986AspHis: 0.986 ± 0.64
4.14AspIle: 4.14 ± 1.275
3.154AspLys: 3.154 ± 0.945
4.928AspLeu: 4.928 ± 1.264
0.986AspMet: 0.986 ± 0.274
1.971AspAsn: 1.971 ± 0.417
3.154AspPro: 3.154 ± 0.969
4.14AspGln: 4.14 ± 0.457
2.365AspArg: 2.365 ± 0.76
3.745AspSer: 3.745 ± 0.37
4.928AspThr: 4.928 ± 1.37
1.38AspVal: 1.38 ± 0.542
0.591AspTrp: 0.591 ± 0.277
2.168AspTyr: 2.168 ± 0.607
0.0AspXaa: 0.0 ± 0.0
Glu
1.38GluAla: 1.38 ± 0.543
1.183GluCys: 1.183 ± 0.507
2.957GluAsp: 2.957 ± 1.287
2.563GluGlu: 2.563 ± 0.418
1.971GluPhe: 1.971 ± 0.626
4.14GluGly: 4.14 ± 0.525
0.986GluHis: 0.986 ± 0.395
5.322GluIle: 5.322 ± 0.95
2.563GluLys: 2.563 ± 0.535
5.717GluLeu: 5.717 ± 0.874
1.577GluMet: 1.577 ± 1.019
1.971GluAsn: 1.971 ± 0.356
1.774GluPro: 1.774 ± 0.582
2.957GluGln: 2.957 ± 1.262
3.154GluArg: 3.154 ± 0.732
3.548GluSer: 3.548 ± 0.904
3.154GluThr: 3.154 ± 0.807
2.76GluVal: 2.76 ± 0.837
0.788GluTrp: 0.788 ± 0.378
0.591GluTyr: 0.591 ± 0.327
0.0GluXaa: 0.0 ± 0.0
Phe
2.563PheAla: 2.563 ± 0.832
0.394PheCys: 0.394 ± 0.217
1.971PheAsp: 1.971 ± 0.693
2.168PheGlu: 2.168 ± 0.555
1.38PhePhe: 1.38 ± 0.544
1.577PheGly: 1.577 ± 0.396
0.788PheHis: 0.788 ± 0.456
3.154PheIle: 3.154 ± 0.479
2.365PheLys: 2.365 ± 0.967
3.351PheLeu: 3.351 ± 1.006
0.986PheMet: 0.986 ± 0.472
1.971PheAsn: 1.971 ± 0.638
1.183PhePro: 1.183 ± 0.493
0.591PheGln: 0.591 ± 0.225
2.168PheArg: 2.168 ± 0.705
3.942PheSer: 3.942 ± 1.02
2.168PheThr: 2.168 ± 0.557
1.38PheVal: 1.38 ± 0.341
0.788PheTrp: 0.788 ± 0.314
0.986PheTyr: 0.986 ± 0.411
0.0PheXaa: 0.0 ± 0.0
Gly
3.548GlyAla: 3.548 ± 1.119
0.788GlyCys: 0.788 ± 0.429
2.365GlyAsp: 2.365 ± 0.813
1.971GlyGlu: 1.971 ± 0.758
1.38GlyPhe: 1.38 ± 0.32
2.563GlyGly: 2.563 ± 0.914
1.183GlyHis: 1.183 ± 0.373
3.548GlyIle: 3.548 ± 0.371
2.365GlyLys: 2.365 ± 0.599
6.111GlyLeu: 6.111 ± 1.084
1.183GlyMet: 1.183 ± 0.663
2.76GlyAsn: 2.76 ± 0.711
2.563GlyPro: 2.563 ± 1.392
1.971GlyGln: 1.971 ± 0.605
2.76GlyArg: 2.76 ± 0.719
5.322GlySer: 5.322 ± 1.423
1.774GlyThr: 1.774 ± 0.51
4.928GlyVal: 4.928 ± 0.949
0.394GlyTrp: 0.394 ± 0.384
1.774GlyTyr: 1.774 ± 0.535
0.0GlyXaa: 0.0 ± 0.0
His
1.183HisAla: 1.183 ± 0.291
0.197HisCys: 0.197 ± 0.263
1.183HisAsp: 1.183 ± 0.584
1.183HisGlu: 1.183 ± 0.943
1.774HisPhe: 1.774 ± 0.904
0.394HisGly: 0.394 ± 0.499
0.394HisHis: 0.394 ± 0.256
1.971HisIle: 1.971 ± 0.677
0.788HisLys: 0.788 ± 0.512
2.365HisLeu: 2.365 ± 0.837
0.394HisMet: 0.394 ± 0.397
0.986HisAsn: 0.986 ± 0.48
1.774HisPro: 1.774 ± 0.446
0.986HisGln: 0.986 ± 0.28
1.183HisArg: 1.183 ± 0.507
1.577HisSer: 1.577 ± 0.607
0.788HisThr: 0.788 ± 0.314
1.183HisVal: 1.183 ± 0.418
0.197HisTrp: 0.197 ± 0.128
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.534IleAla: 4.534 ± 1.13
1.774IleCys: 1.774 ± 0.544
6.702IleAsp: 6.702 ± 1.678
3.942IleGlu: 3.942 ± 1.322
2.563IlePhe: 2.563 ± 0.637
2.168IleGly: 2.168 ± 1.063
0.788IleHis: 0.788 ± 0.369
7.096IleIle: 7.096 ± 1.2
4.731IleLys: 4.731 ± 0.715
7.294IleLeu: 7.294 ± 1.423
1.577IleMet: 1.577 ± 0.423
3.548IleAsn: 3.548 ± 0.934
2.365IlePro: 2.365 ± 0.451
3.745IleGln: 3.745 ± 1.031
3.548IleArg: 3.548 ± 0.668
6.899IleSer: 6.899 ± 2.081
5.322IleThr: 5.322 ± 1.08
3.942IleVal: 3.942 ± 1.175
1.183IleTrp: 1.183 ± 0.466
2.563IleTyr: 2.563 ± 0.515
0.0IleXaa: 0.0 ± 0.0
Lys
3.942LysAla: 3.942 ± 0.923
1.38LysCys: 1.38 ± 0.429
2.76LysAsp: 2.76 ± 0.833
2.957LysGlu: 2.957 ± 0.873
2.168LysPhe: 2.168 ± 0.541
3.548LysGly: 3.548 ± 0.519
1.183LysHis: 1.183 ± 0.6
2.563LysIle: 2.563 ± 0.728
3.942LysLys: 3.942 ± 1.327
6.111LysLeu: 6.111 ± 0.962
1.183LysMet: 1.183 ± 0.393
2.365LysAsn: 2.365 ± 0.669
1.774LysPro: 1.774 ± 0.401
4.14LysGln: 4.14 ± 1.978
2.76LysArg: 2.76 ± 0.799
3.154LysSer: 3.154 ± 0.761
2.563LysThr: 2.563 ± 0.445
2.365LysVal: 2.365 ± 0.539
0.788LysTrp: 0.788 ± 0.618
1.183LysTyr: 1.183 ± 0.676
0.0LysXaa: 0.0 ± 0.0
Leu
6.111LeuAla: 6.111 ± 1.169
2.365LeuCys: 2.365 ± 0.88
6.308LeuAsp: 6.308 ± 1.351
6.505LeuGlu: 6.505 ± 1.351
2.76LeuPhe: 2.76 ± 0.898
4.14LeuGly: 4.14 ± 1.005
1.577LeuHis: 1.577 ± 0.586
7.688LeuIle: 7.688 ± 1.735
6.111LeuLys: 6.111 ± 1.166
8.673LeuLeu: 8.673 ± 2.428
2.563LeuMet: 2.563 ± 0.44
5.914LeuAsn: 5.914 ± 1.489
5.717LeuPro: 5.717 ± 1.034
5.914LeuGln: 5.914 ± 1.019
3.548LeuArg: 3.548 ± 0.972
9.265LeuSer: 9.265 ± 2.134
9.265LeuThr: 9.265 ± 1.489
5.125LeuVal: 5.125 ± 0.878
0.986LeuTrp: 0.986 ± 0.469
4.337LeuTyr: 4.337 ± 1.346
0.0LeuXaa: 0.0 ± 0.0
Met
2.365MetAla: 2.365 ± 1.012
0.591MetCys: 0.591 ± 0.281
2.168MetAsp: 2.168 ± 0.483
0.788MetGlu: 0.788 ± 0.433
0.394MetPhe: 0.394 ± 0.215
0.394MetGly: 0.394 ± 0.217
0.0MetHis: 0.0 ± 0.0
1.577MetIle: 1.577 ± 0.442
0.197MetLys: 0.197 ± 0.128
2.76MetLeu: 2.76 ± 0.774
1.183MetMet: 1.183 ± 0.917
0.788MetAsn: 0.788 ± 0.377
1.183MetPro: 1.183 ± 0.423
1.577MetGln: 1.577 ± 0.601
1.971MetArg: 1.971 ± 0.677
2.168MetSer: 2.168 ± 0.685
1.577MetThr: 1.577 ± 0.596
0.986MetVal: 0.986 ± 0.405
0.197MetTrp: 0.197 ± 0.249
0.591MetTyr: 0.591 ± 0.264
0.0MetXaa: 0.0 ± 0.0
Asn
2.957AsnAla: 2.957 ± 0.861
0.788AsnCys: 0.788 ± 0.38
1.774AsnAsp: 1.774 ± 0.437
1.971AsnGlu: 1.971 ± 0.498
1.774AsnPhe: 1.774 ± 0.406
1.38AsnGly: 1.38 ± 0.638
2.168AsnHis: 2.168 ± 0.599
3.351AsnIle: 3.351 ± 0.947
2.365AsnLys: 2.365 ± 0.895
4.928AsnLeu: 4.928 ± 1.349
0.788AsnMet: 0.788 ± 0.605
1.577AsnAsn: 1.577 ± 0.258
4.534AsnPro: 4.534 ± 0.956
1.774AsnGln: 1.774 ± 0.389
2.957AsnArg: 2.957 ± 0.477
3.745AsnSer: 3.745 ± 1.583
1.971AsnThr: 1.971 ± 0.212
2.76AsnVal: 2.76 ± 0.756
0.788AsnTrp: 0.788 ± 0.512
1.38AsnTyr: 1.38 ± 0.393
0.0AsnXaa: 0.0 ± 0.0
Pro
2.76ProAla: 2.76 ± 1.378
1.577ProCys: 1.577 ± 1.116
1.183ProAsp: 1.183 ± 0.272
3.548ProGlu: 3.548 ± 0.897
1.971ProPhe: 1.971 ± 1.135
4.928ProGly: 4.928 ± 1.989
0.788ProHis: 0.788 ± 0.721
2.365ProIle: 2.365 ± 0.458
1.971ProLys: 1.971 ± 0.518
5.322ProLeu: 5.322 ± 0.917
0.986ProMet: 0.986 ± 0.365
2.76ProAsn: 2.76 ± 0.862
3.548ProPro: 3.548 ± 1.125
3.351ProGln: 3.351 ± 0.762
3.154ProArg: 3.154 ± 1.137
6.308ProSer: 6.308 ± 2.358
4.534ProThr: 4.534 ± 0.6
1.971ProVal: 1.971 ± 0.476
0.591ProTrp: 0.591 ± 0.462
1.577ProTyr: 1.577 ± 0.896
0.0ProXaa: 0.0 ± 0.0
Gln
2.168GlnAla: 2.168 ± 0.901
0.788GlnCys: 0.788 ± 0.27
1.971GlnAsp: 1.971 ± 0.478
3.154GlnGlu: 3.154 ± 1.186
1.971GlnPhe: 1.971 ± 0.574
3.942GlnGly: 3.942 ± 1.187
1.183GlnHis: 1.183 ± 0.683
3.154GlnIle: 3.154 ± 0.444
2.563GlnLys: 2.563 ± 0.604
5.914GlnLeu: 5.914 ± 0.935
1.774GlnMet: 1.774 ± 0.652
2.957GlnAsn: 2.957 ± 0.671
3.942GlnPro: 3.942 ± 2.147
2.76GlnGln: 2.76 ± 1.513
1.577GlnArg: 1.577 ± 0.371
3.548GlnSer: 3.548 ± 0.739
2.563GlnThr: 2.563 ± 0.813
3.351GlnVal: 3.351 ± 1.483
0.0GlnTrp: 0.0 ± 0.0
0.986GlnTyr: 0.986 ± 0.417
0.0GlnXaa: 0.0 ± 0.0
Arg
1.971ArgAla: 1.971 ± 0.714
0.394ArgCys: 0.394 ± 0.234
1.183ArgAsp: 1.183 ± 0.551
2.563ArgGlu: 2.563 ± 0.847
2.168ArgPhe: 2.168 ± 0.566
2.957ArgGly: 2.957 ± 0.427
1.577ArgHis: 1.577 ± 0.45
5.322ArgIle: 5.322 ± 0.855
3.548ArgLys: 3.548 ± 0.726
6.702ArgLeu: 6.702 ± 1.677
0.986ArgMet: 0.986 ± 0.286
2.168ArgAsn: 2.168 ± 0.383
2.76ArgPro: 2.76 ± 1.02
1.38ArgGln: 1.38 ± 0.316
3.154ArgArg: 3.154 ± 0.889
4.14ArgSer: 4.14 ± 0.494
1.971ArgThr: 1.971 ± 0.557
3.351ArgVal: 3.351 ± 0.683
0.0ArgTrp: 0.0 ± 0.0
1.38ArgTyr: 1.38 ± 0.479
0.0ArgXaa: 0.0 ± 0.0
Ser
5.125SerAla: 5.125 ± 0.727
2.563SerCys: 2.563 ± 0.764
5.125SerAsp: 5.125 ± 0.584
3.942SerGlu: 3.942 ± 1.285
2.957SerPhe: 2.957 ± 0.802
4.928SerGly: 4.928 ± 1.351
2.168SerHis: 2.168 ± 0.561
8.476SerIle: 8.476 ± 1.227
5.914SerLys: 5.914 ± 0.796
8.476SerLeu: 8.476 ± 1.924
2.563SerMet: 2.563 ± 0.663
3.154SerAsn: 3.154 ± 0.904
5.125SerPro: 5.125 ± 1.209
3.351SerGln: 3.351 ± 0.959
2.957SerArg: 2.957 ± 0.438
6.702SerSer: 6.702 ± 1.093
5.322SerThr: 5.322 ± 1.228
4.337SerVal: 4.337 ± 1.033
1.183SerTrp: 1.183 ± 0.473
1.971SerTyr: 1.971 ± 0.803
0.0SerXaa: 0.0 ± 0.0
Thr
4.14ThrAla: 4.14 ± 1.411
1.183ThrCys: 1.183 ± 0.696
2.563ThrAsp: 2.563 ± 0.582
2.563ThrGlu: 2.563 ± 0.515
2.563ThrPhe: 2.563 ± 0.523
2.563ThrGly: 2.563 ± 0.501
1.38ThrHis: 1.38 ± 0.496
3.154ThrIle: 3.154 ± 0.456
2.365ThrLys: 2.365 ± 0.364
5.519ThrLeu: 5.519 ± 1.548
1.38ThrMet: 1.38 ± 0.524
2.563ThrAsn: 2.563 ± 0.573
5.322ThrPro: 5.322 ± 0.807
2.563ThrGln: 2.563 ± 0.667
3.745ThrArg: 3.745 ± 0.458
5.125ThrSer: 5.125 ± 0.707
6.899ThrThr: 6.899 ± 0.458
4.928ThrVal: 4.928 ± 2.185
0.986ThrTrp: 0.986 ± 0.433
2.957ThrTyr: 2.957 ± 0.681
0.0ThrXaa: 0.0 ± 0.0
Val
3.351ValAla: 3.351 ± 0.966
1.183ValCys: 1.183 ± 0.398
4.928ValAsp: 4.928 ± 1.383
2.168ValGlu: 2.168 ± 0.696
2.168ValPhe: 2.168 ± 0.407
3.745ValGly: 3.745 ± 0.442
1.183ValHis: 1.183 ± 0.687
5.125ValIle: 5.125 ± 1.278
2.76ValLys: 2.76 ± 1.553
5.125ValLeu: 5.125 ± 1.354
1.38ValMet: 1.38 ± 0.548
1.577ValAsn: 1.577 ± 0.663
1.774ValPro: 1.774 ± 0.444
3.351ValGln: 3.351 ± 0.769
2.563ValArg: 2.563 ± 0.418
3.942ValSer: 3.942 ± 1.013
3.745ValThr: 3.745 ± 0.808
4.14ValVal: 4.14 ± 1.2
0.394ValTrp: 0.394 ± 0.355
1.577ValTyr: 1.577 ± 0.54
0.0ValXaa: 0.0 ± 0.0
Trp
0.986TrpAla: 0.986 ± 0.314
0.394TrpCys: 0.394 ± 0.396
0.0TrpAsp: 0.0 ± 0.0
0.197TrpGlu: 0.197 ± 0.128
0.591TrpPhe: 0.591 ± 0.309
0.591TrpGly: 0.591 ± 0.275
0.0TrpHis: 0.0 ± 0.0
1.577TrpIle: 1.577 ± 0.388
0.591TrpLys: 0.591 ± 0.252
1.38TrpLeu: 1.38 ± 0.54
0.0TrpMet: 0.0 ± 0.0
0.591TrpAsn: 0.591 ± 0.327
0.986TrpPro: 0.986 ± 0.376
0.0TrpGln: 0.0 ± 0.0
0.591TrpArg: 0.591 ± 0.236
1.38TrpSer: 1.38 ± 0.778
0.394TrpThr: 0.394 ± 0.256
0.197TrpVal: 0.197 ± 0.263
0.197TrpTrp: 0.197 ± 0.263
0.197TrpTyr: 0.197 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.971TyrAla: 1.971 ± 0.417
0.788TyrCys: 0.788 ± 0.257
0.591TyrAsp: 0.591 ± 0.309
1.183TyrGlu: 1.183 ± 0.428
1.577TyrPhe: 1.577 ± 0.377
1.183TyrGly: 1.183 ± 1.031
0.394TyrHis: 0.394 ± 0.229
1.183TyrIle: 1.183 ± 0.471
0.986TyrLys: 0.986 ± 0.482
5.125TyrLeu: 5.125 ± 1.424
0.197TyrMet: 0.197 ± 0.128
1.774TyrAsn: 1.774 ± 0.781
1.774TyrPro: 1.774 ± 0.734
2.76TyrGln: 2.76 ± 0.451
0.788TyrArg: 0.788 ± 0.623
4.14TyrSer: 4.14 ± 1.096
0.591TyrThr: 0.591 ± 0.584
1.971TyrVal: 1.971 ± 0.496
0.0TyrTrp: 0.0 ± 0.0
1.577TyrTyr: 1.577 ± 0.433
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (5074 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski