Amino acid dipepetide frequency for Yellow-breasted capuchin simian foamy virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.792AlaAla: 3.792 ± 1.53
0.875AlaCys: 0.875 ± 0.359
2.917AlaAsp: 2.917 ± 0.587
2.625AlaGlu: 2.625 ± 0.716
1.75AlaPhe: 1.75 ± 0.627
2.625AlaGly: 2.625 ± 0.916
0.875AlaHis: 0.875 ± 0.37
2.917AlaIle: 2.917 ± 0.587
3.501AlaLys: 3.501 ± 0.408
5.834AlaLeu: 5.834 ± 0.778
0.875AlaMet: 0.875 ± 1.207
1.459AlaAsn: 1.459 ± 0.523
3.209AlaPro: 3.209 ± 1.264
3.209AlaGln: 3.209 ± 0.502
3.209AlaArg: 3.209 ± 0.685
3.501AlaSer: 3.501 ± 1.131
4.959AlaThr: 4.959 ± 0.967
4.667AlaVal: 4.667 ± 0.898
1.75AlaTrp: 1.75 ± 0.744
1.459AlaTyr: 1.459 ± 0.315
0.0AlaXaa: 0.0 ± 0.0
Cys
1.167CysAla: 1.167 ± 0.705
0.292CysCys: 0.292 ± 0.318
0.0CysAsp: 0.0 ± 0.0
0.292CysGlu: 0.292 ± 0.243
1.167CysPhe: 1.167 ± 0.727
1.75CysGly: 1.75 ± 0.93
0.0CysHis: 0.0 ± 0.0
0.583CysIle: 0.583 ± 0.372
2.042CysLys: 2.042 ± 0.717
1.75CysLeu: 1.75 ± 0.67
0.292CysMet: 0.292 ± 0.243
0.0CysAsn: 0.0 ± 0.0
2.042CysPro: 2.042 ± 1.401
0.292CysGln: 0.292 ± 0.367
0.583CysArg: 0.583 ± 0.325
1.167CysSer: 1.167 ± 1.065
0.875CysThr: 0.875 ± 0.595
0.875CysVal: 0.875 ± 0.477
0.0CysTrp: 0.0 ± 0.0
1.167CysTyr: 1.167 ± 0.769
0.0CysXaa: 0.0 ± 0.0
Asp
1.75AspAla: 1.75 ± 0.324
2.042AspCys: 2.042 ± 1.099
2.917AspAsp: 2.917 ± 1.015
2.334AspGlu: 2.334 ± 0.929
1.167AspPhe: 1.167 ± 0.306
2.334AspGly: 2.334 ± 0.94
1.167AspHis: 1.167 ± 0.658
2.625AspIle: 2.625 ± 0.945
3.209AspLys: 3.209 ± 1.118
5.251AspLeu: 5.251 ± 0.911
0.875AspMet: 0.875 ± 0.73
2.042AspAsn: 2.042 ± 0.492
3.209AspPro: 3.209 ± 1.607
2.334AspGln: 2.334 ± 0.94
2.334AspArg: 2.334 ± 0.972
2.334AspSer: 2.334 ± 0.448
2.042AspThr: 2.042 ± 0.19
2.334AspVal: 2.334 ± 0.584
1.167AspTrp: 1.167 ± 0.656
2.625AspTyr: 2.625 ± 0.902
0.0AspXaa: 0.0 ± 0.0
Glu
2.917GluAla: 2.917 ± 1.0
1.167GluCys: 1.167 ± 0.727
1.75GluAsp: 1.75 ± 1.117
6.418GluGlu: 6.418 ± 1.371
2.917GluPhe: 2.917 ± 1.029
4.376GluGly: 4.376 ± 0.909
2.334GluHis: 2.334 ± 0.548
4.084GluIle: 4.084 ± 1.158
2.917GluLys: 2.917 ± 1.207
4.084GluLeu: 4.084 ± 1.098
0.875GluMet: 0.875 ± 0.421
2.917GluAsn: 2.917 ± 1.47
2.042GluPro: 2.042 ± 0.94
2.625GluGln: 2.625 ± 0.953
3.792GluArg: 3.792 ± 0.767
2.625GluSer: 2.625 ± 1.071
4.376GluThr: 4.376 ± 1.596
4.667GluVal: 4.667 ± 0.609
0.583GluTrp: 0.583 ± 0.383
1.167GluTyr: 1.167 ± 0.383
0.0GluXaa: 0.0 ± 0.0
Phe
1.75PheAla: 1.75 ± 0.954
0.583PheCys: 0.583 ± 0.487
2.334PheAsp: 2.334 ± 0.669
1.167PheGlu: 1.167 ± 0.624
0.583PhePhe: 0.583 ± 0.385
1.75PheGly: 1.75 ± 0.789
1.167PheHis: 1.167 ± 0.648
1.459PheIle: 1.459 ± 0.667
3.209PheLys: 3.209 ± 0.889
3.209PheLeu: 3.209 ± 0.648
0.0PheMet: 0.0 ± 0.0
1.459PheAsn: 1.459 ± 0.372
1.167PhePro: 1.167 ± 0.68
1.459PheGln: 1.459 ± 0.653
0.583PheArg: 0.583 ± 0.384
2.917PheSer: 2.917 ± 1.249
3.792PheThr: 3.792 ± 0.785
1.167PheVal: 1.167 ± 0.576
1.167PheTrp: 1.167 ± 0.554
0.292PheTyr: 0.292 ± 0.209
0.0PheXaa: 0.0 ± 0.0
Gly
2.042GlyAla: 2.042 ± 0.448
0.583GlyCys: 0.583 ± 0.486
3.209GlyAsp: 3.209 ± 0.903
4.667GlyGlu: 4.667 ± 1.108
3.209GlyPhe: 3.209 ± 1.219
3.501GlyGly: 3.501 ± 1.645
1.75GlyHis: 1.75 ± 0.408
3.792GlyIle: 3.792 ± 1.251
1.75GlyLys: 1.75 ± 0.984
4.084GlyLeu: 4.084 ± 0.744
1.459GlyMet: 1.459 ± 0.871
3.792GlyAsn: 3.792 ± 1.881
3.209GlyPro: 3.209 ± 1.956
4.084GlyGln: 4.084 ± 0.761
3.501GlyArg: 3.501 ± 1.699
4.084GlySer: 4.084 ± 1.587
2.917GlyThr: 2.917 ± 0.898
2.042GlyVal: 2.042 ± 0.492
0.875GlyTrp: 0.875 ± 0.408
2.917GlyTyr: 2.917 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
1.75HisAla: 1.75 ± 0.599
0.292HisCys: 0.292 ± 0.342
1.459HisAsp: 1.459 ± 0.703
2.334HisGlu: 2.334 ± 0.764
0.583HisPhe: 0.583 ± 0.487
1.75HisGly: 1.75 ± 0.472
0.292HisHis: 0.292 ± 0.342
1.75HisIle: 1.75 ± 0.699
0.875HisLys: 0.875 ± 0.215
1.75HisLeu: 1.75 ± 0.493
0.583HisMet: 0.583 ± 0.439
0.583HisAsn: 0.583 ± 0.233
2.625HisPro: 2.625 ± 0.715
1.459HisGln: 1.459 ± 0.764
0.292HisArg: 0.292 ± 0.209
2.334HisSer: 2.334 ± 0.9
1.75HisThr: 1.75 ± 0.775
0.875HisVal: 0.875 ± 0.37
0.292HisTrp: 0.292 ± 0.243
0.875HisTyr: 0.875 ± 0.37
0.0HisXaa: 0.0 ± 0.0
Ile
1.75IleAla: 1.75 ± 0.493
1.167IleCys: 1.167 ± 0.554
2.625IleAsp: 2.625 ± 0.749
3.209IleGlu: 3.209 ± 0.672
1.167IlePhe: 1.167 ± 0.466
2.625IleGly: 2.625 ± 0.644
0.875IleHis: 0.875 ± 0.428
5.543IleIle: 5.543 ± 1.578
3.501IleLys: 3.501 ± 1.124
6.709IleLeu: 6.709 ± 1.807
1.167IleMet: 1.167 ± 0.456
2.334IleAsn: 2.334 ± 0.547
4.959IlePro: 4.959 ± 1.432
4.667IleGln: 4.667 ± 1.793
2.625IleArg: 2.625 ± 0.628
4.084IleSer: 4.084 ± 1.114
4.959IleThr: 4.959 ± 1.483
5.251IleVal: 5.251 ± 1.303
1.167IleTrp: 1.167 ± 0.554
1.459IleTyr: 1.459 ± 0.621
0.0IleXaa: 0.0 ± 0.0
Lys
4.667LysAla: 4.667 ± 0.66
2.042LysCys: 2.042 ± 1.135
2.917LysAsp: 2.917 ± 0.844
4.084LysGlu: 4.084 ± 1.12
1.75LysPhe: 1.75 ± 0.541
2.042LysGly: 2.042 ± 1.437
0.875LysHis: 0.875 ± 0.411
4.376LysIle: 4.376 ± 0.897
4.376LysLys: 4.376 ± 1.476
5.834LysLeu: 5.834 ± 1.835
0.292LysMet: 0.292 ± 0.342
2.917LysAsn: 2.917 ± 0.822
4.959LysPro: 4.959 ± 1.98
2.334LysGln: 2.334 ± 0.947
2.334LysArg: 2.334 ± 0.767
1.459LysSer: 1.459 ± 0.751
4.667LysThr: 4.667 ± 1.072
1.75LysVal: 1.75 ± 0.509
2.334LysTrp: 2.334 ± 0.915
3.209LysTyr: 3.209 ± 1.028
0.0LysXaa: 0.0 ± 0.0
Leu
4.667LeuAla: 4.667 ± 0.988
0.875LeuCys: 0.875 ± 1.102
5.251LeuAsp: 5.251 ± 1.195
4.376LeuGlu: 4.376 ± 1.887
2.625LeuPhe: 2.625 ± 0.898
5.834LeuGly: 5.834 ± 0.708
2.042LeuHis: 2.042 ± 0.522
4.084LeuIle: 4.084 ± 1.313
5.834LeuLys: 5.834 ± 1.976
10.21LeuLeu: 10.21 ± 2.137
1.167LeuMet: 1.167 ± 0.554
5.543LeuAsn: 5.543 ± 0.37
9.918LeuPro: 9.918 ± 0.713
4.959LeuGln: 4.959 ± 1.381
6.126LeuArg: 6.126 ± 1.408
4.667LeuSer: 4.667 ± 0.581
5.834LeuThr: 5.834 ± 0.594
6.418LeuVal: 6.418 ± 0.426
2.334LeuTrp: 2.334 ± 0.649
2.625LeuTyr: 2.625 ± 0.807
0.0LeuXaa: 0.0 ± 0.0
Met
1.75MetAla: 1.75 ± 0.892
0.583MetCys: 0.583 ± 0.233
0.583MetAsp: 0.583 ± 0.636
0.875MetGlu: 0.875 ± 0.408
0.583MetPhe: 0.583 ± 0.439
1.167MetGly: 1.167 ± 0.307
0.583MetHis: 0.583 ± 0.385
0.875MetIle: 0.875 ± 0.44
0.583MetLys: 0.583 ± 0.233
0.875MetLeu: 0.875 ± 0.685
0.875MetMet: 0.875 ± 0.954
0.875MetAsn: 0.875 ± 0.215
0.875MetPro: 0.875 ± 0.595
0.875MetGln: 0.875 ± 0.359
0.583MetArg: 0.583 ± 0.233
0.292MetSer: 0.292 ± 0.243
2.334MetThr: 2.334 ± 0.759
0.583MetVal: 0.583 ± 0.418
0.0MetTrp: 0.0 ± 0.0
0.292MetTyr: 0.292 ± 0.318
0.0MetXaa: 0.0 ± 0.0
Asn
2.625AsnAla: 2.625 ± 0.81
0.292AsnCys: 0.292 ± 0.243
0.875AsnAsp: 0.875 ± 0.411
2.625AsnGlu: 2.625 ± 0.458
2.917AsnPhe: 2.917 ± 0.719
2.042AsnGly: 2.042 ± 0.827
1.167AsnHis: 1.167 ± 0.466
4.084AsnIle: 4.084 ± 1.103
3.501AsnLys: 3.501 ± 1.013
4.959AsnLeu: 4.959 ± 0.519
1.459AsnMet: 1.459 ± 0.526
4.959AsnAsn: 4.959 ± 1.096
4.084AsnPro: 4.084 ± 2.321
2.917AsnGln: 2.917 ± 0.925
3.209AsnArg: 3.209 ± 1.197
3.209AsnSer: 3.209 ± 1.077
2.625AsnThr: 2.625 ± 0.42
2.334AsnVal: 2.334 ± 0.342
0.875AsnTrp: 0.875 ± 0.73
0.583AsnTyr: 0.583 ± 0.487
0.0AsnXaa: 0.0 ± 0.0
Pro
4.376ProAla: 4.376 ± 1.369
0.875ProCys: 0.875 ± 0.32
2.625ProAsp: 2.625 ± 0.744
2.917ProGlu: 2.917 ± 0.697
2.042ProPhe: 2.042 ± 0.989
4.667ProGly: 4.667 ± 1.526
2.625ProHis: 2.625 ± 1.349
5.251ProIle: 5.251 ± 0.64
5.543ProLys: 5.543 ± 0.695
9.918ProLeu: 9.918 ± 2.507
1.167ProMet: 1.167 ± 0.897
2.042ProAsn: 2.042 ± 0.492
7.585ProPro: 7.585 ± 2.507
5.834ProGln: 5.834 ± 1.679
3.209ProArg: 3.209 ± 1.721
8.168ProSer: 8.168 ± 2.773
3.792ProThr: 3.792 ± 0.712
4.667ProVal: 4.667 ± 0.527
0.583ProTrp: 0.583 ± 0.233
3.209ProTyr: 3.209 ± 1.31
0.0ProXaa: 0.0 ± 0.0
Gln
3.209GlnAla: 3.209 ± 1.643
0.292GlnCys: 0.292 ± 0.367
3.209GlnAsp: 3.209 ± 0.683
2.917GlnGlu: 2.917 ± 1.175
0.583GlnPhe: 0.583 ± 0.487
4.376GlnGly: 4.376 ± 1.432
1.459GlnHis: 1.459 ± 0.645
2.625GlnIle: 2.625 ± 0.589
2.917GlnLys: 2.917 ± 1.289
3.209GlnLeu: 3.209 ± 1.1
0.0GlnMet: 0.0 ± 0.0
3.209GlnAsn: 3.209 ± 0.698
5.543GlnPro: 5.543 ± 2.279
2.334GlnGln: 2.334 ± 0.473
2.917GlnArg: 2.917 ± 0.662
2.625GlnSer: 2.625 ± 0.982
2.042GlnThr: 2.042 ± 0.19
3.501GlnVal: 3.501 ± 1.112
2.042GlnTrp: 2.042 ± 0.399
1.167GlnTyr: 1.167 ± 0.47
0.0GlnXaa: 0.0 ± 0.0
Arg
2.917ArgAla: 2.917 ± 1.095
0.875ArgCys: 0.875 ± 0.44
2.334ArgAsp: 2.334 ± 0.614
3.209ArgGlu: 3.209 ± 1.026
2.042ArgPhe: 2.042 ± 1.068
3.501ArgGly: 3.501 ± 2.779
2.334ArgHis: 2.334 ± 0.487
1.167ArgIle: 1.167 ± 0.43
2.917ArgLys: 2.917 ± 1.483
3.501ArgLeu: 3.501 ± 1.108
1.167ArgMet: 1.167 ± 0.445
2.917ArgAsn: 2.917 ± 1.779
4.959ArgPro: 4.959 ± 2.061
1.167ArgGln: 1.167 ± 0.272
3.792ArgArg: 3.792 ± 1.053
3.209ArgSer: 3.209 ± 1.102
2.625ArgThr: 2.625 ± 0.608
2.917ArgVal: 2.917 ± 0.923
0.875ArgTrp: 0.875 ± 0.461
1.167ArgTyr: 1.167 ± 0.889
0.0ArgXaa: 0.0 ± 0.0
Ser
4.084SerAla: 4.084 ± 0.798
0.583SerCys: 0.583 ± 0.384
2.625SerAsp: 2.625 ± 0.634
3.792SerGlu: 3.792 ± 1.371
0.583SerPhe: 0.583 ± 0.486
4.667SerGly: 4.667 ± 1.922
1.75SerHis: 1.75 ± 1.255
3.209SerIle: 3.209 ± 0.747
2.917SerLys: 2.917 ± 0.905
6.709SerLeu: 6.709 ± 1.839
0.875SerMet: 0.875 ± 0.409
3.209SerAsn: 3.209 ± 0.122
4.084SerPro: 4.084 ± 0.874
2.625SerGln: 2.625 ± 0.813
3.501SerArg: 3.501 ± 1.919
4.667SerSer: 4.667 ± 1.003
6.709SerThr: 6.709 ± 0.853
2.917SerVal: 2.917 ± 0.837
1.167SerTrp: 1.167 ± 0.306
2.625SerTyr: 2.625 ± 0.715
0.0SerXaa: 0.0 ± 0.0
Thr
4.667ThrAla: 4.667 ± 0.435
1.167ThrCys: 1.167 ± 0.656
2.625ThrAsp: 2.625 ± 0.416
4.376ThrGlu: 4.376 ± 1.39
2.334ThrPhe: 2.334 ± 1.357
3.792ThrGly: 3.792 ± 0.641
0.875ThrHis: 0.875 ± 0.528
5.251ThrIle: 5.251 ± 1.4
4.084ThrLys: 4.084 ± 1.204
4.667ThrLeu: 4.667 ± 0.424
0.875ThrMet: 0.875 ± 0.411
3.792ThrAsn: 3.792 ± 1.126
7.293ThrPro: 7.293 ± 0.791
2.625ThrGln: 2.625 ± 0.332
2.042ThrArg: 2.042 ± 0.744
6.709ThrSer: 6.709 ± 0.966
2.917ThrThr: 2.917 ± 1.064
3.792ThrVal: 3.792 ± 0.73
1.75ThrTrp: 1.75 ± 0.509
1.75ThrTyr: 1.75 ± 0.795
0.0ThrXaa: 0.0 ± 0.0
Val
3.209ValAla: 3.209 ± 1.279
0.583ValCys: 0.583 ± 0.735
2.917ValAsp: 2.917 ± 0.594
3.501ValGlu: 3.501 ± 0.733
2.042ValPhe: 2.042 ± 0.826
2.042ValGly: 2.042 ± 0.511
1.75ValHis: 1.75 ± 0.493
3.501ValIle: 3.501 ± 0.605
3.209ValLys: 3.209 ± 1.262
7.001ValLeu: 7.001 ± 0.8
0.583ValMet: 0.583 ± 0.331
3.792ValAsn: 3.792 ± 0.292
5.543ValPro: 5.543 ± 1.33
2.625ValGln: 2.625 ± 0.261
1.459ValArg: 1.459 ± 0.871
2.625ValSer: 2.625 ± 0.696
4.667ValThr: 4.667 ± 1.874
4.376ValVal: 4.376 ± 1.073
0.875ValTrp: 0.875 ± 0.43
2.334ValTyr: 2.334 ± 0.79
0.0ValXaa: 0.0 ± 0.0
Trp
0.583TrpAla: 0.583 ± 0.418
0.0TrpCys: 0.0 ± 0.0
1.459TrpAsp: 1.459 ± 0.712
2.042TrpGlu: 2.042 ± 0.775
0.0TrpPhe: 0.0 ± 0.0
1.459TrpGly: 1.459 ± 1.272
0.0TrpHis: 0.0 ± 0.0
1.75TrpIle: 1.75 ± 0.856
0.583TrpLys: 0.583 ± 0.372
2.042TrpLeu: 2.042 ± 0.498
0.583TrpMet: 0.583 ± 0.636
1.167TrpAsn: 1.167 ± 0.973
1.75TrpPro: 1.75 ± 0.609
0.583TrpGln: 0.583 ± 0.233
2.042TrpArg: 2.042 ± 0.498
0.0TrpSer: 0.0 ± 0.0
1.75TrpThr: 1.75 ± 0.461
0.875TrpVal: 0.875 ± 0.408
0.583TrpTrp: 0.583 ± 0.301
1.459TrpTyr: 1.459 ± 0.42
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.625TyrAla: 2.625 ± 0.915
1.167TyrCys: 1.167 ± 0.743
1.459TyrAsp: 1.459 ± 0.42
0.875TyrGlu: 0.875 ± 0.528
1.167TyrPhe: 1.167 ± 0.648
1.167TyrGly: 1.167 ± 0.519
0.583TyrHis: 0.583 ± 0.418
2.917TyrIle: 2.917 ± 0.756
1.75TyrLys: 1.75 ± 0.842
3.792TyrLeu: 3.792 ± 0.932
0.583TyrMet: 0.583 ± 0.325
2.334TyrAsn: 2.334 ± 0.71
2.042TyrPro: 2.042 ± 0.565
1.167TyrGln: 1.167 ± 0.554
1.459TyrArg: 1.459 ± 0.641
2.625TyrSer: 2.625 ± 1.284
1.75TyrThr: 1.75 ± 0.84
2.625TyrVal: 2.625 ± 0.729
0.292TyrTrp: 0.292 ± 0.209
2.625TyrTyr: 2.625 ± 0.95
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3429 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski