Amino acid dipeptide frequency for Simian immunodeficiency virus SIV-mnd 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
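As an illustration of the calculation, the following minimal Python sketch counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions made for this example; the page does not state the exact convention used by the database.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: overlapping pairs are counted within each sequence only,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 ordered pairs, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

# Toy input, not real SIV-mnd 2 sequences.
freqs = dipeptide_permille(["MAAK", "AAB"])
print(len(freqs))   # 441
print(freqs["AA"])  # 400.0 (2 of the 5 pairs)
```

The non-standard letter "B" in the toy input is mapped to Xaa, so the pair "AB" is reported as "AX".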

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
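The arithmetic behind the expected value can be sketched in a few lines of Python. The `classify` helper and the use of 1/400 as the cutoff are assumptions for illustration; the page does not say exactly which threshold drives the red and blue marking.

```python
# Uniform expectation: every dipeptide equally likely.
N_STANDARD = 20 * 20   # 400 dipeptides
N_WITH_XAA = 21 * 21   # 441 dipeptides when Xaa is included

expected_permille = 1000.0 / N_STANDARD       # 2.5 per mille, i.e. 0.25%
expected_permille_xaa = 1000.0 / N_WITH_XAA   # about 2.268 per mille

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (hypothetical helper)."""
    if observed_permille > expected:
        return "over"    # would be marked red
    if observed_permille < expected:
        return "under"   # would be marked blue
    return "expected"

# Values taken from the table: GluLys 9.015, CysMet 0.0
print(classify(9.015))  # over
print(classify(0.0))    # under
```

Because the table is given in per mille, the 0.25% expectation corresponds to 2.5 on the table's scale.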

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
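A short sketch of the unit conversion, using the AlaAla value from the table; the helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0

# AlaAla is listed as 2.254 per mille.
print(round(permille_to_fraction(2.254), 6))  # 0.002254
print(round(permille_to_percent(2.254), 4))   # 0.2254
```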

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency in ‰ ± error.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 2.254 ± 0.886 | 1.288 ± 0.719 | 1.61 ± 0.361 | 5.473 ± 1.137 | 0.644 ± 0.306 | 2.898 ± 0.691 | 1.61 ± 0.647 | 2.898 ± 0.575 | 3.22 ± 1.313 | 6.761 ± 1.056 | 2.576 ± 1.079 | 0.966 ± 0.293 | 2.576 ± 0.806 | 2.898 ± 0.605 | 2.898 ± 1.084 | 3.22 ± 0.849 | 2.898 ± 1.244 | 5.795 ± 0.626 | 0.966 ± 0.293 | 1.61 ± 0.523 | 0.0 ± 0.0 |
| Cys | 0.966 ± 0.801 | 0.966 ± 0.789 | 0.322 ± 0.509 | 0.644 ± 0.334 | 1.288 ± 0.671 | 1.288 ± 0.743 | 0.322 ± 0.267 | 1.61 ± 0.793 | 1.932 ± 0.64 | 0.966 ± 0.558 | 0.0 ± 0.0 | 1.932 ± 1.046 | 2.254 ± 1.281 | 1.288 ± 0.409 | 1.61 ± 0.717 | 1.288 ± 0.507 | 1.932 ± 0.969 | 1.932 ± 1.115 | 0.322 ± 0.246 | 1.61 ± 1.313 | 0.0 ± 0.0 |
| Asp | 1.61 ± 0.401 | 1.61 ± 0.623 | 0.966 ± 0.451 | 2.898 ± 0.496 | 1.61 ± 0.906 | 1.61 ± 0.942 | 0.966 ± 0.293 | 1.932 ± 0.676 | 2.576 ± 1.036 | 4.185 ± 0.965 | 0.322 ± 0.448 | 1.61 ± 0.671 | 2.576 ± 0.798 | 2.898 ± 0.617 | 2.898 ± 1.739 | 2.898 ± 1.09 | 2.576 ± 0.942 | 1.932 ± 0.394 | 1.288 ± 0.791 | 2.576 ± 0.879 | 0.0 ± 0.0 |
| Glu | 5.795 ± 1.78 | 0.644 ± 0.954 | 2.898 ± 0.59 | 8.371 ± 2.45 | 1.61 ± 0.797 | 6.117 ± 2.247 | 1.288 ± 0.848 | 3.22 ± 0.628 | 9.015 ± 2.641 | 5.473 ± 1.792 | 0.644 ± 0.372 | 3.863 ± 0.493 | 3.22 ± 0.6 | 4.507 ± 0.75 | 4.185 ± 0.844 | 1.61 ± 0.807 | 3.542 ± 0.947 | 3.863 ± 0.959 | 2.576 ± 0.698 | 2.254 ± 0.83 | 0.0 ± 0.0 |
| Phe | 0.644 ± 0.534 | 1.288 ± 0.553 | 0.966 ± 0.582 | 0.966 ± 0.451 | 1.61 ± 0.706 | 1.61 ± 1.005 | 0.966 ± 0.794 | 0.644 ± 0.547 | 1.932 ± 0.588 | 2.576 ± 1.853 | 0.0 ± 0.0 | 0.966 ± 0.484 | 1.932 ± 0.91 | 0.966 ± 0.549 | 3.863 ± 1.0 | 0.966 ± 0.486 | 2.576 ± 0.945 | 0.966 ± 0.451 | 0.322 ± 0.246 | 0.966 ± 0.293 | 0.0 ± 0.0 |
| Gly | 4.829 ± 1.27 | 2.576 ± 1.077 | 2.898 ± 0.705 | 3.542 ± 0.911 | 1.288 ± 0.885 | 5.151 ± 1.178 | 1.61 ± 0.905 | 5.795 ± 1.43 | 8.693 ± 1.297 | 6.761 ± 2.34 | 1.932 ± 1.067 | 1.61 ± 0.466 | 5.795 ± 1.79 | 2.898 ± 1.241 | 2.254 ± 0.865 | 4.507 ± 0.829 | 3.22 ± 0.585 | 2.576 ± 0.705 | 1.288 ± 0.718 | 2.898 ± 1.32 | 0.0 ± 0.0 |
| His | 1.932 ± 1.089 | 0.322 ± 0.509 | 1.61 ± 0.77 | 0.644 ± 0.334 | 1.61 ± 1.198 | 0.966 ± 0.486 | 0.322 ± 0.246 | 0.966 ± 0.711 | 2.254 ± 0.851 | 2.576 ± 0.667 | 0.644 ± 0.643 | 1.288 ± 0.672 | 1.932 ± 1.278 | 1.288 ± 0.828 | 0.322 ± 0.394 | 1.288 ± 0.516 | 1.288 ± 0.672 | 0.644 ± 0.277 | 0.966 ± 1.125 | 1.288 ± 0.778 | 0.0 ± 0.0 |
| Ile | 3.22 ± 0.786 | 0.322 ± 0.246 | 2.898 ± 1.102 | 4.507 ± 0.776 | 0.966 ± 0.484 | 3.863 ± 1.618 | 1.288 ± 0.746 | 5.151 ± 1.171 | 3.22 ± 1.212 | 3.863 ± 0.662 | 0.644 ± 0.372 | 1.61 ± 0.697 | 4.185 ± 1.174 | 2.576 ± 0.73 | 3.863 ± 0.493 | 3.22 ± 1.284 | 2.254 ± 1.079 | 5.473 ± 1.248 | 2.254 ± 0.491 | 1.932 ± 0.447 | 0.0 ± 0.0 |
| Lys | 5.151 ± 1.25 | 1.61 ± 0.868 | 1.932 ± 0.535 | 9.015 ± 2.39 | 2.898 ± 0.774 | 7.405 ± 1.408 | 1.61 ± 0.513 | 5.795 ± 0.743 | 5.795 ± 1.58 | 6.761 ± 1.636 | 1.288 ± 0.374 | 2.576 ± 0.756 | 2.576 ± 0.994 | 4.185 ± 0.944 | 2.254 ± 0.868 | 1.932 ± 1.029 | 3.542 ± 0.8 | 7.083 ± 2.478 | 0.966 ± 0.375 | 5.151 ± 2.02 | 0.0 ± 0.0 |
| Leu | 4.185 ± 0.919 | 1.932 ± 0.787 | 4.185 ± 1.152 | 7.727 ± 0.972 | 2.898 ± 0.976 | 6.117 ± 1.445 | 2.898 ± 0.867 | 4.185 ± 0.968 | 8.049 ± 1.724 | 8.371 ± 1.701 | 1.61 ± 1.058 | 3.22 ± 0.73 | 3.863 ± 1.083 | 5.473 ± 0.43 | 4.829 ± 1.705 | 4.185 ± 1.11 | 3.22 ± 1.086 | 3.863 ± 1.095 | 2.254 ± 0.742 | 0.644 ± 0.547 | 0.0 ± 0.0 |
| Met | 1.61 ± 0.784 | 0.644 ± 0.534 | 0.966 ± 0.591 | 1.932 ± 0.498 | 1.288 ± 0.719 | 1.61 ± 0.357 | 0.966 ± 0.611 | 1.288 ± 0.468 | 0.644 ± 0.657 | 1.288 ± 0.743 | 0.644 ± 0.657 | 0.644 ± 0.715 | 1.932 ± 1.311 | 1.288 ± 0.719 | 0.644 ± 0.547 | 0.644 ± 0.513 | 1.288 ± 0.584 | 0.644 ± 0.715 | 0.322 ± 0.267 | 0.322 ± 0.329 | 0.0 ± 0.0 |
| Asn | 2.576 ± 1.265 | 0.966 ± 0.615 | 1.61 ± 0.989 | 2.898 ± 0.677 | 2.576 ± 0.805 | 2.254 ± 0.344 | 0.322 ± 0.246 | 3.542 ± 1.098 | 2.254 ± 0.954 | 3.22 ± 1.374 | 0.644 ± 0.502 | 2.254 ± 0.62 | 2.576 ± 1.283 | 2.254 ± 0.825 | 2.576 ± 1.064 | 1.932 ± 0.651 | 3.542 ± 1.202 | 2.254 ± 0.781 | 1.61 ± 0.706 | 0.644 ± 0.471 | 0.0 ± 0.0 |
| Pro | 1.932 ± 0.676 | 1.288 ± 1.068 | 1.932 ± 0.497 | 4.185 ± 1.991 | 1.61 ± 0.639 | 4.507 ± 1.066 | 1.288 ± 0.43 | 2.254 ± 0.708 | 3.542 ± 0.975 | 4.829 ± 1.027 | 1.288 ± 0.959 | 2.898 ± 0.648 | 4.185 ± 2.341 | 3.542 ± 0.861 | 4.185 ± 2.123 | 2.898 ± 0.714 | 4.185 ± 1.32 | 3.542 ± 1.24 | 1.288 ± 0.791 | 2.576 ± 0.612 | 0.0 ± 0.0 |
| Gln | 3.542 ± 1.122 | 2.254 ± 0.62 | 1.932 ± 0.594 | 5.151 ± 2.481 | 0.644 ± 0.515 | 4.185 ± 1.398 | 0.644 ± 0.277 | 4.185 ± 1.271 | 6.761 ± 0.813 | 3.863 ± 0.887 | 1.288 ± 0.781 | 2.254 ± 0.699 | 2.254 ± 0.591 | 2.576 ± 1.147 | 1.288 ± 1.177 | 2.576 ± 0.698 | 2.576 ± 0.565 | 2.576 ± 0.789 | 3.22 ± 0.878 | 1.288 ± 0.746 | 0.0 ± 0.0 |
| Arg | 1.61 ± 0.569 | 1.288 ± 0.634 | 3.22 ± 0.752 | 5.795 ± 2.251 | 1.288 ± 0.725 | 4.507 ± 0.811 | 1.288 ± 0.769 | 1.288 ± 0.502 | 2.576 ± 0.838 | 3.863 ± 1.425 | 1.288 ± 0.744 | 2.898 ± 1.074 | 1.61 ± 0.466 | 3.22 ± 0.773 | 3.22 ± 1.785 | 2.898 ± 2.581 | 2.254 ± 0.625 | 3.22 ± 0.983 | 0.644 ± 0.409 | 2.576 ± 0.826 | 0.0 ± 0.0 |
| Ser | 1.288 ± 0.558 | 1.288 ± 0.732 | 2.576 ± 0.55 | 2.898 ± 0.958 | 0.644 ± 0.424 | 4.829 ± 1.364 | 0.0 ± 0.0 | 2.254 ± 0.573 | 4.507 ± 0.882 | 4.829 ± 1.793 | 1.288 ± 0.605 | 0.966 ± 0.438 | 2.254 ± 0.964 | 2.898 ± 1.155 | 3.863 ± 1.576 | 3.863 ± 2.018 | 3.542 ± 1.647 | 3.22 ± 0.929 | 1.288 ± 0.502 | 1.288 ± 1.044 | 0.0 ± 0.0 |
| Thr | 5.151 ± 1.117 | 0.966 ± 0.544 | 3.863 ± 0.917 | 2.898 ± 1.382 | 0.322 ± 0.448 | 3.542 ± 0.679 | 2.254 ± 0.912 | 2.576 ± 0.891 | 3.863 ± 1.938 | 2.576 ± 1.242 | 0.644 ± 0.534 | 2.898 ± 1.032 | 5.151 ± 0.929 | 2.898 ± 0.72 | 0.644 ± 0.334 | 4.185 ± 1.301 | 7.727 ± 3.183 | 4.185 ± 1.688 | 3.22 ± 0.794 | 1.288 ± 0.777 | 0.0 ± 0.0 |
| Val | 3.863 ± 0.991 | 1.61 ± 0.718 | 3.542 ± 1.015 | 2.898 ± 0.917 | 0.644 ± 0.329 | 4.507 ± 0.829 | 1.61 ± 0.696 | 5.151 ± 1.329 | 4.185 ± 1.161 | 5.795 ± 1.792 | 1.288 ± 0.844 | 2.576 ± 0.991 | 3.863 ± 0.865 | 3.863 ± 1.501 | 1.932 ± 0.969 | 2.898 ± 0.539 | 4.507 ± 1.091 | 1.932 ± 0.902 | 1.288 ± 0.451 | 2.254 ± 0.759 | 0.0 ± 0.0 |
| Trp | 0.966 ± 0.451 | 0.322 ± 0.329 | 0.966 ± 0.375 | 0.322 ± 0.246 | 0.322 ± 0.267 | 3.22 ± 0.896 | 1.288 ± 0.92 | 1.288 ± 0.621 | 1.932 ± 1.097 | 2.254 ± 1.187 | 0.966 ± 0.293 | 2.254 ± 0.892 | 1.288 ± 0.672 | 1.61 ± 0.602 | 1.288 ± 0.553 | 1.288 ± 0.732 | 2.576 ± 0.586 | 1.61 ± 0.623 | 1.288 ± 0.468 | 0.644 ± 0.277 | 0.0 ± 0.0 |
| Tyr | 1.932 ± 0.897 | 1.288 ± 0.75 | 0.644 ± 0.577 | 1.932 ± 0.93 | 0.966 ± 0.375 | 1.932 ± 0.801 | 1.61 ± 1.027 | 1.288 ± 0.507 | 2.576 ± 1.666 | 2.898 ± 1.434 | 1.288 ± 0.451 | 3.22 ± 0.874 | 1.932 ± 0.846 | 2.254 ± 0.699 | 1.932 ± 0.97 | 1.288 ± 0.553 | 1.61 ± 0.845 | 2.898 ± 1.374 | 0.322 ± 0.246 | 1.932 ± 0.634 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 8 proteins (3107 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
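To make the note concrete, here is a minimal sketch of a protein-level bootstrap: whole proteins are resampled with replacement 100 times and the spread of the resulting frequencies is taken as the error. Using the standard deviation of the replicates as the reported error, the function names, and the toy sequences are all assumptions; the page does not describe the procedure in more detail.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Spread of the per mille frequency across protein-level resamples.

    Assumption: the error is the standard deviation of the bootstrap replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Toy input, not real SIV-mnd 2 sequences.
toy = ["MAAK", "AAAG", "MKKL"]
print(bootstrap_error(toy, "AA"))
print(bootstrap_error(toy, "WW"))  # 0.0, the pair never occurs
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, which is why a dipeptide concentrated in a single protein receives a large error.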

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski