Amino acid dipeptide frequency for Human immunodeficiency virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
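As a rough illustration of how such frequencies can be obtained, the following Python sketch counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It is not the code used to build this page: the function name dipeptide_per_mille, the one-letter input alphabet, and the choice to normalise by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is treated as X (Xaa).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: frequencies are normalised by the total number of
    dipeptides observed; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown letters to X.
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2 within one protein.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAC", "AC"])
    print(round(freqs["AC"], 3), round(freqs["AA"], 3))
```

The toy input contains three dipeptides in total (AA, AC, AC), so the two printed values add up to 1000‰.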

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table for better visibility, and those that are underrepresented are marked in blue.
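The arithmetic behind the uniform expectation can be sketched in a few lines of Python. The helper names and the plain greater-than/less-than comparison are assumptions for illustration only; the page does not state the exact rule it uses to choose the colours.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all
    alphabet_size**2 ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def representation(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a simple comparison without any tolerance band.
    """
    if observed_per_mille > expected_per_mille:
        return "over"    # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"   # would be marked blue
    return "expected"


if __name__ == "__main__":
    expected = expected_uniform_per_mille(20)   # 1000 / 400
    print(expected)
    # Example values taken from the table: LeuLeu and CysCys.
    print(representation(8.895, expected), representation(0.318, expected))
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰ per dipeptide.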

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.494 ± 0.825
AlaCys: 1.906 ± 0.561
AlaAsp: 2.541 ± 1.059
AlaGlu: 4.13 ± 0.812
AlaPhe: 0.953 ± 0.505
AlaGly: 3.812 ± 0.926
AlaHis: 1.271 ± 0.547
AlaIle: 5.083 ± 2.101
AlaLys: 3.494 ± 1.049
AlaLeu: 6.353 ± 0.621
AlaMet: 1.906 ± 0.538
AlaAsn: 1.906 ± 0.746
AlaPro: 1.588 ± 0.702
AlaGln: 2.541 ± 1.196
AlaArg: 3.812 ± 1.265
AlaSer: 4.13 ± 0.663
AlaThr: 3.494 ± 0.994
AlaVal: 4.765 ± 1.235
AlaTrp: 0.953 ± 0.485
AlaTyr: 0.953 ± 0.439
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.953 ± 0.64
CysCys: 0.318 ± 0.433
CysAsp: 0.635 ± 0.441
CysGlu: 0.318 ± 0.228
CysPhe: 1.588 ± 1.252
CysGly: 1.906 ± 0.76
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.953 ± 0.411
CysLeu: 0.635 ± 0.417
CysMet: 0.318 ± 0.382
CysAsn: 1.588 ± 0.935
CysPro: 0.318 ± 0.247
CysGln: 1.271 ± 0.627
CysArg: 1.271 ± 0.983
CysSer: 2.224 ± 1.174
CysThr: 2.541 ± 0.572
CysVal: 1.588 ± 0.588
CysTrp: 0.635 ± 0.34
CysTyr: 0.318 ± 0.433
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.588 ± 0.526
AspCys: 2.859 ± 1.216
AspAsp: 1.906 ± 0.614
AspGlu: 1.271 ± 0.8
AspPhe: 1.271 ± 0.932
AspGly: 1.906 ± 0.614
AspHis: 0.635 ± 0.452
AspIle: 3.494 ± 1.181
AspLys: 3.177 ± 1.346
AspLeu: 3.812 ± 1.048
AspMet: 0.635 ± 0.267
AspAsn: 2.224 ± 0.83
AspPro: 3.494 ± 0.989
AspGln: 1.588 ± 0.588
AspArg: 4.765 ± 1.729
AspSer: 3.177 ± 1.298
AspThr: 3.177 ± 0.697
AspVal: 0.635 ± 0.273
AspTrp: 0.953 ± 0.717
AspTyr: 0.953 ± 0.514
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.4 ± 0.863
GluCys: 0.0 ± 0.0
GluAsp: 2.541 ± 1.383
GluGlu: 6.353 ± 2.141
GluPhe: 1.271 ± 0.663
GluGly: 5.083 ± 1.078
GluHis: 0.953 ± 0.684
GluIle: 4.765 ± 1.584
GluLys: 4.13 ± 0.899
GluLeu: 6.989 ± 1.238
GluMet: 1.271 ± 0.638
GluAsn: 2.224 ± 0.718
GluPro: 4.765 ± 1.156
GluGln: 2.541 ± 0.775
GluArg: 3.494 ± 1.043
GluSer: 3.177 ± 0.827
GluThr: 4.765 ± 1.276
GluVal: 4.765 ± 0.537
GluTrp: 1.906 ± 0.697
GluTyr: 0.635 ± 0.383
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.271 ± 0.383
PheCys: 0.635 ± 0.273
PheAsp: 1.271 ± 1.095
PheGlu: 0.318 ± 0.247
PhePhe: 0.953 ± 0.359
PheGly: 1.271 ± 0.36
PheHis: 0.635 ± 0.452
PheIle: 1.588 ± 0.696
PheLys: 1.588 ± 0.776
PheLeu: 2.859 ± 0.607
PheMet: 0.0 ± 0.0
PheAsn: 2.859 ± 1.139
PhePro: 2.224 ± 1.216
PheGln: 0.953 ± 0.439
PheArg: 2.859 ± 0.949
PheSer: 2.224 ± 0.76
PheThr: 1.271 ± 0.487
PheVal: 0.318 ± 0.228
PheTrp: 0.635 ± 0.341
PheTyr: 0.953 ± 0.265
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.718 ± 1.031
GlyCys: 1.588 ± 0.603
GlyAsp: 3.177 ± 0.826
GlyGlu: 3.812 ± 0.668
GlyPhe: 2.224 ± 0.674
GlyGly: 6.989 ± 1.047
GlyHis: 2.224 ± 2.115
GlyIle: 6.036 ± 1.996
GlyLys: 4.447 ± 1.357
GlyLeu: 6.036 ± 1.396
GlyMet: 0.318 ± 0.228
GlyAsn: 4.13 ± 1.446
GlyPro: 3.494 ± 0.788
GlyGln: 3.177 ± 1.24
GlyArg: 4.13 ± 0.699
GlySer: 4.13 ± 1.17
GlyThr: 4.13 ± 2.374
GlyVal: 3.494 ± 1.268
GlyTrp: 1.588 ± 0.474
GlyTyr: 1.906 ± 0.785
GlyXaa: 0.318 ± 0.228

His
HisAla: 1.271 ± 0.383
HisCys: 1.271 ± 0.519
HisAsp: 0.318 ± 0.247
HisGlu: 0.318 ± 0.228
HisPhe: 0.635 ± 0.859
HisGly: 2.224 ± 0.893
HisHis: 0.953 ± 1.008
HisIle: 1.271 ± 0.639
HisLys: 1.271 ± 0.559
HisLeu: 2.541 ± 0.877
HisMet: 0.953 ± 0.801
HisAsn: 1.271 ± 0.63
HisPro: 2.541 ± 0.894
HisGln: 1.906 ± 1.397
HisArg: 1.271 ± 0.559
HisSer: 0.635 ± 0.438
HisThr: 0.953 ± 0.494
HisVal: 0.318 ± 0.228
HisTrp: 0.0 ± 0.0
HisTyr: 0.953 ± 0.874
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.224 ± 0.437
IleCys: 1.271 ± 0.547
IleAsp: 2.859 ± 0.547
IleGlu: 4.13 ± 1.07
IlePhe: 1.271 ± 0.697
IleGly: 6.036 ± 2.096
IleHis: 2.541 ± 0.789
IleIle: 6.036 ± 1.335
IleLys: 5.4 ± 1.032
IleLeu: 6.989 ± 2.117
IleMet: 0.953 ± 0.481
IleAsn: 1.271 ± 0.383
IlePro: 5.083 ± 1.092
IleGln: 3.177 ± 1.448
IleArg: 5.083 ± 1.737
IleSer: 2.859 ± 0.726
IleThr: 2.541 ± 1.393
IleVal: 7.624 ± 2.209
IleTrp: 1.588 ± 0.694
IleTyr: 2.224 ± 0.535
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.765 ± 1.391
LysCys: 1.906 ± 0.76
LysAsp: 2.859 ± 1.054
LysGlu: 6.353 ± 1.799
LysPhe: 0.635 ± 0.383
LysGly: 5.083 ± 1.063
LysHis: 1.271 ± 0.685
LysIle: 8.577 ± 1.375
LysLys: 7.942 ± 2.462
LysLeu: 6.353 ± 1.795
LysMet: 0.318 ± 0.228
LysAsn: 2.859 ± 1.02
LysPro: 2.224 ± 0.868
LysGln: 4.13 ± 1.405
LysArg: 2.859 ± 0.639
LysSer: 1.271 ± 0.679
LysThr: 2.859 ± 0.843
LysVal: 4.13 ± 1.499
LysTrp: 2.224 ± 0.602
LysTyr: 2.859 ± 0.666
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.447 ± 1.217
LeuCys: 0.953 ± 0.468
LeuAsp: 4.13 ± 1.042
LeuGlu: 5.4 ± 1.273
LeuPhe: 2.859 ± 0.964
LeuGly: 7.624 ± 1.577
LeuHis: 2.859 ± 1.128
LeuIle: 3.812 ± 1.329
LeuLys: 7.624 ± 0.903
LeuLeu: 8.895 ± 3.289
LeuMet: 0.635 ± 0.626
LeuAsn: 3.494 ± 1.074
LeuPro: 2.224 ± 1.1
LeuGln: 4.765 ± 1.618
LeuArg: 4.13 ± 0.979
LeuSer: 4.13 ± 0.773
LeuThr: 5.083 ± 1.209
LeuVal: 5.4 ± 1.478
LeuTrp: 2.859 ± 0.974
LeuTyr: 1.588 ± 0.646
LeuXaa: 0.318 ± 0.228

Met
MetAla: 1.271 ± 0.738
MetCys: 0.318 ± 0.247
MetAsp: 0.635 ± 0.456
MetGlu: 1.588 ± 0.747
MetPhe: 0.635 ± 0.341
MetGly: 1.588 ± 0.539
MetHis: 0.635 ± 0.273
MetIle: 0.953 ± 0.397
MetLys: 0.318 ± 0.351
MetLeu: 0.953 ± 0.647
MetMet: 0.953 ± 1.053
MetAsn: 0.635 ± 0.273
MetPro: 0.0 ± 0.0
MetGln: 0.635 ± 0.702
MetArg: 1.906 ± 0.465
MetSer: 0.953 ± 0.717
MetThr: 2.224 ± 0.995
MetVal: 0.635 ± 0.341
MetTrp: 0.635 ± 0.417
MetTyr: 0.953 ± 0.519
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.271 ± 0.682
AsnCys: 2.541 ± 1.022
AsnAsp: 0.953 ± 0.46
AsnGlu: 3.494 ± 1.056
AsnPhe: 3.177 ± 0.838
AsnGly: 0.953 ± 0.468
AsnHis: 0.0 ± 0.0
AsnIle: 2.541 ± 1.006
AsnLys: 2.859 ± 0.784
AsnLeu: 2.859 ± 0.965
AsnMet: 0.635 ± 0.341
AsnAsn: 5.083 ± 2.004
AsnPro: 3.494 ± 1.024
AsnGln: 0.635 ± 0.273
AsnArg: 2.541 ± 0.475
AsnSer: 3.177 ± 0.866
AsnThr: 4.13 ± 1.876
AsnVal: 1.906 ± 0.936
AsnTrp: 2.541 ± 1.006
AsnTyr: 0.953 ± 0.485
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.224 ± 0.662
ProCys: 0.953 ± 0.586
ProAsp: 2.224 ± 0.662
ProGlu: 3.812 ± 0.845
ProPhe: 1.271 ± 0.567
ProGly: 4.447 ± 1.583
ProHis: 0.318 ± 0.228
ProIle: 5.718 ± 1.297
ProLys: 3.177 ± 1.31
ProLeu: 4.765 ± 1.372
ProMet: 1.271 ± 0.813
ProAsn: 1.588 ± 0.993
ProPro: 4.13 ± 1.568
ProGln: 4.13 ± 0.989
ProArg: 2.224 ± 0.952
ProSer: 3.812 ± 1.794
ProThr: 2.541 ± 0.572
ProVal: 4.765 ± 1.051
ProTrp: 1.271 ± 0.835
ProTyr: 0.953 ± 0.684
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.812 ± 0.953
GlnCys: 0.318 ± 0.247
GlnAsp: 1.906 ± 0.746
GlnGlu: 4.447 ± 0.855
GlnPhe: 0.318 ± 0.254
GlnGly: 4.447 ± 1.092
GlnHis: 1.271 ± 0.783
GlnIle: 4.447 ± 1.204
GlnLys: 4.13 ± 1.69
GlnLeu: 5.718 ± 1.438
GlnMet: 2.541 ± 1.503
GlnAsn: 2.859 ± 1.477
GlnPro: 2.224 ± 0.897
GlnGln: 3.177 ± 1.186
GlnArg: 3.812 ± 1.512
GlnSer: 1.271 ± 0.487
GlnThr: 2.541 ± 0.491
GlnVal: 2.859 ± 1.441
GlnTrp: 0.635 ± 0.456
GlnTyr: 1.906 ± 1.005
GlnXaa: 0.635 ± 0.494

Arg
ArgAla: 5.4 ± 1.705
ArgCys: 0.318 ± 0.453
ArgAsp: 4.447 ± 0.913
ArgGlu: 5.083 ± 0.936
ArgPhe: 1.271 ± 1.127
ArgGly: 3.177 ± 1.026
ArgHis: 1.588 ± 1.277
ArgIle: 5.718 ± 2.104
ArgLys: 4.765 ± 1.35
ArgLeu: 2.224 ± 0.9
ArgMet: 1.906 ± 0.682
ArgAsn: 0.953 ± 0.265
ArgPro: 3.812 ± 1.174
ArgGln: 5.083 ± 1.337
ArgArg: 3.177 ± 1.844
ArgSer: 2.859 ± 0.816
ArgThr: 1.906 ± 1.029
ArgVal: 2.224 ± 0.776
ArgTrp: 2.541 ± 0.97
ArgTyr: 0.635 ± 0.273
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.859 ± 0.554
SerCys: 0.318 ± 0.228
SerAsp: 2.541 ± 0.475
SerGlu: 3.812 ± 0.659
SerPhe: 2.224 ± 0.776
SerGly: 4.447 ± 1.492
SerHis: 0.635 ± 0.702
SerIle: 3.494 ± 0.819
SerLys: 2.859 ± 1.505
SerLeu: 5.4 ± 1.505
SerMet: 0.635 ± 0.456
SerAsn: 1.906 ± 0.897
SerPro: 3.177 ± 1.042
SerGln: 3.812 ± 1.204
SerArg: 2.859 ± 1.283
SerSer: 4.765 ± 1.217
SerThr: 4.13 ± 1.7
SerVal: 1.588 ± 0.51
SerTrp: 1.271 ± 0.644
SerTyr: 0.635 ± 0.452
SerXaa: 0.318 ± 0.489

Thr
ThrAla: 4.13 ± 0.833
ThrCys: 0.0 ± 0.0
ThrAsp: 2.859 ± 0.878
ThrGlu: 6.671 ± 1.238
ThrPhe: 1.271 ± 0.7
ThrGly: 4.447 ± 0.704
ThrHis: 1.588 ± 0.588
ThrIle: 2.541 ± 0.798
ThrLys: 3.494 ± 0.992
ThrLeu: 4.765 ± 1.625
ThrMet: 0.953 ± 0.575
ThrAsn: 3.494 ± 1.122
ThrPro: 4.13 ± 0.812
ThrGln: 3.177 ± 0.841
ThrArg: 2.224 ± 0.774
ThrSer: 2.859 ± 1.036
ThrThr: 5.718 ± 1.871
ThrVal: 4.13 ± 1.128
ThrTrp: 1.588 ± 0.51
ThrTyr: 0.953 ± 0.691
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.13 ± 1.372
ValCys: 0.318 ± 0.433
ValAsp: 3.177 ± 1.149
ValGlu: 2.859 ± 0.733
ValPhe: 1.271 ± 0.468
ValGly: 4.447 ± 0.926
ValHis: 2.224 ± 1.063
ValIle: 3.177 ± 0.785
ValLys: 5.4 ± 0.973
ValLeu: 3.177 ± 0.798
ValMet: 0.318 ± 0.464
ValAsn: 2.224 ± 0.72
ValPro: 4.13 ± 0.87
ValGln: 4.13 ± 1.515
ValArg: 3.494 ± 1.268
ValSer: 3.494 ± 1.09
ValThr: 3.177 ± 1.452
ValVal: 4.765 ± 1.184
ValTrp: 1.906 ± 0.833
ValTyr: 1.588 ± 0.61
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.588 ± 0.408
TrpCys: 0.318 ± 0.392
TrpAsp: 1.271 ± 0.684
TrpGlu: 1.906 ± 0.572
TrpPhe: 0.318 ± 0.247
TrpGly: 1.906 ± 0.655
TrpHis: 0.318 ± 0.453
TrpIle: 0.953 ± 0.265
TrpLys: 2.541 ± 0.695
TrpLeu: 0.953 ± 0.564
TrpMet: 1.271 ± 0.559
TrpAsn: 2.224 ± 1.323
TrpPro: 1.588 ± 0.358
TrpGln: 2.224 ± 0.846
TrpArg: 1.906 ± 0.605
TrpSer: 0.953 ± 0.564
TrpThr: 2.224 ± 0.934
TrpVal: 1.906 ± 0.674
TrpTrp: 1.271 ± 0.547
TrpTyr: 0.953 ± 0.468
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.588 ± 0.694
TyrCys: 1.271 ± 0.581
TyrAsp: 0.953 ± 0.439
TyrGlu: 0.953 ± 0.691
TyrPhe: 1.271 ± 0.586
TyrGly: 1.271 ± 0.535
TyrHis: 0.635 ± 0.273
TyrIle: 0.953 ± 0.43
TyrLys: 1.906 ± 0.588
TyrLeu: 0.953 ± 0.397
TyrMet: 0.318 ± 0.228
TyrAsn: 0.953 ± 0.684
TyrPro: 0.953 ± 0.844
TyrGln: 1.906 ± 1.126
TyrArg: 1.271 ± 0.4
TyrSer: 1.588 ± 0.49
TyrThr: 1.271 ± 0.482
TyrVal: 1.588 ± 0.66
TyrTrp: 1.271 ± 0.493
TyrTyr: 1.271 ± 0.433
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.318 ± 0.489
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.318 ± 0.228
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.318 ± 0.247
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.318 ± 0.247
XaaVal: 0.0 ± 0.0
XaaTrp: 0.318 ± 0.228
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 9 proteins (3149 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
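One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. The exact procedure behind this page is not described, so the function name bootstrap_dipeptide_error, the use of the population standard deviation, and the fixed random seed are all assumptions.

```python
import random
import statistics


def bootstrap_dipeptide_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) in per mille of the frequency of
    `pair` over `n_rounds` protein-level bootstrap resamples.

    Assumption: sequences use the same alphabet as `pair` (one-letter codes
    here) and the error is the standard deviation of the resampled estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins, not individual residues or dipeptides.
        sample = rng.choices(sequences, k=len(sequences))
        hits = sum(
            seq[i:i + 2] == pair
            for seq in sample
            for i in range(len(seq) - 1)
        )
        total = sum(max(len(seq) - 1, 0) for seq in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


if __name__ == "__main__":
    proteins = ["MKLLVA", "MLLKKA", "MAVLLG"]  # toy data, not from this proteome
    mean, err = bootstrap_dipeptide_error(proteins, "LL")
    print(f"LL: {mean:.3f} ± {err:.3f} ‰")
```

Resampling at the protein level keeps each protein's internal composition intact, so the error reflects variation between proteins; with only 9 proteins, as here, such estimates are necessarily coarse.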

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski