Amino acid dipeptide frequency for Human immunodeficiency virus type 1 group M subtype D (isolate ELI) (HIV-1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
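The counting described above can be sketched in a few lines of Python. This is a minimal illustration and not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, and the choice to count overlapping pairs within each protein only (never across protein boundaries) are assumptions made here.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the table's column order.
# "X" stands for Xaa (any non-standard or unknown residue).
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}


def dipeptide_frequencies(sequences):
    """Return {dipeptide name: frequency in per mille} over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        residues = [aa if aa in THREE_LETTER else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs within one protein (an assumption).
        counts.update(zip(residues, residues[1:]))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(THREE_LETTER, repeat=2):
        name = THREE_LETTER[a] + THREE_LETTER[b]
        freqs[name] = 1000.0 * counts[(a, b)] / total if total else 0.0
    return freqs


# Hypothetical toy sequences, not the HIV-1 proteins behind the table.
toy = ["MKKLLA", "AAKL"]
freqs = dipeptide_frequencies(toy)
print(len(freqs))          # number of dipeptide classes, Xaa included
print(freqs["LysLeu"])     # per-mille frequency of Lys-Leu in the toy set
```

The result always contains all 441 keys, so dipeptides that never occur show up with a frequency of 0.0, as in the Xaa rows of the table.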

All values are presented in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
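As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and compares it with the uniform expectation of 0.25%, which equals 2.5‰. The helper names are hypothetical, and using the uniform expectation as the threshold for "over" and "under" is an assumption based on the description above.

```python
# Expected per-mille frequency if all dipeptides were equally likely.
EXPECTED_400 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
EXPECTED_441 = 1000.0 / 441  # about 2.268 per mille when Xaa is included


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_400):
    """Label a per-mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# AlaAla (4.108) and AlaTyr (0.548) are taken from the table.
print(per_mille_to_fraction(4.108))
print(classify(4.108), classify(0.548))
```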

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.108 ± 0.943
AlaCys: 1.643 ± 0.504
AlaAsp: 2.191 ± 0.726
AlaGlu: 6.3 ± 1.603
AlaPhe: 1.369 ± 0.508
AlaGly: 4.382 ± 0.956
AlaHis: 1.917 ± 0.728
AlaIle: 6.574 ± 1.307
AlaLys: 1.917 ± 0.489
AlaLeu: 5.752 ± 0.925
AlaMet: 2.465 ± 0.826
AlaAsn: 1.917 ± 0.761
AlaPro: 3.287 ± 1.017
AlaGln: 1.917 ± 0.32
AlaArg: 4.382 ± 1.167
AlaSer: 4.108 ± 0.847
AlaThr: 3.287 ± 0.962
AlaVal: 3.835 ± 1.047
AlaTrp: 1.917 ± 0.478
AlaTyr: 0.548 ± 0.378
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.096 ± 0.766
CysCys: 0.548 ± 0.631
CysAsp: 0.274 ± 0.189
CysGlu: 0.274 ± 0.35
CysPhe: 1.369 ± 0.82
CysGly: 1.643 ± 0.481
CysHis: 0.548 ± 0.631
CysIle: 0.0 ± 0.0
CysLys: 1.917 ± 0.716
CysLeu: 0.548 ± 0.45
CysMet: 0.274 ± 0.33
CysAsn: 1.369 ± 0.956
CysPro: 0.822 ± 0.672
CysGln: 1.643 ± 0.4
CysArg: 1.643 ± 0.643
CysSer: 1.917 ± 1.176
CysThr: 2.191 ± 0.545
CysVal: 1.643 ± 0.504
CysTrp: 0.822 ± 0.368
CysTyr: 0.822 ± 0.774
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.822 ± 0.423
AspCys: 3.013 ± 0.824
AspAsp: 1.369 ± 0.517
AspGlu: 1.369 ± 0.431
AspPhe: 1.096 ± 0.756
AspGly: 1.369 ± 0.466
AspHis: 0.0 ± 0.0
AspIle: 3.561 ± 0.524
AspLys: 3.561 ± 0.987
AspLeu: 3.287 ± 0.817
AspMet: 0.822 ± 0.328
AspAsn: 1.643 ± 0.846
AspPro: 3.835 ± 2.148
AspGln: 1.643 ± 0.468
AspArg: 3.287 ± 0.975
AspSer: 1.917 ± 0.684
AspThr: 2.465 ± 0.834
AspVal: 1.643 ± 0.736
AspTrp: 0.274 ± 0.307
AspTyr: 0.822 ± 0.368
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.478 ± 0.813
GluCys: 0.0 ± 0.0
GluAsp: 2.465 ± 0.927
GluGlu: 7.395 ± 1.203
GluPhe: 1.096 ± 0.347
GluGly: 4.93 ± 0.755
GluHis: 1.096 ± 0.556
GluIle: 4.382 ± 0.718
GluLys: 5.478 ± 0.941
GluLeu: 6.574 ± 1.124
GluMet: 1.643 ± 0.571
GluAsn: 1.917 ± 0.568
GluPro: 4.382 ± 0.799
GluGln: 3.561 ± 1.113
GluArg: 5.478 ± 1.117
GluSer: 4.108 ± 0.944
GluThr: 5.204 ± 1.217
GluVal: 2.739 ± 0.706
GluTrp: 1.917 ± 0.725
GluTyr: 0.822 ± 0.503
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.822 ± 0.423
PheCys: 0.274 ± 0.243
PheAsp: 0.548 ± 0.382
PheGlu: 0.548 ± 0.375
PhePhe: 0.274 ± 0.243
PheGly: 1.369 ± 0.936
PheHis: 0.0 ± 0.0
PheIle: 1.369 ± 0.502
PheLys: 1.369 ± 0.368
PheLeu: 3.287 ± 0.609
PheMet: 0.0 ± 0.0
PheAsn: 2.739 ± 0.867
PhePro: 1.643 ± 0.737
PheGln: 0.548 ± 0.22
PheArg: 2.465 ± 0.839
PheSer: 2.465 ± 0.468
PheThr: 0.822 ± 0.567
PheVal: 0.274 ± 0.189
PheTrp: 0.274 ± 0.189
PheTyr: 1.917 ± 0.38
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.108 ± 0.867
GlyCys: 1.917 ± 0.525
GlyAsp: 2.465 ± 0.876
GlyGlu: 3.287 ± 0.643
GlyPhe: 1.917 ± 0.5
GlyGly: 6.574 ± 0.75
GlyHis: 3.287 ± 1.356
GlyIle: 7.669 ± 1.366
GlyLys: 4.656 ± 1.63
GlyLeu: 5.478 ± 1.543
GlyMet: 0.822 ± 0.308
GlyAsn: 2.465 ± 1.052
GlyPro: 5.752 ± 1.292
GlyGln: 5.204 ± 1.011
GlyArg: 3.835 ± 0.785
GlySer: 3.287 ± 0.86
GlyThr: 4.382 ± 1.385
GlyVal: 2.739 ± 0.498
GlyTrp: 1.096 ± 0.573
GlyTyr: 1.369 ± 0.668
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.096 ± 0.409
HisCys: 1.369 ± 1.281
HisAsp: 0.0 ± 0.0
HisGlu: 0.548 ± 0.22
HisPhe: 0.822 ± 0.833
HisGly: 1.643 ± 0.616
HisHis: 0.822 ± 0.837
HisIle: 1.917 ± 0.643
HisLys: 1.917 ± 0.874
HisLeu: 1.643 ± 0.451
HisMet: 0.822 ± 1.005
HisAsn: 1.096 ± 0.333
HisPro: 2.739 ± 1.049
HisGln: 2.465 ± 0.929
HisArg: 0.822 ± 0.304
HisSer: 0.822 ± 0.765
HisThr: 1.096 ± 0.68
HisVal: 0.274 ± 0.189
HisTrp: 0.0 ± 0.0
HisTyr: 0.548 ± 0.382
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.204 ± 0.884
IleCys: 1.096 ± 0.347
IleAsp: 2.191 ± 0.701
IleGlu: 4.108 ± 0.944
IlePhe: 1.096 ± 0.347
IleGly: 5.752 ± 1.754
IleHis: 1.096 ± 0.459
IleIle: 7.395 ± 1.273
IleLys: 6.026 ± 1.209
IleLeu: 4.93 ± 0.799
IleMet: 0.548 ± 0.485
IleAsn: 1.643 ± 0.359
IlePro: 4.108 ± 0.749
IleGln: 2.739 ± 1.264
IleArg: 4.656 ± 0.982
IleSer: 3.561 ± 0.585
IleThr: 3.013 ± 1.567
IleVal: 5.478 ± 0.967
IleTrp: 2.739 ± 0.634
IleTyr: 2.191 ± 0.445
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.478 ± 1.506
LysCys: 2.191 ± 1.109
LysAsp: 2.465 ± 0.654
LysGlu: 7.395 ± 2.14
LysPhe: 1.096 ± 0.53
LysGly: 4.382 ± 1.094
LysHis: 1.643 ± 0.391
LysIle: 4.656 ± 1.449
LysLys: 7.943 ± 1.927
LysLeu: 4.656 ± 1.54
LysMet: 1.096 ± 0.239
LysAsn: 2.739 ± 0.624
LysPro: 1.643 ± 0.609
LysGln: 4.93 ± 0.726
LysArg: 2.739 ± 0.642
LysSer: 2.739 ± 0.614
LysThr: 3.287 ± 0.74
LysVal: 4.382 ± 1.43
LysTrp: 2.465 ± 0.544
LysTyr: 1.917 ± 0.482
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.835 ± 0.818
LeuCys: 1.096 ± 0.653
LeuAsp: 3.835 ± 0.793
LeuGlu: 7.395 ± 1.307
LeuPhe: 2.191 ± 1.061
LeuGly: 6.574 ± 2.026
LeuHis: 1.643 ± 0.644
LeuIle: 3.561 ± 1.499
LeuLys: 7.395 ± 1.122
LeuLeu: 8.217 ± 2.706
LeuMet: 1.096 ± 0.798
LeuAsn: 6.574 ± 1.057
LeuPro: 2.191 ± 0.773
LeuGln: 4.93 ± 0.731
LeuArg: 4.93 ± 0.871
LeuSer: 3.013 ± 0.664
LeuThr: 4.108 ± 0.721
LeuVal: 4.382 ± 1.322
LeuTrp: 2.739 ± 1.005
LeuTyr: 2.465 ± 0.636
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.096 ± 0.615
MetCys: 0.0 ± 0.0
MetAsp: 1.096 ± 0.539
MetGlu: 1.917 ± 0.519
MetPhe: 0.822 ± 0.225
MetGly: 1.917 ± 0.461
MetHis: 0.822 ± 0.308
MetIle: 0.822 ± 0.304
MetLys: 0.274 ± 0.243
MetLeu: 1.369 ± 0.456
MetMet: 1.096 ± 0.575
MetAsn: 0.822 ± 0.424
MetPro: 0.0 ± 0.0
MetGln: 1.369 ± 0.649
MetArg: 1.643 ± 0.324
MetSer: 0.822 ± 0.225
MetThr: 2.739 ± 0.623
MetVal: 1.369 ± 0.311
MetTrp: 0.274 ± 0.243
MetTyr: 1.096 ± 0.758
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.643 ± 0.49
AsnCys: 2.739 ± 0.959
AsnAsp: 1.096 ± 0.239
AsnGlu: 2.739 ± 0.71
AsnPhe: 3.013 ± 1.059
AsnGly: 1.917 ± 0.889
AsnHis: 0.548 ± 0.631
AsnIle: 3.013 ± 2.019
AsnLys: 3.561 ± 1.39
AsnLeu: 4.382 ± 0.856
AsnMet: 1.096 ± 0.774
AsnAsn: 3.287 ± 1.6
AsnPro: 3.561 ± 0.926
AsnGln: 0.548 ± 0.378
AsnArg: 1.917 ± 0.654
AsnSer: 3.561 ± 1.099
AsnThr: 3.835 ± 0.676
AsnVal: 1.643 ± 1.129
AsnTrp: 1.917 ± 0.489
AsnTyr: 1.369 ± 0.425
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.835 ± 0.651
ProCys: 1.096 ± 0.79
ProAsp: 1.917 ± 0.485
ProGlu: 3.287 ± 0.543
ProPhe: 1.369 ± 0.675
ProGly: 4.93 ± 1.207
ProHis: 0.548 ± 0.34
ProIle: 5.752 ± 0.648
ProLys: 3.287 ± 1.041
ProLeu: 5.204 ± 0.7
ProMet: 1.096 ± 0.572
ProAsn: 0.822 ± 0.672
ProPro: 3.561 ± 1.226
ProGln: 4.382 ± 1.02
ProArg: 4.382 ± 1.295
ProSer: 2.465 ± 1.006
ProThr: 1.917 ± 0.518
ProVal: 5.478 ± 1.156
ProTrp: 1.096 ± 0.857
ProTyr: 1.096 ± 0.391
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.395 ± 1.375
GlnCys: 0.822 ± 0.599
GlnAsp: 2.191 ± 1.019
GlnGlu: 3.561 ± 0.768
GlnPhe: 0.0 ± 0.0
GlnGly: 5.478 ± 0.889
GlnHis: 1.369 ± 0.76
GlnIle: 4.382 ± 1.122
GlnLys: 2.739 ± 1.083
GlnLeu: 5.478 ± 0.932
GlnMet: 2.739 ± 0.939
GlnAsn: 4.382 ± 0.788
GlnPro: 2.191 ± 1.407
GlnGln: 3.013 ± 1.492
GlnArg: 3.287 ± 1.168
GlnSer: 1.917 ± 0.398
GlnThr: 1.917 ± 0.307
GlnVal: 3.835 ± 1.255
GlnTrp: 1.096 ± 0.44
GlnTyr: 1.643 ± 0.602
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.93 ± 1.106
ArgCys: 0.822 ± 0.454
ArgAsp: 3.561 ± 0.869
ArgGlu: 4.93 ± 0.784
ArgPhe: 0.822 ± 0.39
ArgGly: 4.656 ± 0.86
ArgHis: 0.548 ± 0.547
ArgIle: 5.204 ± 2.065
ArgLys: 4.382 ± 1.05
ArgLeu: 4.382 ± 0.59
ArgMet: 1.096 ± 0.464
ArgAsn: 1.917 ± 1.005
ArgPro: 3.561 ± 0.82
ArgGln: 5.752 ± 1.189
ArgArg: 5.204 ± 2.469
ArgSer: 3.561 ± 2.037
ArgThr: 2.739 ± 0.645
ArgVal: 1.917 ± 0.489
ArgTrp: 2.191 ± 1.048
ArgTyr: 1.643 ± 0.391
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.739 ± 0.82
SerCys: 0.274 ± 0.189
SerAsp: 2.465 ± 0.407
SerGlu: 3.835 ± 0.968
SerPhe: 1.643 ± 0.812
SerGly: 3.287 ± 1.151
SerHis: 1.369 ± 0.78
SerIle: 2.465 ± 0.35
SerLys: 2.465 ± 0.698
SerLeu: 5.204 ± 2.419
SerMet: 0.822 ± 0.452
SerAsn: 2.191 ± 0.539
SerPro: 4.382 ± 1.166
SerGln: 5.204 ± 1.517
SerArg: 3.561 ± 0.781
SerSer: 3.287 ± 1.173
SerThr: 3.835 ± 1.497
SerVal: 3.013 ± 0.385
SerTrp: 0.548 ± 0.22
SerTyr: 1.096 ± 0.79
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.739 ± 1.003
ThrCys: 0.548 ± 0.485
ThrAsp: 2.465 ± 0.81
ThrGlu: 5.752 ± 0.478
ThrPhe: 0.822 ± 0.308
ThrGly: 3.561 ± 0.441
ThrHis: 1.096 ± 0.653
ThrIle: 1.917 ± 0.53
ThrLys: 2.191 ± 1.213
ThrLeu: 5.752 ± 1.319
ThrMet: 1.369 ± 0.433
ThrAsn: 4.108 ± 1.68
ThrPro: 3.561 ± 0.504
ThrGln: 2.465 ± 0.82
ThrArg: 2.191 ± 0.826
ThrSer: 3.561 ± 0.814
ThrThr: 3.287 ± 1.298
ThrVal: 4.656 ± 1.07
ThrTrp: 1.917 ± 0.473
ThrTyr: 1.369 ± 0.72
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.561 ± 0.905
ValCys: 0.548 ± 0.631
ValAsp: 3.013 ± 1.039
ValGlu: 3.013 ± 1.091
ValPhe: 0.822 ± 0.391
ValGly: 4.656 ± 1.444
ValHis: 2.739 ± 1.102
ValIle: 3.561 ± 0.966
ValLys: 4.382 ± 1.284
ValLeu: 3.835 ± 0.648
ValMet: 0.274 ± 0.35
ValAsn: 1.643 ± 0.85
ValPro: 3.835 ± 0.969
ValGln: 3.287 ± 0.717
ValArg: 3.287 ± 0.46
ValSer: 3.561 ± 0.896
ValThr: 3.013 ± 1.094
ValVal: 3.013 ± 1.041
ValTrp: 1.917 ± 0.462
ValTyr: 1.643 ± 0.493
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.191 ± 0.445
TrpCys: 0.274 ± 0.307
TrpAsp: 1.643 ± 0.674
TrpGlu: 1.643 ± 0.534
TrpPhe: 0.548 ± 0.382
TrpGly: 1.917 ± 0.568
TrpHis: 0.274 ± 0.35
TrpIle: 0.822 ± 0.368
TrpLys: 2.191 ± 0.626
TrpLeu: 0.822 ± 0.53
TrpMet: 1.643 ± 0.481
TrpAsn: 2.191 ± 1.392
TrpPro: 1.096 ± 0.411
TrpGln: 1.917 ± 0.663
TrpArg: 2.191 ± 0.394
TrpSer: 1.369 ± 0.912
TrpThr: 1.096 ± 0.589
TrpVal: 1.643 ± 0.4
TrpTrp: 0.822 ± 0.331
TrpTyr: 0.548 ± 0.22
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.643 ± 0.359
TyrCys: 1.096 ± 0.369
TyrAsp: 0.548 ± 0.378
TyrGlu: 1.369 ± 0.552
TyrPhe: 0.822 ± 0.424
TyrGly: 1.369 ± 0.74
TyrHis: 1.096 ± 0.563
TyrIle: 0.548 ± 0.22
TyrLys: 2.191 ± 0.656
TyrLeu: 1.369 ± 0.527
TyrMet: 0.274 ± 0.189
TyrAsn: 1.917 ± 0.663
TyrPro: 1.369 ± 0.613
TyrGln: 2.191 ± 0.773
TyrArg: 1.917 ± 0.462
TyrSer: 1.643 ± 0.324
TyrThr: 1.096 ± 0.461
TyrVal: 1.643 ± 0.671
TyrTrp: 0.822 ± 0.304
TyrTyr: 1.096 ± 0.333
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3652 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
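A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. "KL")."""
    counts = Counter()
    for seq in sequences:
        counts.update(zip(seq, seq[1:]))  # overlapping adjacent pairs
    total = sum(counts.values())
    return 1000.0 * counts[tuple(dipeptide)] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not the HIV-1 proteins behind the table.
toy = ["MKKLLA", "AAKL", "GGKLP"]
print(dipeptide_per_mille(toy, "KL"), "±", bootstrap_error(toy, "KL"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are in the set.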

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski