Amino acid dipeptide frequency for Human immunodeficiency virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
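The uniform expectation described above can be sketched in a few lines. This is a minimal illustration, not the database's own code: the function names `uniform_expectation_permille` and `classify` are hypothetical, and the assumption that the red/blue marking is a plain comparison against the uniform expectation is taken from the text above rather than from the Proteome-pI source.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency (in per mille) of each dipeptide if all
    alphabet_size**2 dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected_permille):
    """Assumed marking rule: 'red' above expectation, 'blue' below it."""
    if freq_permille > expected_permille:
        return "red"
    if freq_permille < expected_permille:
        return "blue"
    return "neutral"


expected = uniform_expectation_permille(20)  # 400 dipeptides
print(expected)                    # 2.5
print(classify(8.618, expected))   # LeuLeu from the table: red
print(classify(0.319, expected))   # CysCys from the table: blue
```

With Xaa counted as a 21st residue, `uniform_expectation_permille(21)` gives roughly 2.27‰ instead of 2.5‰.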

All values are given in per mille (‰); multiply by 10⁻³ to obtain a fraction.

For more information, see the sequence space article on Wikipedia.
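As a rough sketch of how such per mille values could be obtained from protein sequences, the snippet below counts overlapping residue pairs and scales them to ‰. Several details are assumptions rather than the database's documented method: the one-letter input alphabet, the mapping of every non-standard letter to `X` (Xaa), normalisation by the total number of pairs, and the function name `dipeptide_permille`.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in sequences:
        # Map any non-standard or unknown residue to 'X' (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real HIV-1 sequences.
freqs = dipeptide_permille(["MKVLA", "AAKB"])
fraction_aa = freqs["AA"] * 1e-3  # per mille back to a plain fraction
```

In the toy input there are seven pairs in total, so each observed dipeptide has a frequency of 1000/7 ‰ and all frequencies sum to 1000 ‰.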

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.469 ± 1.645
AlaCys: 1.915 ± 0.556
AlaAsp: 1.596 ± 0.639
AlaGlu: 4.788 ± 1.008
AlaPhe: 1.915 ± 0.411
AlaGly: 5.107 ± 1.436
AlaHis: 1.277 ± 0.535
AlaIle: 4.788 ± 2.032
AlaLys: 3.192 ± 0.869
AlaLeu: 5.107 ± 1.672
AlaMet: 1.596 ± 0.556
AlaAsn: 2.234 ± 0.83
AlaPro: 2.553 ± 1.154
AlaGln: 1.596 ± 0.399
AlaArg: 2.873 ± 0.707
AlaSer: 5.107 ± 1.027
AlaThr: 3.511 ± 1.094
AlaVal: 5.426 ± 1.286
AlaTrp: 1.277 ± 0.543
AlaTyr: 1.277 ± 0.543
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.958 ± 0.602
CysCys: 0.319 ± 0.472
CysAsp: 0.319 ± 0.227
CysGlu: 0.319 ± 0.506
CysPhe: 1.277 ± 1.136
CysGly: 1.596 ± 0.62
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.958 ± 0.534
CysLeu: 0.319 ± 0.27
CysMet: 0.319 ± 0.346
CysAsn: 1.596 ± 1.134
CysPro: 0.319 ± 0.27
CysGln: 0.958 ± 0.534
CysArg: 1.277 ± 0.679
CysSer: 1.915 ± 0.983
CysThr: 3.192 ± 1.014
CysVal: 1.596 ± 0.662
CysTrp: 0.638 ± 0.337
CysTyr: 0.638 ± 0.666
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.958 ± 0.438
AspCys: 2.553 ± 1.181
AspAsp: 2.234 ± 0.706
AspGlu: 0.638 ± 0.492
AspPhe: 1.277 ± 0.923
AspGly: 1.596 ± 0.584
AspHis: 0.0 ± 0.0
AspIle: 3.511 ± 0.617
AspLys: 4.149 ± 1.25
AspLeu: 4.149 ± 1.115
AspMet: 0.958 ± 0.522
AspAsn: 1.915 ± 0.815
AspPro: 2.873 ± 0.919
AspGln: 1.915 ± 0.555
AspArg: 3.83 ± 1.744
AspSer: 2.553 ± 1.127
AspThr: 2.873 ± 0.668
AspVal: 1.596 ± 0.558
AspTrp: 0.958 ± 1.092
AspTyr: 0.638 ± 0.337
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.426 ± 1.303
GluCys: 0.0 ± 0.0
GluAsp: 2.553 ± 1.008
GluGlu: 7.341 ± 2.334
GluPhe: 1.277 ± 0.667
GluGly: 4.788 ± 0.951
GluHis: 0.638 ± 0.453
GluIle: 3.83 ± 1.317
GluLys: 4.788 ± 1.034
GluLeu: 7.341 ± 1.452
GluMet: 2.234 ± 1.477
GluAsn: 1.596 ± 0.498
GluPro: 5.426 ± 1.504
GluGln: 3.511 ± 0.889
GluArg: 4.149 ± 1.845
GluSer: 3.192 ± 1.546
GluThr: 3.83 ± 1.721
GluVal: 4.469 ± 0.757
GluTrp: 1.915 ± 0.634
GluTyr: 1.596 ± 0.986
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.277 ± 0.43
PheCys: 0.319 ± 0.27
PheAsp: 0.638 ± 0.651
PheGlu: 0.319 ± 0.27
PhePhe: 0.958 ± 0.491
PheGly: 1.596 ± 0.597
PheHis: 0.638 ± 0.666
PheIle: 1.596 ± 0.59
PheLys: 1.277 ± 0.434
PheLeu: 2.553 ± 0.554
PheMet: 0.319 ± 0.472
PheAsn: 2.873 ± 1.705
PhePro: 1.915 ± 1.034
PheGln: 0.638 ± 0.272
PheArg: 3.511 ± 1.257
PheSer: 2.234 ± 0.625
PheThr: 1.277 ± 0.619
PheVal: 0.638 ± 0.272
PheTrp: 0.319 ± 0.227
PheTyr: 1.596 ± 0.444
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.788 ± 1.042
GlyCys: 1.596 ± 0.598
GlyAsp: 2.553 ± 0.92
GlyGlu: 3.83 ± 0.824
GlyPhe: 1.596 ± 0.508
GlyGly: 6.064 ± 1.31
GlyHis: 2.873 ± 2.164
GlyIle: 6.064 ± 2.434
GlyLys: 5.107 ± 1.423
GlyLeu: 3.511 ± 0.773
GlyMet: 0.638 ± 0.407
GlyAsn: 2.553 ± 0.967
GlyPro: 4.469 ± 0.953
GlyGln: 3.83 ± 1.365
GlyArg: 4.788 ± 1.118
GlySer: 5.107 ± 1.103
GlyThr: 3.511 ± 1.81
GlyVal: 3.83 ± 1.469
GlyTrp: 1.915 ± 0.754
GlyTyr: 1.915 ± 0.771
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.958 ± 0.719
HisCys: 0.638 ± 0.485
HisAsp: 0.0 ± 0.0
HisGlu: 0.638 ± 0.272
HisPhe: 0.958 ± 0.991
HisGly: 1.596 ± 0.733
HisHis: 1.277 ± 1.55
HisIle: 1.596 ± 0.751
HisLys: 1.277 ± 0.675
HisLeu: 2.873 ± 0.976
HisMet: 0.638 ± 0.936
HisAsn: 1.596 ± 0.492
HisPro: 1.915 ± 0.972
HisGln: 2.234 ± 1.462
HisArg: 0.958 ± 0.438
HisSer: 1.915 ± 0.81
HisThr: 1.596 ± 1.028
HisVal: 0.638 ± 0.407
HisTrp: 0.0 ± 0.0
HisTyr: 0.958 ± 0.968
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.192 ± 1.08
IleCys: 1.277 ± 0.543
IleAsp: 2.234 ± 2.052
IleGlu: 4.788 ± 1.418
IlePhe: 0.958 ± 0.491
IleGly: 6.064 ± 2.243
IleHis: 2.553 ± 0.788
IleIle: 5.107 ± 2.558
IleLys: 4.788 ± 1.454
IleLeu: 5.745 ± 1.052
IleMet: 0.958 ± 0.522
IleAsn: 1.915 ± 0.609
IlePro: 3.511 ± 0.725
IleGln: 2.873 ± 1.419
IleArg: 5.107 ± 1.489
IleSer: 3.83 ± 1.21
IleThr: 2.553 ± 0.807
IleVal: 7.341 ± 2.872
IleTrp: 2.234 ± 0.643
IleTyr: 1.915 ± 0.783
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.745 ± 1.147
LysCys: 1.915 ± 0.594
LysAsp: 2.873 ± 0.62
LysGlu: 7.022 ± 1.829
LysPhe: 0.319 ± 0.227
LysGly: 4.469 ± 1.306
LysHis: 1.596 ± 0.831
LysIle: 6.064 ± 2.012
LysLys: 4.149 ± 1.688
LysLeu: 7.022 ± 1.566
LysMet: 0.319 ± 0.227
LysAsn: 2.234 ± 1.036
LysPro: 1.915 ± 1.376
LysGln: 4.788 ± 1.285
LysArg: 2.553 ± 0.879
LysSer: 2.234 ± 0.544
LysThr: 4.149 ± 0.728
LysVal: 4.469 ± 1.213
LysTrp: 2.234 ± 0.569
LysTyr: 1.915 ± 0.815
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.469 ± 1.082
LeuCys: 0.958 ± 0.491
LeuAsp: 3.83 ± 0.848
LeuGlu: 7.022 ± 1.975
LeuPhe: 2.234 ± 1.18
LeuGly: 6.064 ± 1.38
LeuHis: 2.553 ± 1.639
LeuIle: 5.107 ± 2.007
LeuLys: 7.022 ± 1.636
LeuLeu: 8.618 ± 3.429
LeuMet: 0.638 ± 0.764
LeuAsn: 3.192 ± 1.049
LeuPro: 3.192 ± 0.917
LeuGln: 5.107 ± 1.016
LeuArg: 4.469 ± 1.053
LeuSer: 2.873 ± 0.888
LeuThr: 4.149 ± 0.971
LeuVal: 6.064 ± 1.774
LeuTrp: 3.192 ± 1.158
LeuTyr: 1.915 ± 0.872
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.277 ± 0.727
MetCys: 0.0 ± 0.0
MetAsp: 0.958 ± 0.515
MetGlu: 1.915 ± 0.727
MetPhe: 0.319 ± 0.316
MetGly: 1.915 ± 1.379
MetHis: 0.638 ± 0.272
MetIle: 1.915 ± 0.609
MetLys: 0.638 ± 0.337
MetLeu: 0.958 ± 0.595
MetMet: 0.319 ± 0.316
MetAsn: 0.638 ± 0.453
MetPro: 0.0 ± 0.0
MetGln: 0.638 ± 0.768
MetArg: 1.915 ± 0.771
MetSer: 0.319 ± 0.316
MetThr: 2.553 ± 0.87
MetVal: 0.638 ± 0.337
MetTrp: 0.638 ± 0.539
MetTyr: 0.958 ± 0.514
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.234 ± 1.036
AsnCys: 2.234 ± 0.879
AsnAsp: 0.958 ± 0.421
AsnGlu: 2.234 ± 0.756
AsnPhe: 2.873 ± 0.896
AsnGly: 2.234 ± 0.947
AsnHis: 0.0 ± 0.0
AsnIle: 1.915 ± 0.983
AsnLys: 3.192 ± 0.764
AsnLeu: 2.873 ± 0.771
AsnMet: 1.277 ± 1.078
AsnAsn: 4.788 ± 2.607
AsnPro: 3.511 ± 1.323
AsnGln: 0.958 ± 0.277
AsnArg: 1.596 ± 0.657
AsnSer: 3.192 ± 1.014
AsnThr: 4.788 ± 1.185
AsnVal: 1.277 ± 0.745
AsnTrp: 1.915 ± 0.636
AsnTyr: 0.958 ± 0.489
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.873 ± 1.0
ProCys: 0.958 ± 0.809
ProAsp: 2.873 ± 0.808
ProGlu: 3.83 ± 1.036
ProPhe: 1.596 ± 0.716
ProGly: 4.788 ± 1.659
ProHis: 0.319 ± 0.227
ProIle: 4.788 ± 1.06
ProLys: 2.873 ± 1.262
ProLeu: 5.107 ± 1.209
ProMet: 0.638 ± 0.524
ProAsn: 0.958 ± 0.692
ProPro: 3.192 ± 1.464
ProGln: 3.192 ± 0.791
ProArg: 2.873 ± 0.87
ProSer: 2.234 ± 1.243
ProThr: 2.873 ± 1.052
ProVal: 5.107 ± 1.385
ProTrp: 0.958 ± 0.844
ProTyr: 0.958 ± 0.561
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.745 ± 1.01
GlnCys: 0.638 ± 0.272
GlnAsp: 1.915 ± 0.853
GlnGlu: 4.149 ± 0.957
GlnPhe: 0.958 ± 0.527
GlnGly: 4.788 ± 0.849
GlnHis: 1.277 ± 0.543
GlnIle: 4.788 ± 1.106
GlnLys: 3.83 ± 1.763
GlnLeu: 6.064 ± 1.401
GlnMet: 2.234 ± 1.38
GlnAsn: 3.192 ± 1.05
GlnPro: 1.915 ± 1.191
GlnGln: 2.234 ± 1.277
GlnArg: 3.83 ± 1.485
GlnSer: 1.915 ± 0.713
GlnThr: 1.596 ± 0.476
GlnVal: 3.511 ± 1.626
GlnTrp: 0.638 ± 0.453
GlnTyr: 2.234 ± 0.708
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.469 ± 1.048
ArgCys: 0.638 ± 0.499
ArgAsp: 4.149 ± 0.81
ArgGlu: 5.107 ± 1.187
ArgPhe: 1.277 ± 1.042
ArgGly: 2.873 ± 0.716
ArgHis: 1.596 ± 1.268
ArgIle: 5.107 ± 2.12
ArgLys: 4.788 ± 2.315
ArgLeu: 3.83 ± 2.069
ArgMet: 1.596 ± 0.489
ArgAsn: 1.915 ± 0.812
ArgPro: 2.873 ± 1.012
ArgGln: 6.703 ± 1.362
ArgArg: 4.788 ± 2.928
ArgSer: 2.873 ± 1.356
ArgThr: 1.277 ± 0.576
ArgVal: 1.915 ± 0.915
ArgTrp: 2.553 ± 0.99
ArgTyr: 1.277 ± 0.507
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.873 ± 0.848
SerCys: 0.638 ± 0.272
SerAsp: 2.873 ± 0.629
SerGlu: 4.469 ± 1.235
SerPhe: 2.234 ± 0.872
SerGly: 3.511 ± 1.414
SerHis: 0.958 ± 0.71
SerIle: 3.192 ± 0.932
SerLys: 1.915 ± 0.543
SerLeu: 6.064 ± 2.116
SerMet: 1.277 ± 0.463
SerAsn: 2.234 ± 0.748
SerPro: 3.511 ± 1.147
SerGln: 5.107 ± 1.898
SerArg: 3.511 ± 1.329
SerSer: 2.873 ± 0.937
SerThr: 4.469 ± 1.76
SerVal: 2.234 ± 0.35
SerTrp: 0.638 ± 0.272
SerTyr: 0.958 ± 0.626
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.873 ± 0.574
ThrCys: 0.0 ± 0.0
ThrAsp: 2.873 ± 1.2
ThrGlu: 5.107 ± 1.279
ThrPhe: 0.958 ± 0.374
ThrGly: 2.873 ± 0.542
ThrHis: 2.234 ± 0.875
ThrIle: 3.511 ± 1.016
ThrLys: 3.83 ± 1.343
ThrLeu: 4.149 ± 1.314
ThrMet: 0.958 ± 0.522
ThrAsn: 3.192 ± 0.939
ThrPro: 3.511 ± 1.181
ThrGln: 2.553 ± 0.746
ThrArg: 2.553 ± 1.003
ThrSer: 4.149 ± 1.335
ThrThr: 3.511 ± 0.856
ThrVal: 5.426 ± 1.511
ThrTrp: 2.234 ± 0.68
ThrTyr: 1.596 ± 0.979
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.511 ± 1.949
ValCys: 0.319 ± 0.472
ValAsp: 3.192 ± 1.316
ValGlu: 3.83 ± 1.667
ValPhe: 0.958 ± 0.421
ValGly: 4.788 ± 0.654
ValHis: 2.553 ± 0.833
ValIle: 4.149 ± 0.872
ValLys: 4.469 ± 1.072
ValLeu: 4.469 ± 0.869
ValMet: 0.319 ± 0.506
ValAsn: 2.873 ± 0.944
ValPro: 4.149 ± 1.14
ValGln: 3.83 ± 1.272
ValArg: 3.192 ± 1.002
ValSer: 4.149 ± 1.179
ValThr: 3.83 ± 0.992
ValVal: 4.469 ± 1.369
ValTrp: 2.553 ± 0.821
ValTyr: 1.277 ± 0.619
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.915 ± 0.488
TrpCys: 0.319 ± 0.411
TrpAsp: 1.596 ± 0.898
TrpGlu: 1.596 ± 0.558
TrpPhe: 0.958 ± 0.622
TrpGly: 2.553 ± 0.885
TrpHis: 0.319 ± 0.506
TrpIle: 0.958 ± 0.483
TrpLys: 3.192 ± 0.612
TrpLeu: 0.958 ± 0.65
TrpMet: 1.277 ± 0.529
TrpAsn: 1.915 ± 1.293
TrpPro: 1.277 ± 0.543
TrpGln: 2.234 ± 0.925
TrpArg: 2.234 ± 0.674
TrpSer: 1.596 ± 0.988
TrpThr: 1.596 ± 0.934
TrpVal: 0.958 ± 0.277
TrpTrp: 0.958 ± 0.421
TrpTyr: 0.638 ± 0.272
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.596 ± 0.671
TyrCys: 1.277 ± 0.59
TyrAsp: 0.958 ± 0.421
TyrGlu: 0.958 ± 0.606
TyrPhe: 1.596 ± 0.895
TyrGly: 1.277 ± 0.607
TyrHis: 0.958 ± 0.374
TyrIle: 0.958 ± 0.438
TyrLys: 2.553 ± 1.198
TyrLeu: 1.277 ± 0.669
TyrMet: 0.319 ± 0.227
TyrAsn: 1.596 ± 0.716
TyrPro: 0.958 ± 0.761
TyrGln: 2.553 ± 0.999
TyrArg: 1.596 ± 0.924
TyrSer: 1.277 ± 0.411
TyrThr: 0.958 ± 0.374
TyrVal: 1.277 ± 0.67
TyrTrp: 1.277 ± 0.529
TyrTyr: 0.958 ± 0.421
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (3134 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
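A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported as the error. The helper names `pair_permille` and `bootstrap_error`, the use of the sample standard deviation as the error measure, and the fixed random seed are all assumptions made for illustration; the database's actual procedure may differ.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide, pooled over the given proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy one-letter sequences, not real HIV-1 proteins.
toy_proteome = ["MKVLAAK", "AAKLLE", "GGAAVLK"]
error_aa = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is presumably why the errors in the table are fairly large for a proteome of only nine proteins.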

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski