Amino acid dipepetide frequency for Human immunodeficiency virus type 1 group M subtype H (isolate 90CF056) (HIV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.396AlaAla: 4.396 ± 1.086
1.923AlaCys: 1.923 ± 0.601
1.648AlaAsp: 1.648 ± 0.67
5.769AlaGlu: 5.769 ± 1.146
1.648AlaPhe: 1.648 ± 0.365
4.121AlaGly: 4.121 ± 0.886
1.099AlaHis: 1.099 ± 0.521
5.495AlaIle: 5.495 ± 1.06
2.473AlaLys: 2.473 ± 1.218
5.769AlaLeu: 5.769 ± 0.842
2.198AlaMet: 2.198 ± 0.845
1.374AlaAsn: 1.374 ± 0.604
2.473AlaPro: 2.473 ± 0.717
1.648AlaGln: 1.648 ± 0.365
4.396AlaArg: 4.396 ± 0.921
6.319AlaSer: 6.319 ± 0.932
2.198AlaThr: 2.198 ± 0.637
4.67AlaVal: 4.67 ± 0.855
2.198AlaTrp: 2.198 ± 0.75
0.824AlaTyr: 0.824 ± 0.357
0.0AlaXaa: 0.0 ± 0.0
Cys
0.549CysAla: 0.549 ± 0.487
0.549CysCys: 0.549 ± 0.739
0.275CysAsp: 0.275 ± 0.192
0.0CysGlu: 0.0 ± 0.0
1.648CysPhe: 1.648 ± 1.089
1.648CysGly: 1.648 ± 0.692
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.648CysLys: 1.648 ± 0.844
0.824CysLeu: 0.824 ± 0.483
0.275CysMet: 0.275 ± 0.344
1.923CysAsn: 1.923 ± 1.083
0.549CysPro: 0.549 ± 0.486
1.648CysGln: 1.648 ± 0.87
1.099CysArg: 1.099 ± 0.437
1.374CysSer: 1.374 ± 0.736
2.747CysThr: 2.747 ± 1.024
1.923CysVal: 1.923 ± 0.82
0.824CysTrp: 0.824 ± 0.385
1.099CysTyr: 1.099 ± 1.478
0.0CysXaa: 0.0 ± 0.0
Asp
1.374AspAla: 1.374 ± 0.315
2.747AspCys: 2.747 ± 0.796
1.648AspAsp: 1.648 ± 0.612
0.549AspGlu: 0.549 ± 0.238
0.824AspPhe: 0.824 ± 0.575
2.198AspGly: 2.198 ± 1.194
0.549AspHis: 0.549 ± 0.739
4.67AspIle: 4.67 ± 1.103
4.121AspLys: 4.121 ± 1.329
3.571AspLeu: 3.571 ± 1.094
0.824AspMet: 0.824 ± 0.445
2.198AspAsn: 2.198 ± 0.806
3.022AspPro: 3.022 ± 1.984
1.374AspGln: 1.374 ± 0.668
3.297AspArg: 3.297 ± 0.839
1.923AspSer: 1.923 ± 0.907
2.747AspThr: 2.747 ± 0.786
1.648AspVal: 1.648 ± 0.566
0.549AspTrp: 0.549 ± 0.435
0.824AspTyr: 0.824 ± 0.385
0.0AspXaa: 0.0 ± 0.0
Glu
5.769GluAla: 5.769 ± 1.097
0.0GluCys: 0.0 ± 0.0
1.923GluAsp: 1.923 ± 1.563
6.044GluGlu: 6.044 ± 1.253
1.099GluPhe: 1.099 ± 0.521
5.495GluGly: 5.495 ± 0.895
0.549GluHis: 0.549 ± 0.383
3.846GluIle: 3.846 ± 1.191
4.121GluLys: 4.121 ± 0.73
6.044GluLeu: 6.044 ± 1.502
2.473GluMet: 2.473 ± 0.623
2.473GluAsn: 2.473 ± 0.741
5.22GluPro: 5.22 ± 1.34
4.945GluGln: 4.945 ± 0.783
4.396GluArg: 4.396 ± 1.517
2.473GluSer: 2.473 ± 1.002
4.67GluThr: 4.67 ± 1.82
3.297GluVal: 3.297 ± 1.109
1.923GluTrp: 1.923 ± 0.85
0.824GluTyr: 0.824 ± 0.662
0.0GluXaa: 0.0 ± 0.0
Phe
1.374PheAla: 1.374 ± 0.416
0.275PheCys: 0.275 ± 0.243
1.099PheAsp: 1.099 ± 0.934
0.824PheGlu: 0.824 ± 0.533
1.099PhePhe: 1.099 ± 0.318
1.374PheGly: 1.374 ± 0.924
0.0PheHis: 0.0 ± 0.0
1.374PheIle: 1.374 ± 0.649
2.198PheLys: 2.198 ± 0.553
3.022PheLeu: 3.022 ± 0.758
0.0PheMet: 0.0 ± 0.0
2.747PheAsn: 2.747 ± 1.025
1.648PhePro: 1.648 ± 1.244
0.824PheGln: 0.824 ± 0.357
2.747PheArg: 2.747 ± 0.996
2.198PheSer: 2.198 ± 0.578
0.824PheThr: 0.824 ± 0.575
0.824PheVal: 0.824 ± 0.385
0.275PheTrp: 0.275 ± 0.192
1.099PheTyr: 1.099 ± 0.725
0.0PheXaa: 0.0 ± 0.0
Gly
5.495GlyAla: 5.495 ± 0.914
1.923GlyCys: 1.923 ± 0.558
3.297GlyAsp: 3.297 ± 0.713
4.121GlyGlu: 4.121 ± 1.704
2.198GlyPhe: 2.198 ± 0.768
6.319GlyGly: 6.319 ± 1.24
2.747GlyHis: 2.747 ± 1.411
5.495GlyIle: 5.495 ± 1.34
5.22GlyLys: 5.22 ± 1.994
4.945GlyLeu: 4.945 ± 1.313
1.099GlyMet: 1.099 ± 0.487
2.747GlyAsn: 2.747 ± 0.736
4.396GlyPro: 4.396 ± 0.883
5.22GlyGln: 5.22 ± 1.714
3.846GlyArg: 3.846 ± 1.302
3.297GlySer: 3.297 ± 0.718
4.121GlyThr: 4.121 ± 1.641
2.473GlyVal: 2.473 ± 0.591
1.099GlyTrp: 1.099 ± 0.89
1.923GlyTyr: 1.923 ± 0.68
0.0GlyXaa: 0.0 ± 0.0
His
1.099HisAla: 1.099 ± 0.437
0.824HisCys: 0.824 ± 0.712
0.0HisAsp: 0.0 ± 0.0
0.275HisGlu: 0.275 ± 0.192
1.099HisPhe: 1.099 ± 1.319
1.374HisGly: 1.374 ± 0.534
0.275HisHis: 0.275 ± 0.396
1.648HisIle: 1.648 ± 0.93
1.099HisLys: 1.099 ± 0.448
3.022HisLeu: 3.022 ± 0.561
1.099HisMet: 1.099 ± 1.108
1.099HisAsn: 1.099 ± 0.436
2.473HisPro: 2.473 ± 1.247
3.022HisGln: 3.022 ± 0.964
1.648HisArg: 1.648 ± 0.851
0.549HisSer: 0.549 ± 0.525
1.374HisThr: 1.374 ± 0.432
0.275HisVal: 0.275 ± 0.192
0.275HisTrp: 0.275 ± 0.243
0.549HisTyr: 0.549 ± 0.402
0.0HisXaa: 0.0 ± 0.0
Ile
3.297IleAla: 3.297 ± 0.773
1.099IleCys: 1.099 ± 0.476
2.473IleAsp: 2.473 ± 0.578
4.396IleGlu: 4.396 ± 1.277
1.099IlePhe: 1.099 ± 0.476
4.396IleGly: 4.396 ± 1.618
2.198IleHis: 2.198 ± 0.67
5.769IleIle: 5.769 ± 1.374
4.67IleLys: 4.67 ± 0.915
5.495IleLeu: 5.495 ± 1.18
1.099IleMet: 1.099 ± 0.437
1.923IleAsn: 1.923 ± 0.397
4.396IlePro: 4.396 ± 1.392
3.571IleGln: 3.571 ± 1.38
3.571IleArg: 3.571 ± 0.705
5.495IleSer: 5.495 ± 1.535
2.198IleThr: 2.198 ± 1.167
5.22IleVal: 5.22 ± 1.377
1.648IleTrp: 1.648 ± 0.478
2.473IleTyr: 2.473 ± 0.591
0.0IleXaa: 0.0 ± 0.0
Lys
3.571LysAla: 3.571 ± 1.185
2.198LysCys: 2.198 ± 0.737
3.571LysAsp: 3.571 ± 1.501
6.593LysGlu: 6.593 ± 2.007
1.923LysPhe: 1.923 ± 0.783
5.22LysGly: 5.22 ± 1.33
1.374LysHis: 1.374 ± 0.315
6.593LysIle: 6.593 ± 1.711
6.868LysLys: 6.868 ± 2.892
5.769LysLeu: 5.769 ± 1.673
0.549LysMet: 0.549 ± 0.383
2.198LysAsn: 2.198 ± 0.831
1.648LysPro: 1.648 ± 0.678
4.945LysGln: 4.945 ± 0.643
2.198LysArg: 2.198 ± 0.693
2.473LysSer: 2.473 ± 0.52
4.945LysThr: 4.945 ± 0.945
4.121LysVal: 4.121 ± 1.508
1.648LysTrp: 1.648 ± 0.612
1.923LysTyr: 1.923 ± 0.359
0.0LysXaa: 0.0 ± 0.0
Leu
4.121LeuAla: 4.121 ± 0.687
0.824LeuCys: 0.824 ± 0.442
4.396LeuAsp: 4.396 ± 1.098
4.945LeuGlu: 4.945 ± 1.399
2.473LeuPhe: 2.473 ± 0.885
7.418LeuGly: 7.418 ± 2.501
1.648LeuHis: 1.648 ± 0.95
3.571LeuIle: 3.571 ± 1.612
6.593LeuLys: 6.593 ± 0.924
7.967LeuLeu: 7.967 ± 2.89
1.099LeuMet: 1.099 ± 1.181
5.495LeuAsn: 5.495 ± 0.998
2.473LeuPro: 2.473 ± 0.738
4.945LeuGln: 4.945 ± 1.297
5.22LeuArg: 5.22 ± 1.194
3.846LeuSer: 3.846 ± 0.596
4.67LeuThr: 4.67 ± 1.056
6.319LeuVal: 6.319 ± 1.767
3.022LeuTrp: 3.022 ± 1.236
1.374LeuTyr: 1.374 ± 0.421
0.0LeuXaa: 0.0 ± 0.0
Met
0.824MetAla: 0.824 ± 0.483
0.824MetCys: 0.824 ± 0.712
1.099MetAsp: 1.099 ± 0.709
2.747MetGlu: 2.747 ± 1.384
0.549MetPhe: 0.549 ± 0.295
2.198MetGly: 2.198 ± 0.82
1.374MetHis: 1.374 ± 0.718
0.824MetIle: 0.824 ± 0.339
0.549MetLys: 0.549 ± 0.295
1.648MetLeu: 1.648 ± 0.565
1.099MetMet: 1.099 ± 0.591
0.549MetAsn: 0.549 ± 0.387
0.0MetPro: 0.0 ± 0.0
1.099MetGln: 1.099 ± 0.591
1.648MetArg: 1.648 ± 0.409
0.824MetSer: 0.824 ± 0.416
2.747MetThr: 2.747 ± 1.018
1.374MetVal: 1.374 ± 0.416
0.824MetTrp: 0.824 ± 0.583
1.099MetTyr: 1.099 ± 0.439
0.0MetXaa: 0.0 ± 0.0
Asn
2.473AsnAla: 2.473 ± 1.09
3.022AsnCys: 3.022 ± 0.949
0.824AsnAsp: 0.824 ± 0.357
2.747AsnGlu: 2.747 ± 1.03
1.923AsnPhe: 1.923 ± 0.946
1.648AsnGly: 1.648 ± 0.795
0.549AsnHis: 0.549 ± 0.739
3.297AsnIle: 3.297 ± 1.564
3.297AsnLys: 3.297 ± 0.706
2.747AsnLeu: 2.747 ± 0.659
1.099AsnMet: 1.099 ± 0.974
3.571AsnAsn: 3.571 ± 1.62
3.571AsnPro: 3.571 ± 0.708
0.549AsnGln: 0.549 ± 0.383
1.923AsnArg: 1.923 ± 0.596
3.022AsnSer: 3.022 ± 1.293
4.945AsnThr: 4.945 ± 1.192
1.923AsnVal: 1.923 ± 1.391
1.374AsnTrp: 1.374 ± 0.443
2.198AsnTyr: 2.198 ± 0.833
0.0AsnXaa: 0.0 ± 0.0
Pro
3.846ProAla: 3.846 ± 1.059
0.824ProCys: 0.824 ± 0.63
1.923ProAsp: 1.923 ± 0.49
4.121ProGlu: 4.121 ± 1.044
1.374ProPhe: 1.374 ± 0.7
4.945ProGly: 4.945 ± 1.133
0.824ProHis: 0.824 ± 0.474
4.396ProIle: 4.396 ± 1.497
3.022ProLys: 3.022 ± 1.07
4.396ProLeu: 4.396 ± 0.857
1.374ProMet: 1.374 ± 0.943
1.374ProAsn: 1.374 ± 0.91
2.747ProPro: 2.747 ± 1.071
3.571ProGln: 3.571 ± 1.332
3.297ProArg: 3.297 ± 0.871
2.473ProSer: 2.473 ± 0.826
1.923ProThr: 1.923 ± 0.485
5.22ProVal: 5.22 ± 1.087
0.824ProTrp: 0.824 ± 0.712
0.549ProTyr: 0.549 ± 0.383
0.0ProXaa: 0.0 ± 0.0
Gln
6.319GlnAla: 6.319 ± 1.057
0.275GlnCys: 0.275 ± 0.243
3.571GlnAsp: 3.571 ± 0.754
3.022GlnGlu: 3.022 ± 0.654
0.0GlnPhe: 0.0 ± 0.0
5.22GlnGly: 5.22 ± 0.867
1.099GlnHis: 1.099 ± 0.893
4.67GlnIle: 4.67 ± 1.229
4.396GlnLys: 4.396 ± 1.382
5.22GlnLeu: 5.22 ± 1.298
3.846GlnMet: 3.846 ± 1.608
3.571GlnAsn: 3.571 ± 1.179
1.648GlnPro: 1.648 ± 1.051
3.297GlnGln: 3.297 ± 0.93
3.571GlnArg: 3.571 ± 1.284
1.923GlnSer: 1.923 ± 0.637
1.648GlnThr: 1.648 ± 0.62
3.846GlnVal: 3.846 ± 1.497
0.549GlnTrp: 0.549 ± 0.383
1.923GlnTyr: 1.923 ± 0.618
0.0GlnXaa: 0.0 ± 0.0
Arg
5.495ArgAla: 5.495 ± 0.909
0.275ArgCys: 0.275 ± 0.396
3.571ArgAsp: 3.571 ± 0.928
6.319ArgGlu: 6.319 ± 1.365
1.374ArgPhe: 1.374 ± 0.604
3.297ArgGly: 3.297 ± 0.92
1.099ArgHis: 1.099 ± 1.156
3.022ArgIle: 3.022 ± 1.655
4.67ArgLys: 4.67 ± 1.679
4.121ArgLeu: 4.121 ± 1.11
1.374ArgMet: 1.374 ± 0.899
1.923ArgAsn: 1.923 ± 0.634
3.571ArgPro: 3.571 ± 1.1
4.121ArgGln: 4.121 ± 1.702
3.571ArgArg: 3.571 ± 2.014
2.473ArgSer: 2.473 ± 1.33
4.121ArgThr: 4.121 ± 1.162
3.846ArgVal: 3.846 ± 0.953
1.923ArgTrp: 1.923 ± 0.782
1.099ArgTyr: 1.099 ± 0.377
0.0ArgXaa: 0.0 ± 0.0
Ser
2.198SerAla: 2.198 ± 0.474
0.549SerCys: 0.549 ± 0.42
2.473SerAsp: 2.473 ± 0.641
3.297SerGlu: 3.297 ± 0.965
2.473SerPhe: 2.473 ± 1.589
3.297SerGly: 3.297 ± 1.046
1.374SerHis: 1.374 ± 0.5
3.297SerIle: 3.297 ± 0.584
2.198SerLys: 2.198 ± 1.732
5.495SerLeu: 5.495 ± 1.628
1.099SerMet: 1.099 ± 0.413
2.473SerAsn: 2.473 ± 1.34
3.846SerPro: 3.846 ± 1.333
4.121SerGln: 4.121 ± 1.328
4.396SerArg: 4.396 ± 1.479
2.198SerSer: 2.198 ± 0.598
4.121SerThr: 4.121 ± 0.952
2.198SerVal: 2.198 ± 0.462
1.099SerTrp: 1.099 ± 0.476
0.824SerTyr: 0.824 ± 0.712
0.0SerXaa: 0.0 ± 0.0
Thr
3.846ThrAla: 3.846 ± 1.001
0.275ThrCys: 0.275 ± 0.243
2.747ThrAsp: 2.747 ± 0.944
6.044ThrGlu: 6.044 ± 1.297
1.099ThrPhe: 1.099 ± 0.807
3.846ThrGly: 3.846 ± 0.745
1.923ThrHis: 1.923 ± 0.827
3.022ThrIle: 3.022 ± 0.869
3.297ThrLys: 3.297 ± 0.912
6.044ThrLeu: 6.044 ± 1.507
0.549ThrMet: 0.549 ± 0.402
3.571ThrAsn: 3.571 ± 0.864
4.121ThrPro: 4.121 ± 0.913
3.846ThrGln: 3.846 ± 0.936
1.648ThrArg: 1.648 ± 1.077
3.571ThrSer: 3.571 ± 1.293
2.473ThrThr: 2.473 ± 0.663
4.121ThrVal: 4.121 ± 1.461
1.923ThrTrp: 1.923 ± 0.584
1.374ThrTyr: 1.374 ± 0.784
0.0ThrXaa: 0.0 ± 0.0
Val
3.846ValAla: 3.846 ± 1.118
0.275ValCys: 0.275 ± 0.44
2.198ValAsp: 2.198 ± 0.888
3.022ValGlu: 3.022 ± 1.034
0.275ValPhe: 0.275 ± 0.192
4.67ValGly: 4.67 ± 0.647
3.022ValHis: 3.022 ± 1.049
3.297ValIle: 3.297 ± 0.87
5.495ValLys: 5.495 ± 1.566
4.121ValLeu: 4.121 ± 0.983
0.549ValMet: 0.549 ± 0.402
2.747ValAsn: 2.747 ± 0.907
3.022ValPro: 3.022 ± 1.073
3.571ValGln: 3.571 ± 1.076
4.67ValArg: 4.67 ± 1.294
3.846ValSer: 3.846 ± 0.67
3.571ValThr: 3.571 ± 0.899
4.121ValVal: 4.121 ± 1.213
2.473ValTrp: 2.473 ± 1.077
1.648ValTyr: 1.648 ± 0.607
0.0ValXaa: 0.0 ± 0.0
Trp
1.648TrpAla: 1.648 ± 0.478
0.275TrpCys: 0.275 ± 0.396
1.648TrpAsp: 1.648 ± 0.877
1.923TrpGlu: 1.923 ± 0.598
0.549TrpPhe: 0.549 ± 0.402
1.923TrpGly: 1.923 ± 0.795
0.275TrpHis: 0.275 ± 0.396
0.824TrpIle: 0.824 ± 0.385
2.473TrpLys: 2.473 ± 0.598
0.824TrpLeu: 0.824 ± 0.706
1.648TrpMet: 1.648 ± 0.566
1.648TrpAsn: 1.648 ± 0.955
0.824TrpPro: 0.824 ± 0.385
1.923TrpGln: 1.923 ± 0.783
1.923TrpArg: 1.923 ± 0.67
0.824TrpSer: 0.824 ± 0.795
1.923TrpThr: 1.923 ± 0.709
1.648TrpVal: 1.648 ± 0.508
0.549TrpTrp: 0.549 ± 0.383
0.549TrpTyr: 0.549 ± 0.238
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.099TyrAla: 1.099 ± 0.476
1.648TyrCys: 1.648 ± 0.688
0.275TyrAsp: 0.275 ± 0.192
0.824TyrGlu: 0.824 ± 0.357
1.099TyrPhe: 1.099 ± 0.566
1.374TyrGly: 1.374 ± 0.83
1.099TyrHis: 1.099 ± 0.859
1.374TyrIle: 1.374 ± 0.921
1.923TyrLys: 1.923 ± 0.936
1.374TyrLeu: 1.374 ± 0.432
0.275TyrMet: 0.275 ± 0.192
1.099TyrAsn: 1.099 ± 0.594
1.374TyrPro: 1.374 ± 0.316
1.648TyrGln: 1.648 ± 0.918
2.198TyrArg: 2.198 ± 0.712
1.923TyrSer: 1.923 ± 0.397
1.374TyrThr: 1.374 ± 0.684
1.374TyrVal: 1.374 ± 0.547
0.824TyrTrp: 0.824 ± 0.339
1.099TyrTyr: 1.099 ± 0.521
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3641 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski