Amino acid dipepetide frequency for Human immunodeficiency virus type 1 group M subtype A (isolate MAL) (HIV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.112AlaAla: 7.112 ± 2.643
1.368AlaCys: 1.368 ± 0.346
1.368AlaAsp: 1.368 ± 0.579
5.47AlaGlu: 5.47 ± 1.101
1.368AlaPhe: 1.368 ± 0.563
4.376AlaGly: 4.376 ± 0.775
1.094AlaHis: 1.094 ± 0.64
5.744AlaIle: 5.744 ± 1.278
1.915AlaLys: 1.915 ± 0.582
5.744AlaLeu: 5.744 ± 0.708
2.188AlaMet: 2.188 ± 0.526
2.462AlaAsn: 2.462 ± 1.028
2.462AlaPro: 2.462 ± 0.941
2.188AlaGln: 2.188 ± 0.738
4.376AlaArg: 4.376 ± 0.777
3.829AlaSer: 3.829 ± 0.823
4.923AlaThr: 4.923 ± 1.378
3.829AlaVal: 3.829 ± 0.921
1.368AlaTrp: 1.368 ± 0.644
0.821AlaTyr: 0.821 ± 0.34
0.0AlaXaa: 0.0 ± 0.0
Cys
0.547CysAla: 0.547 ± 0.499
0.547CysCys: 0.547 ± 0.677
0.547CysAsp: 0.547 ± 0.34
0.274CysGlu: 0.274 ± 0.336
1.641CysPhe: 1.641 ± 0.956
1.915CysGly: 1.915 ± 0.756
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.915CysLys: 1.915 ± 0.537
0.821CysLeu: 0.821 ± 0.53
0.274CysMet: 0.274 ± 0.312
1.641CysAsn: 1.641 ± 1.048
0.274CysPro: 0.274 ± 0.249
2.188CysGln: 2.188 ± 0.938
1.094CysArg: 1.094 ± 0.392
1.094CysSer: 1.094 ± 0.675
2.735CysThr: 2.735 ± 1.19
1.641CysVal: 1.641 ± 0.543
0.821CysTrp: 0.821 ± 0.378
1.094CysTyr: 1.094 ± 1.353
0.0CysXaa: 0.0 ± 0.0
Asp
2.188AspAla: 2.188 ± 0.922
2.735AspCys: 2.735 ± 0.838
1.915AspAsp: 1.915 ± 0.635
0.821AspGlu: 0.821 ± 0.461
0.821AspPhe: 0.821 ± 0.576
1.368AspGly: 1.368 ± 0.429
0.274AspHis: 0.274 ± 0.438
3.556AspIle: 3.556 ± 0.899
4.65AspLys: 4.65 ± 1.206
4.103AspLeu: 4.103 ± 1.0
1.094AspMet: 1.094 ± 0.485
2.188AspAsn: 2.188 ± 1.087
3.282AspPro: 3.282 ± 2.035
1.641AspGln: 1.641 ± 0.592
3.009AspArg: 3.009 ± 1.093
1.915AspSer: 1.915 ± 0.53
2.188AspThr: 2.188 ± 0.761
1.368AspVal: 1.368 ± 0.509
1.094AspTrp: 1.094 ± 0.587
1.094AspTyr: 1.094 ± 0.393
0.0AspXaa: 0.0 ± 0.0
Glu
4.923GluAla: 4.923 ± 1.049
0.274GluCys: 0.274 ± 0.249
2.462GluAsp: 2.462 ± 0.932
8.206GluGlu: 8.206 ± 2.11
1.094GluPhe: 1.094 ± 0.503
4.923GluGly: 4.923 ± 0.748
0.821GluHis: 0.821 ± 0.576
6.565GluIle: 6.565 ± 1.179
3.556GluLys: 3.556 ± 0.457
6.018GluLeu: 6.018 ± 1.213
1.094GluMet: 1.094 ± 0.64
1.915GluAsn: 1.915 ± 0.423
4.103GluPro: 4.103 ± 1.037
5.197GluGln: 5.197 ± 1.055
3.829GluArg: 3.829 ± 0.845
3.009GluSer: 3.009 ± 0.754
5.47GluThr: 5.47 ± 1.446
3.556GluVal: 3.556 ± 0.923
1.094GluTrp: 1.094 ± 0.576
0.821GluTyr: 0.821 ± 0.278
0.0GluXaa: 0.0 ± 0.0
Phe
1.368PheAla: 1.368 ± 0.346
0.274PheCys: 0.274 ± 0.249
1.094PheAsp: 1.094 ± 0.819
0.274PheGlu: 0.274 ± 0.249
0.821PhePhe: 0.821 ± 0.237
1.915PheGly: 1.915 ± 0.923
0.0PheHis: 0.0 ± 0.0
2.188PheIle: 2.188 ± 1.017
1.641PheLys: 1.641 ± 0.724
2.188PheLeu: 2.188 ± 0.492
0.0PheMet: 0.0 ± 0.0
2.462PheAsn: 2.462 ± 1.251
2.188PhePro: 2.188 ± 1.071
0.274PheGln: 0.274 ± 0.192
2.188PheArg: 2.188 ± 1.056
2.735PheSer: 2.735 ± 0.412
0.821PheThr: 0.821 ± 0.576
0.547PheVal: 0.547 ± 0.228
0.274PheTrp: 0.274 ± 0.192
1.094PheTyr: 1.094 ± 0.502
0.0PheXaa: 0.0 ± 0.0
Gly
5.197GlyAla: 5.197 ± 1.078
1.915GlyCys: 1.915 ± 0.521
3.556GlyAsp: 3.556 ± 0.765
3.282GlyGlu: 3.282 ± 0.428
2.462GlyPhe: 2.462 ± 0.949
6.565GlyGly: 6.565 ± 1.285
3.282GlyHis: 3.282 ± 2.003
6.291GlyIle: 6.291 ± 1.648
6.018GlyLys: 6.018 ± 2.078
3.829GlyLeu: 3.829 ± 0.973
1.368GlyMet: 1.368 ± 0.632
2.735GlyAsn: 2.735 ± 0.993
4.65GlyPro: 4.65 ± 1.043
4.923GlyGln: 4.923 ± 1.412
3.282GlyArg: 3.282 ± 1.006
5.744GlySer: 5.744 ± 1.205
2.735GlyThr: 2.735 ± 1.412
3.009GlyVal: 3.009 ± 0.575
1.641GlyTrp: 1.641 ± 0.624
1.641GlyTyr: 1.641 ± 0.59
0.0GlyXaa: 0.0 ± 0.0
His
0.821HisAla: 0.821 ± 0.237
0.547HisCys: 0.547 ± 0.677
0.274HisAsp: 0.274 ± 0.414
0.547HisGlu: 0.547 ± 0.228
1.094HisPhe: 1.094 ± 0.929
1.368HisGly: 1.368 ± 0.394
0.547HisHis: 0.547 ± 0.75
1.368HisIle: 1.368 ± 1.027
1.094HisLys: 1.094 ± 0.577
3.009HisLeu: 3.009 ± 0.762
0.547HisMet: 0.547 ± 0.667
1.094HisAsn: 1.094 ± 0.576
2.735HisPro: 2.735 ± 1.322
3.009HisGln: 3.009 ± 1.186
1.368HisArg: 1.368 ± 0.685
1.368HisSer: 1.368 ± 0.963
1.094HisThr: 1.094 ± 0.486
0.274HisVal: 0.274 ± 0.192
0.0HisTrp: 0.0 ± 0.0
0.547HisTyr: 0.547 ± 0.44
0.0HisXaa: 0.0 ± 0.0
Ile
3.009IleAla: 3.009 ± 0.644
1.094IleCys: 1.094 ± 0.457
1.915IleAsp: 1.915 ± 0.653
4.65IleGlu: 4.65 ± 0.815
1.094IlePhe: 1.094 ± 0.675
5.197IleGly: 5.197 ± 1.581
2.188IleHis: 2.188 ± 0.429
5.197IleIle: 5.197 ± 1.298
4.376IleLys: 4.376 ± 1.132
5.744IleLeu: 5.744 ± 1.018
1.368IleMet: 1.368 ± 0.466
2.735IleAsn: 2.735 ± 0.964
3.829IlePro: 3.829 ± 0.932
3.556IleGln: 3.556 ± 1.581
6.018IleArg: 6.018 ± 1.834
3.282IleSer: 3.282 ± 1.08
2.735IleThr: 2.735 ± 1.258
7.385IleVal: 7.385 ± 1.213
2.188IleTrp: 2.188 ± 0.526
1.915IleTyr: 1.915 ± 0.581
0.0IleXaa: 0.0 ± 0.0
Lys
4.65LysAla: 4.65 ± 1.429
3.009LysCys: 3.009 ± 1.269
3.556LysAsp: 3.556 ± 1.316
6.291LysGlu: 6.291 ± 1.709
0.821LysPhe: 0.821 ± 0.278
4.65LysGly: 4.65 ± 1.419
1.368LysHis: 1.368 ± 0.267
6.018LysIle: 6.018 ± 1.781
6.018LysLys: 6.018 ± 1.67
4.65LysLeu: 4.65 ± 1.084
0.547LysMet: 0.547 ± 0.228
3.556LysAsn: 3.556 ± 1.002
1.641LysPro: 1.641 ± 0.335
3.009LysGln: 3.009 ± 0.982
3.556LysArg: 3.556 ± 0.588
2.735LysSer: 2.735 ± 0.882
4.65LysThr: 4.65 ± 1.015
3.556LysVal: 3.556 ± 0.994
2.188LysTrp: 2.188 ± 0.613
1.915LysTyr: 1.915 ± 0.522
0.0LysXaa: 0.0 ± 0.0
Leu
4.103LeuAla: 4.103 ± 0.824
0.821LeuCys: 0.821 ± 0.438
4.923LeuAsp: 4.923 ± 0.983
5.197LeuGlu: 5.197 ± 1.293
1.641LeuPhe: 1.641 ± 0.648
6.291LeuGly: 6.291 ± 1.915
1.094LeuHis: 1.094 ± 0.47
3.829LeuIle: 3.829 ± 1.62
7.659LeuLys: 7.659 ± 1.642
7.659LeuLeu: 7.659 ± 3.13
0.821LeuMet: 0.821 ± 0.558
5.197LeuAsn: 5.197 ± 1.139
3.009LeuPro: 3.009 ± 0.799
4.923LeuGln: 4.923 ± 0.971
4.376LeuArg: 4.376 ± 0.706
4.103LeuSer: 4.103 ± 0.827
4.65LeuThr: 4.65 ± 1.104
5.197LeuVal: 5.197 ± 1.407
2.735LeuTrp: 2.735 ± 0.963
2.735LeuTyr: 2.735 ± 0.682
0.0LeuXaa: 0.0 ± 0.0
Met
1.094MetAla: 1.094 ± 0.607
0.547MetCys: 0.547 ± 0.677
1.094MetAsp: 1.094 ± 0.64
1.915MetGlu: 1.915 ± 0.574
0.821MetPhe: 0.821 ± 0.237
1.641MetGly: 1.641 ± 0.476
0.547MetHis: 0.547 ± 0.228
1.641MetIle: 1.641 ± 0.31
0.547MetLys: 0.547 ± 0.288
1.915MetLeu: 1.915 ± 0.462
1.915MetMet: 1.915 ± 0.737
0.547MetAsn: 0.547 ± 0.34
0.0MetPro: 0.0 ± 0.0
1.368MetGln: 1.368 ± 0.466
1.368MetArg: 1.368 ± 0.456
0.821MetSer: 0.821 ± 0.467
2.735MetThr: 2.735 ± 0.724
0.274MetVal: 0.274 ± 0.249
0.821MetTrp: 0.821 ± 0.748
1.094MetTyr: 1.094 ± 0.347
0.0MetXaa: 0.0 ± 0.0
Asn
2.462AsnAla: 2.462 ± 0.748
3.282AsnCys: 3.282 ± 1.203
1.094AsnAsp: 1.094 ± 0.263
3.282AsnGlu: 3.282 ± 0.723
2.735AsnPhe: 2.735 ± 1.27
2.462AsnGly: 2.462 ± 1.441
0.547AsnHis: 0.547 ± 0.677
2.462AsnIle: 2.462 ± 0.748
3.009AsnLys: 3.009 ± 0.856
3.282AsnLeu: 3.282 ± 1.259
1.915AsnMet: 1.915 ± 0.77
2.188AsnAsn: 2.188 ± 1.144
3.009AsnPro: 3.009 ± 0.959
1.094AsnGln: 1.094 ± 0.64
2.188AsnArg: 2.188 ± 0.66
4.103AsnSer: 4.103 ± 1.533
2.735AsnThr: 2.735 ± 0.964
1.094AsnVal: 1.094 ± 0.675
2.188AsnTrp: 2.188 ± 0.457
1.368AsnTyr: 1.368 ± 0.455
0.0AsnXaa: 0.0 ± 0.0
Pro
2.735ProAla: 2.735 ± 0.686
1.094ProCys: 1.094 ± 0.803
2.462ProAsp: 2.462 ± 0.569
3.829ProGlu: 3.829 ± 1.056
1.094ProPhe: 1.094 ± 0.525
5.744ProGly: 5.744 ± 1.194
0.547ProHis: 0.547 ± 0.45
4.65ProIle: 4.65 ± 1.238
2.735ProLys: 2.735 ± 1.05
4.65ProLeu: 4.65 ± 1.424
1.094ProMet: 1.094 ± 0.761
1.094ProAsn: 1.094 ± 0.823
4.923ProPro: 4.923 ± 1.344
3.556ProGln: 3.556 ± 0.953
3.829ProArg: 3.829 ± 0.803
2.462ProSer: 2.462 ± 0.95
2.188ProThr: 2.188 ± 0.614
5.197ProVal: 5.197 ± 0.806
1.094ProTrp: 1.094 ± 0.956
0.547ProTyr: 0.547 ± 0.384
0.0ProXaa: 0.0 ± 0.0
Gln
7.659GlnAla: 7.659 ± 1.283
0.274GlnCys: 0.274 ± 0.249
3.009GlnAsp: 3.009 ± 0.796
4.376GlnGlu: 4.376 ± 0.64
0.0GlnPhe: 0.0 ± 0.0
5.47GlnGly: 5.47 ± 0.846
1.915GlnHis: 1.915 ± 1.084
4.376GlnIle: 4.376 ± 1.217
4.65GlnLys: 4.65 ± 1.606
5.744GlnLeu: 5.744 ± 0.907
3.829GlnMet: 3.829 ± 1.363
3.556GlnAsn: 3.556 ± 0.916
1.641GlnPro: 1.641 ± 1.177
3.556GlnGln: 3.556 ± 0.905
3.829GlnArg: 3.829 ± 1.317
1.915GlnSer: 1.915 ± 0.554
1.368GlnThr: 1.368 ± 0.643
1.915GlnVal: 1.915 ± 0.681
0.821GlnTrp: 0.821 ± 0.34
1.915GlnTyr: 1.915 ± 0.756
0.0GlnXaa: 0.0 ± 0.0
Arg
4.923ArgAla: 4.923 ± 0.841
0.274ArgCys: 0.274 ± 0.375
2.188ArgAsp: 2.188 ± 0.595
5.197ArgGlu: 5.197 ± 1.277
1.641ArgPhe: 1.641 ± 0.577
3.556ArgGly: 3.556 ± 0.804
1.094ArgHis: 1.094 ± 1.063
5.197ArgIle: 5.197 ± 1.488
3.282ArgLys: 3.282 ± 1.31
3.829ArgLeu: 3.829 ± 1.083
0.821ArgMet: 0.821 ± 0.395
1.915ArgAsn: 1.915 ± 0.523
3.829ArgPro: 3.829 ± 1.306
6.291ArgGln: 6.291 ± 1.503
5.197ArgArg: 5.197 ± 3.051
1.641ArgSer: 1.641 ± 1.157
3.009ArgThr: 3.009 ± 0.902
3.556ArgVal: 3.556 ± 0.911
1.915ArgTrp: 1.915 ± 0.686
0.821ArgTyr: 0.821 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
2.188SerAla: 2.188 ± 0.377
0.547SerCys: 0.547 ± 0.321
3.009SerAsp: 3.009 ± 0.935
3.829SerGlu: 3.829 ± 0.745
1.368SerPhe: 1.368 ± 0.346
3.282SerGly: 3.282 ± 1.014
1.368SerHis: 1.368 ± 0.805
3.009SerIle: 3.009 ± 0.643
1.641SerLys: 1.641 ± 0.779
5.47SerLeu: 5.47 ± 2.222
0.821SerMet: 0.821 ± 0.481
2.735SerAsn: 2.735 ± 0.964
4.65SerPro: 4.65 ± 1.302
4.376SerGln: 4.376 ± 1.662
2.462SerArg: 2.462 ± 1.121
4.923SerSer: 4.923 ± 1.341
4.103SerThr: 4.103 ± 1.026
2.462SerVal: 2.462 ± 0.532
0.821SerTrp: 0.821 ± 0.34
1.368SerTyr: 1.368 ± 0.962
0.0SerXaa: 0.0 ± 0.0
Thr
3.282ThrAla: 3.282 ± 0.673
0.274ThrCys: 0.274 ± 0.249
2.188ThrAsp: 2.188 ± 0.882
5.744ThrGlu: 5.744 ± 0.682
1.368ThrPhe: 1.368 ± 0.632
4.376ThrGly: 4.376 ± 0.615
1.094ThrHis: 1.094 ± 0.675
2.462ThrIle: 2.462 ± 0.392
4.65ThrLys: 4.65 ± 0.827
7.385ThrLeu: 7.385 ± 1.213
0.547ThrMet: 0.547 ± 0.44
1.915ThrAsn: 1.915 ± 0.536
4.103ThrPro: 4.103 ± 0.687
3.009ThrGln: 3.009 ± 0.448
2.188ThrArg: 2.188 ± 1.066
2.462ThrSer: 2.462 ± 0.543
3.009ThrThr: 3.009 ± 1.193
3.556ThrVal: 3.556 ± 1.118
1.641ThrTrp: 1.641 ± 0.599
1.368ThrTyr: 1.368 ± 0.792
0.0ThrXaa: 0.0 ± 0.0
Val
3.009ValAla: 3.009 ± 0.673
0.0ValCys: 0.0 ± 0.0
2.462ValAsp: 2.462 ± 0.972
2.462ValGlu: 2.462 ± 1.054
0.821ValPhe: 0.821 ± 0.445
5.47ValGly: 5.47 ± 0.978
3.009ValHis: 3.009 ± 0.827
3.282ValIle: 3.282 ± 0.552
4.65ValLys: 4.65 ± 1.133
3.009ValLeu: 3.009 ± 0.713
0.274ValMet: 0.274 ± 0.375
2.735ValAsn: 2.735 ± 1.094
3.009ValPro: 3.009 ± 0.667
3.556ValGln: 3.556 ± 0.911
3.282ValArg: 3.282 ± 0.671
4.103ValSer: 4.103 ± 1.351
2.735ValThr: 2.735 ± 1.02
3.556ValVal: 3.556 ± 1.027
2.188ValTrp: 2.188 ± 0.649
1.368ValTyr: 1.368 ± 0.429
0.0ValXaa: 0.0 ± 0.0
Trp
1.641TrpAla: 1.641 ± 0.399
0.274TrpCys: 0.274 ± 0.336
1.641TrpAsp: 1.641 ± 0.65
2.188TrpGlu: 2.188 ± 0.515
0.547TrpPhe: 0.547 ± 0.44
2.188TrpGly: 2.188 ± 0.826
0.547TrpHis: 0.547 ± 0.75
1.094TrpIle: 1.094 ± 0.263
2.462TrpLys: 2.462 ± 0.569
0.547TrpLeu: 0.547 ± 0.475
1.641TrpMet: 1.641 ± 0.502
1.915TrpAsn: 1.915 ± 1.26
1.094TrpPro: 1.094 ± 0.48
2.188TrpGln: 2.188 ± 0.671
1.368TrpArg: 1.368 ± 0.453
0.821TrpSer: 0.821 ± 0.658
1.641TrpThr: 1.641 ± 0.82
1.368TrpVal: 1.368 ± 0.341
0.821TrpTrp: 0.821 ± 0.34
0.547TrpTyr: 0.547 ± 0.228
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.094TyrAla: 1.094 ± 0.457
1.915TyrCys: 1.915 ± 0.801
0.547TyrAsp: 0.547 ± 0.384
1.094TyrGlu: 1.094 ± 0.576
1.094TyrPhe: 1.094 ± 0.486
1.368TyrGly: 1.368 ± 0.877
1.368TyrHis: 1.368 ± 0.68
0.547TyrIle: 0.547 ± 0.228
1.915TyrLys: 1.915 ± 0.599
1.368TyrLeu: 1.368 ± 0.596
0.274TyrMet: 0.274 ± 0.192
1.915TyrAsn: 1.915 ± 0.522
1.094TyrPro: 1.094 ± 0.577
2.188TyrGln: 2.188 ± 0.753
1.368TyrArg: 1.368 ± 0.267
1.368TyrSer: 1.368 ± 0.267
1.094TyrThr: 1.094 ± 0.42
1.641TyrVal: 1.641 ± 0.609
0.821TyrTrp: 0.821 ± 0.349
1.368TyrTyr: 1.368 ± 0.394
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3657 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski