Amino acid dipeptide frequency for Human immunodeficiency virus type 1 group M subtype B (isolate YU-2) (HIV-1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the protein sequences.
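As a rough illustration of the idea, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is only a sketch under stated assumptions: the function name `dipeptide_frequencies`, the use of one-letter codes, and the choice to count pairs within each protein separately are all illustrative, since this page does not state the exact counting convention used by the database.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are overlapping windows of length 2 counted inside
    each protein; the database's own convention may differ.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Normalise by the total number of pairs and scale to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences for illustration only, not taken from the HIV-1 proteome.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The order of the two residues matters here, so "KL" and "LK" are counted as different dipeptides, in the same way that AlaCys and CysAla are separate entries in the table.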

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins and all amino acids were equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
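The arithmetic behind the expected value and the unit conversion can be sketched as follows. The three values are copied from the table on this page; the function `classify` and the plain comparison against the uniform expectation are assumptions made for illustration, because the page does not state the exact rule used for the red and blue marking.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / (20 * 20) = 0.0025, i.e. 0.25 % or 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)

def classify(value_permille, expected=expected_permille):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"

# Values copied from the table, in per mille.
table_values = {"LeuLeu": 8.767, "AlaAla": 5.205, "PhePhe": 0.274}

for pair, permille in table_values.items():
    fraction = permille * 1e-3  # per mille -> plain fraction
    print(pair, f"{fraction:.6f}", classify(permille))
```

Under this assumed rule LeuLeu and AlaAla fall above the 2.5‰ expectation and PhePhe falls below it.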

For more information, see the sequence space article on Wikipedia.

Entries are listed as dipeptide: frequency (‰) ± error, grouped by the first amino acid. Within each group the second amino acid follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.205 ± 1.471
AlaCys: 1.918 ± 0.521
AlaAsp: 1.918 ± 0.781
AlaGlu: 5.479 ± 1.173
AlaPhe: 1.644 ± 0.387
AlaGly: 6.301 ± 1.43
AlaHis: 0.822 ± 0.325
AlaIle: 4.658 ± 1.202
AlaLys: 2.466 ± 0.48
AlaLeu: 5.479 ± 0.988
AlaMet: 2.192 ± 0.61
AlaAsn: 1.918 ± 0.787
AlaPro: 2.74 ± 0.995
AlaGln: 1.918 ± 0.345
AlaArg: 3.288 ± 0.557
AlaSer: 4.384 ± 0.987
AlaThr: 5.205 ± 0.895
AlaVal: 3.836 ± 0.79
AlaTrp: 1.37 ± 0.548
AlaTyr: 0.822 ± 0.325
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.822 ± 0.542
CysCys: 0.548 ± 0.62
CysAsp: 0.274 ± 0.185
CysGlu: 0.274 ± 0.42
CysPhe: 2.192 ± 1.387
CysGly: 1.918 ± 0.546
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.37 ± 0.628
CysLeu: 0.548 ± 0.37
CysMet: 0.0 ± 0.0
CysAsn: 1.918 ± 1.285
CysPro: 0.274 ± 0.244
CysGln: 1.644 ± 0.803
CysArg: 1.37 ± 0.512
CysSer: 1.37 ± 0.897
CysThr: 2.74 ± 0.9
CysVal: 1.644 ± 0.519
CysTrp: 0.822 ± 0.4
CysTyr: 0.548 ± 0.62
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.822 ± 0.399
AspCys: 3.014 ± 1.052
AspAsp: 1.37 ± 0.574
AspGlu: 0.548 ± 0.46
AspPhe: 1.096 ± 0.74
AspGly: 2.192 ± 0.629
AspHis: 0.0 ± 0.0
AspIle: 3.288 ± 0.65
AspLys: 3.014 ± 0.752
AspLeu: 4.384 ± 0.91
AspMet: 0.548 ± 0.244
AspAsn: 1.918 ± 0.846
AspPro: 3.562 ± 1.214
AspGln: 1.918 ± 0.73
AspArg: 3.562 ± 0.864
AspSer: 3.014 ± 1.376
AspThr: 3.014 ± 0.602
AspVal: 1.096 ± 0.406
AspTrp: 0.548 ± 0.574
AspTyr: 0.822 ± 0.4
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.932 ± 1.133
GluCys: 0.0 ± 0.0
GluAsp: 2.466 ± 1.137
GluGlu: 7.945 ± 2.02
GluPhe: 0.822 ± 0.325
GluGly: 4.384 ± 1.039
GluHis: 0.548 ± 0.37
GluIle: 4.658 ± 1.729
GluLys: 4.658 ± 1.087
GluLeu: 6.575 ± 1.254
GluMet: 1.37 ± 0.546
GluAsn: 2.192 ± 0.871
GluPro: 6.027 ± 1.656
GluGln: 4.384 ± 0.977
GluArg: 5.479 ± 1.702
GluSer: 2.74 ± 0.968
GluThr: 5.205 ± 1.74
GluVal: 4.11 ± 0.678
GluTrp: 1.644 ± 0.63
GluTyr: 1.37 ± 0.679
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.096 ± 0.278
PheCys: 0.274 ± 0.244
PheAsp: 0.822 ± 0.598
PheGlu: 0.274 ± 0.244
PhePhe: 0.274 ± 0.244
PheGly: 1.37 ± 0.577
PheHis: 0.822 ± 0.731
PheIle: 1.37 ± 0.532
PheLys: 1.096 ± 0.568
PheLeu: 2.74 ± 0.486
PheMet: 0.0 ± 0.0
PheAsn: 2.192 ± 0.969
PhePro: 1.644 ± 0.711
PheGln: 0.822 ± 0.476
PheArg: 3.014 ± 0.734
PheSer: 2.466 ± 0.667
PheThr: 1.918 ± 0.601
PheVal: 0.548 ± 0.22
PheTrp: 0.274 ± 0.185
PheTyr: 1.644 ± 0.376
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.384 ± 0.799
GlyCys: 1.918 ± 0.604
GlyAsp: 2.74 ± 1.089
GlyGlu: 3.836 ± 0.579
GlyPhe: 1.644 ± 0.613
GlyGly: 6.301 ± 1.216
GlyHis: 3.562 ± 1.585
GlyIle: 5.205 ± 1.498
GlyLys: 5.479 ± 1.284
GlyLeu: 4.932 ± 1.095
GlyMet: 0.822 ± 0.312
GlyAsn: 2.192 ± 0.857
GlyPro: 4.932 ± 1.008
GlyGln: 4.384 ± 1.365
GlyArg: 3.562 ± 1.065
GlySer: 4.11 ± 0.922
GlyThr: 4.11 ± 1.764
GlyVal: 3.014 ± 0.674
GlyTrp: 2.192 ± 0.811
GlyTyr: 1.918 ± 0.622
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.822 ± 0.249
HisCys: 0.822 ± 0.637
HisAsp: 0.0 ± 0.0
HisGlu: 0.822 ± 0.325
HisPhe: 0.822 ± 0.856
HisGly: 1.644 ± 0.61
HisHis: 0.822 ± 0.955
HisIle: 2.192 ± 0.633
HisLys: 1.644 ± 0.701
HisLeu: 2.74 ± 1.029
HisMet: 0.548 ± 0.819
HisAsn: 1.096 ± 0.545
HisPro: 2.466 ± 0.965
HisGln: 3.014 ± 1.315
HisArg: 0.822 ± 0.399
HisSer: 1.096 ± 0.44
HisThr: 1.096 ± 0.547
HisVal: 0.548 ± 0.325
HisTrp: 0.0 ± 0.0
HisTyr: 0.548 ± 0.475
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.288 ± 0.863
IleCys: 1.096 ± 0.439
IleAsp: 1.37 ± 0.528
IleGlu: 4.658 ± 0.621
IlePhe: 1.096 ± 0.659
IleGly: 5.205 ± 1.801
IleHis: 1.644 ± 0.551
IleIle: 5.205 ± 1.318
IleLys: 3.836 ± 1.287
IleLeu: 4.932 ± 0.913
IleMet: 0.822 ± 0.249
IleAsn: 1.918 ± 0.596
IlePro: 3.836 ± 1.131
IleGln: 2.466 ± 1.094
IleArg: 5.205 ± 1.412
IleSer: 3.014 ± 0.985
IleThr: 3.288 ± 1.537
IleVal: 6.575 ± 1.62
IleTrp: 1.644 ± 0.519
IleTyr: 2.192 ± 0.673
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.027 ± 1.365
LysCys: 2.192 ± 0.651
LysAsp: 2.192 ± 0.77
LysGlu: 7.123 ± 2.039
LysPhe: 0.822 ± 0.325
LysGly: 4.384 ± 0.792
LysHis: 1.644 ± 0.707
LysIle: 6.301 ± 1.863
LysLys: 7.671 ± 2.523
LysLeu: 5.205 ± 1.516
LysMet: 0.274 ± 0.185
LysAsn: 3.014 ± 0.822
LysPro: 1.37 ± 0.755
LysGln: 3.836 ± 0.983
LysArg: 3.014 ± 0.74
LysSer: 1.644 ± 0.376
LysThr: 4.11 ± 0.798
LysVal: 4.384 ± 0.918
LysTrp: 2.192 ± 0.455
LysTyr: 1.37 ± 0.523
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.658 ± 0.958
LeuCys: 0.822 ± 0.426
LeuAsp: 3.562 ± 0.858
LeuGlu: 7.397 ± 1.44
LeuPhe: 2.192 ± 1.118
LeuGly: 6.575 ± 1.369
LeuHis: 2.466 ± 0.675
LeuIle: 3.288 ± 1.445
LeuLys: 6.027 ± 1.262
LeuLeu: 8.767 ± 3.348
LeuMet: 0.822 ± 0.54
LeuAsn: 4.658 ± 1.104
LeuPro: 2.466 ± 0.712
LeuGln: 5.479 ± 1.267
LeuArg: 5.753 ± 0.767
LeuSer: 3.836 ± 0.962
LeuThr: 3.836 ± 1.003
LeuVal: 5.205 ± 1.499
LeuTrp: 3.014 ± 1.026
LeuTyr: 2.74 ± 0.685
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.37 ± 0.66
MetCys: 0.0 ± 0.0
MetAsp: 1.096 ± 0.649
MetGlu: 1.644 ± 0.891
MetPhe: 0.548 ± 0.315
MetGly: 1.37 ± 0.308
MetHis: 0.548 ± 0.22
MetIle: 1.096 ± 0.531
MetLys: 0.822 ± 0.249
MetLeu: 1.37 ± 0.512
MetMet: 1.096 ± 0.629
MetAsn: 0.822 ± 0.45
MetPro: 0.0 ± 0.0
MetGln: 1.096 ± 0.629
MetArg: 1.918 ± 0.762
MetSer: 0.822 ± 0.409
MetThr: 2.466 ± 0.827
MetVal: 0.822 ± 0.249
MetTrp: 0.548 ± 0.488
MetTyr: 1.096 ± 0.393
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.74 ± 0.819
AsnCys: 3.014 ± 0.807
AsnAsp: 1.096 ± 0.439
AsnGlu: 2.74 ± 0.73
AsnPhe: 3.014 ± 0.907
AsnGly: 1.644 ± 0.815
AsnHis: 0.0 ± 0.0
AsnIle: 1.644 ± 0.852
AsnLys: 3.014 ± 0.693
AsnLeu: 3.288 ± 0.664
AsnMet: 1.37 ± 0.974
AsnAsn: 3.836 ± 1.3
AsnPro: 2.74 ± 0.572
AsnGln: 1.096 ± 0.278
AsnArg: 1.644 ± 0.696
AsnSer: 2.466 ± 0.665
AsnThr: 4.11 ± 0.929
AsnVal: 1.096 ± 0.659
AsnTrp: 1.644 ± 0.39
AsnTyr: 1.644 ± 0.593
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.014 ± 0.799
ProCys: 0.822 ± 0.732
ProAsp: 2.74 ± 0.508
ProGlu: 3.014 ± 0.92
ProPhe: 1.644 ± 0.681
ProGly: 4.932 ± 1.322
ProHis: 0.822 ± 0.54
ProIle: 4.932 ± 1.329
ProLys: 1.918 ± 0.489
ProLeu: 4.384 ± 0.876
ProMet: 1.096 ± 0.659
ProAsn: 1.096 ± 0.738
ProPro: 3.562 ± 2.023
ProGln: 3.562 ± 0.942
ProArg: 3.288 ± 1.268
ProSer: 3.014 ± 1.354
ProThr: 3.014 ± 1.209
ProVal: 5.479 ± 1.219
ProTrp: 1.37 ± 1.036
ProTyr: 0.548 ± 0.37
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.753 ± 0.953
GlnCys: 0.274 ± 0.244
GlnAsp: 2.466 ± 0.879
GlnGlu: 4.384 ± 0.731
GlnPhe: 0.274 ± 0.244
GlnGly: 4.932 ± 0.847
GlnHis: 1.918 ± 0.779
GlnIle: 4.384 ± 1.116
GlnLys: 3.562 ± 1.274
GlnLeu: 6.027 ± 1.235
GlnMet: 3.288 ± 1.359
GlnAsn: 3.562 ± 1.044
GlnPro: 1.918 ± 0.904
GlnGln: 2.74 ± 1.286
GlnArg: 4.11 ± 1.363
GlnSer: 2.466 ± 0.902
GlnThr: 1.918 ± 0.533
GlnVal: 4.11 ± 1.467
GlnTrp: 0.822 ± 0.325
GlnTyr: 2.192 ± 0.636
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.479 ± 1.109
ArgCys: 0.548 ± 0.475
ArgAsp: 4.384 ± 1.148
ArgGlu: 4.932 ± 1.107
ArgPhe: 1.644 ± 0.938
ArgGly: 3.562 ± 0.829
ArgHis: 0.822 ± 0.712
ArgIle: 3.014 ± 1.815
ArgLys: 5.479 ± 1.062
ArgLeu: 3.288 ± 1.051
ArgMet: 1.37 ± 0.803
ArgAsn: 1.918 ± 0.609
ArgPro: 4.11 ± 1.337
ArgGln: 6.575 ± 1.33
ArgArg: 5.205 ± 3.386
ArgSer: 3.288 ± 1.203
ArgThr: 2.466 ± 0.9
ArgVal: 2.192 ± 0.743
ArgTrp: 2.466 ± 0.937
ArgTyr: 0.822 ± 0.399
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.562 ± 0.775
SerCys: 0.548 ± 0.22
SerAsp: 2.466 ± 0.616
SerGlu: 4.384 ± 0.95
SerPhe: 1.644 ± 0.852
SerGly: 3.288 ± 2.097
SerHis: 0.548 ± 0.451
SerIle: 2.74 ± 0.709
SerLys: 2.466 ± 0.835
SerLeu: 5.753 ± 1.628
SerMet: 1.096 ± 0.475
SerAsn: 1.918 ± 0.995
SerPro: 4.11 ± 1.093
SerGln: 6.027 ± 2.089
SerArg: 2.466 ± 1.167
SerSer: 3.836 ± 0.895
SerThr: 2.192 ± 0.743
SerVal: 2.192 ± 0.445
SerTrp: 0.822 ± 0.426
SerTyr: 1.096 ± 0.738
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.014 ± 0.699
ThrCys: 0.0 ± 0.0
ThrAsp: 2.192 ± 0.682
ThrGlu: 5.753 ± 0.795
ThrPhe: 0.548 ± 0.325
ThrGly: 3.562 ± 0.513
ThrHis: 2.466 ± 1.193
ThrIle: 3.836 ± 1.053
ThrLys: 4.11 ± 0.925
ThrLeu: 6.027 ± 1.307
ThrMet: 1.37 ± 0.476
ThrAsn: 3.288 ± 0.6
ThrPro: 3.288 ± 0.907
ThrGln: 2.466 ± 0.977
ThrArg: 2.74 ± 1.064
ThrSer: 3.836 ± 0.698
ThrThr: 4.384 ± 1.103
ThrVal: 5.753 ± 1.081
ThrTrp: 1.644 ± 0.524
ThrTyr: 0.822 ± 0.54
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.288 ± 0.798
ValCys: 0.548 ± 0.62
ValAsp: 4.384 ± 0.877
ValGlu: 3.288 ± 0.823
ValPhe: 1.096 ± 0.406
ValGly: 4.658 ± 0.626
ValHis: 3.014 ± 0.844
ValIle: 3.562 ± 0.834
ValLys: 4.932 ± 1.08
ValLeu: 4.658 ± 0.893
ValMet: 0.274 ± 0.42
ValAsn: 1.37 ± 0.652
ValPro: 3.014 ± 0.932
ValGln: 3.836 ± 1.024
ValArg: 3.288 ± 0.684
ValSer: 3.562 ± 0.774
ValThr: 3.288 ± 0.978
ValVal: 4.11 ± 1.175
ValTrp: 2.192 ± 0.678
ValTyr: 1.37 ± 0.472
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.918 ± 0.429
TrpCys: 0.274 ± 0.316
TrpAsp: 1.37 ± 0.528
TrpGlu: 2.192 ± 0.569
TrpPhe: 0.548 ± 0.475
TrpGly: 1.644 ± 0.66
TrpHis: 0.274 ± 0.42
TrpIle: 1.096 ± 0.278
TrpLys: 3.288 ± 0.512
TrpLeu: 1.096 ± 0.806
TrpMet: 1.644 ± 0.552
TrpAsn: 1.37 ± 1.044
TrpPro: 1.37 ± 0.662
TrpGln: 1.918 ± 0.827
TrpArg: 1.918 ± 0.646
TrpSer: 0.548 ± 0.37
TrpThr: 1.644 ± 0.713
TrpVal: 1.644 ± 0.33
TrpTrp: 0.822 ± 0.325
TrpTyr: 0.548 ± 0.22
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.37 ± 0.523
TyrCys: 1.644 ± 0.612
TyrAsp: 0.822 ± 0.325
TyrGlu: 0.822 ± 0.54
TyrPhe: 1.096 ± 0.547
TyrGly: 1.37 ± 0.773
TyrHis: 0.822 ± 0.312
TyrIle: 0.822 ± 0.399
TyrLys: 1.918 ± 0.491
TyrLeu: 1.37 ± 0.559
TyrMet: 0.274 ± 0.185
TyrAsn: 1.644 ± 0.559
TyrPro: 0.822 ± 0.614
TyrGln: 2.192 ± 0.752
TyrArg: 1.918 ± 0.871
TyrSer: 1.644 ± 0.376
TyrThr: 1.096 ± 0.456
TyrVal: 1.37 ± 0.697
TyrTrp: 1.096 ± 0.531
TyrTyr: 1.37 ± 0.485
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3651 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
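A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates serves as the error. This is a minimal sketch, not the database's actual procedure. The names `pair_counts` and `bootstrap_error`, the fixed seed, the within-protein pair counting, and the use of the standard deviation as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so the
    resampling unit is the protein rather than the individual residue.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy sequences for illustration only, not taken from the HIV-1 proteome.
mean_kk, err_kk = bootstrap_error(["MKKLA", "AKK", "GGKKG"], "KK")
print(f"KK: {mean_kk:.3f} ± {err_kk:.3f}")
```

A dipeptide that is absent from every protein yields zero in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.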

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski