Amino acid dipepetide frequency for Human immunodeficiency virus type 1 group M subtype B (isolate BRU/LAI) (HIV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.453AlaAla: 5.453 ± 2.021
2.454AlaCys: 2.454 ± 0.775
1.636AlaAsp: 1.636 ± 0.555
4.907AlaGlu: 4.907 ± 1.238
1.908AlaPhe: 1.908 ± 0.317
4.635AlaGly: 4.635 ± 1.19
0.818AlaHis: 0.818 ± 0.312
4.362AlaIle: 4.362 ± 1.18
1.908AlaLys: 1.908 ± 0.847
5.725AlaLeu: 5.725 ± 1.118
1.908AlaMet: 1.908 ± 0.62
2.181AlaAsn: 2.181 ± 0.58
2.999AlaPro: 2.999 ± 1.123
1.636AlaGln: 1.636 ± 0.394
4.362AlaArg: 4.362 ± 0.721
4.907AlaSer: 4.907 ± 0.811
4.635AlaThr: 4.635 ± 0.853
4.362AlaVal: 4.362 ± 0.906
1.091AlaTrp: 1.091 ± 0.482
0.818AlaTyr: 0.818 ± 0.312
0.0AlaXaa: 0.0 ± 0.0
Cys
0.818CysAla: 0.818 ± 0.611
0.545CysCys: 0.545 ± 0.698
0.273CysAsp: 0.273 ± 0.196
0.273CysGlu: 0.273 ± 0.319
1.908CysPhe: 1.908 ± 1.417
1.908CysGly: 1.908 ± 0.508
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.363CysLys: 1.363 ± 0.719
0.273CysLeu: 0.273 ± 0.266
0.273CysMet: 0.273 ± 0.371
1.363CysAsn: 1.363 ± 1.107
0.273CysPro: 0.273 ± 0.266
1.363CysGln: 1.363 ± 0.719
1.908CysArg: 1.908 ± 0.552
1.363CysSer: 1.363 ± 0.956
3.272CysThr: 3.272 ± 0.909
1.636CysVal: 1.636 ± 0.531
0.818CysTrp: 0.818 ± 0.406
0.818CysTyr: 0.818 ± 0.853
0.0CysXaa: 0.0 ± 0.0
Asp
0.818AspAla: 0.818 ± 0.369
2.726AspCys: 2.726 ± 0.954
1.636AspAsp: 1.636 ± 0.57
1.091AspGlu: 1.091 ± 0.576
1.091AspPhe: 1.091 ± 0.782
1.363AspGly: 1.363 ± 0.498
0.0AspHis: 0.0 ± 0.0
3.544AspIle: 3.544 ± 0.798
2.999AspLys: 2.999 ± 1.035
3.817AspLeu: 3.817 ± 1.091
0.818AspMet: 0.818 ± 0.444
1.636AspAsn: 1.636 ± 0.642
2.726AspPro: 2.726 ± 1.114
1.636AspGln: 1.636 ± 0.582
4.362AspArg: 4.362 ± 1.156
2.454AspSer: 2.454 ± 0.928
2.999AspThr: 2.999 ± 0.531
0.545AspVal: 0.545 ± 0.391
0.545AspTrp: 0.545 ± 0.484
0.818AspTyr: 0.818 ± 0.406
0.0AspXaa: 0.0 ± 0.0
Glu
5.18GluAla: 5.18 ± 1.145
0.0GluCys: 0.0 ± 0.0
2.454GluAsp: 2.454 ± 0.993
7.361GluGlu: 7.361 ± 1.59
1.091GluPhe: 1.091 ± 0.475
5.18GluGly: 5.18 ± 0.627
0.545GluHis: 0.545 ± 0.391
4.362GluIle: 4.362 ± 0.86
4.635GluLys: 4.635 ± 0.808
7.088GluLeu: 7.088 ± 1.216
2.454GluMet: 2.454 ± 1.077
1.363GluAsn: 1.363 ± 0.425
5.998GluPro: 5.998 ± 1.856
4.089GluGln: 4.089 ± 0.722
4.089GluArg: 4.089 ± 1.438
2.454GluSer: 2.454 ± 0.819
4.362GluThr: 4.362 ± 2.018
4.362GluVal: 4.362 ± 0.689
1.908GluTrp: 1.908 ± 0.557
1.363GluTyr: 1.363 ± 0.583
0.0GluXaa: 0.0 ± 0.0
Phe
1.363PheAla: 1.363 ± 0.319
0.273PheCys: 0.273 ± 0.266
0.545PheAsp: 0.545 ± 0.484
0.273PheGlu: 0.273 ± 0.266
0.545PhePhe: 0.545 ± 0.531
1.091PheGly: 1.091 ± 0.348
0.818PheHis: 0.818 ± 0.853
1.363PheIle: 1.363 ± 0.505
1.363PheLys: 1.363 ± 0.498
3.272PheLeu: 3.272 ± 0.62
0.0PheMet: 0.0 ± 0.0
2.999PheAsn: 2.999 ± 1.457
1.363PhePro: 1.363 ± 0.938
0.545PheGln: 0.545 ± 0.214
2.999PheArg: 2.999 ± 0.989
2.181PheSer: 2.181 ± 0.596
1.363PheThr: 1.363 ± 0.609
0.545PheVal: 0.545 ± 0.214
0.273PheTrp: 0.273 ± 0.196
1.636PheTyr: 1.636 ± 0.431
0.0PheXaa: 0.0 ± 0.0
Gly
4.635GlyAla: 4.635 ± 0.786
1.908GlyCys: 1.908 ± 0.572
2.181GlyAsp: 2.181 ± 0.817
3.817GlyGlu: 3.817 ± 0.441
1.091GlyPhe: 1.091 ± 0.502
6.27GlyGly: 6.27 ± 0.897
4.362GlyHis: 4.362 ± 1.74
5.725GlyIle: 5.725 ± 1.687
5.725GlyLys: 5.725 ± 1.138
4.089GlyLeu: 4.089 ± 0.668
0.545GlyMet: 0.545 ± 0.398
2.726GlyAsn: 2.726 ± 0.927
4.907GlyPro: 4.907 ± 0.848
4.362GlyGln: 4.362 ± 1.493
3.272GlyArg: 3.272 ± 0.893
5.453GlySer: 5.453 ± 1.55
3.272GlyThr: 3.272 ± 1.281
3.817GlyVal: 3.817 ± 1.362
2.181GlyTrp: 2.181 ± 0.68
1.636GlyTyr: 1.636 ± 0.648
0.0GlyXaa: 0.0 ± 0.0
His
1.091HisAla: 1.091 ± 0.42
0.818HisCys: 0.818 ± 0.726
0.0HisAsp: 0.0 ± 0.0
0.545HisGlu: 0.545 ± 0.214
0.818HisPhe: 0.818 ± 1.038
1.636HisGly: 1.636 ± 0.866
1.091HisHis: 1.091 ± 0.828
2.181HisIle: 2.181 ± 0.806
1.091HisLys: 1.091 ± 0.562
2.454HisLeu: 2.454 ± 0.864
0.545HisMet: 0.545 ± 0.624
1.091HisAsn: 1.091 ± 0.522
2.454HisPro: 2.454 ± 1.028
2.999HisGln: 2.999 ± 1.355
0.818HisArg: 0.818 ± 0.369
2.181HisSer: 2.181 ± 0.67
1.363HisThr: 1.363 ± 0.658
0.545HisVal: 0.545 ± 0.398
0.0HisTrp: 0.0 ± 0.0
0.545HisTyr: 0.545 ± 0.452
0.0HisXaa: 0.0 ± 0.0
Ile
3.544IleAla: 3.544 ± 1.218
1.091IleCys: 1.091 ± 0.428
1.636IleAsp: 1.636 ± 1.206
4.635IleGlu: 4.635 ± 0.83
1.091IlePhe: 1.091 ± 0.696
4.907IleGly: 4.907 ± 1.715
2.454IleHis: 2.454 ± 0.688
4.362IleIle: 4.362 ± 1.183
4.362IleLys: 4.362 ± 0.999
5.725IleLeu: 5.725 ± 0.903
0.818IleMet: 0.818 ± 0.225
1.636IleAsn: 1.636 ± 0.531
3.817IlePro: 3.817 ± 0.941
2.999IleGln: 2.999 ± 1.256
4.907IleArg: 4.907 ± 1.601
3.817IleSer: 3.817 ± 0.943
2.726IleThr: 2.726 ± 1.005
7.361IleVal: 7.361 ± 1.16
1.908IleTrp: 1.908 ± 0.596
1.908IleTyr: 1.908 ± 0.8
0.0IleXaa: 0.0 ± 0.0
Lys
6.816LysAla: 6.816 ± 1.162
2.454LysCys: 2.454 ± 0.657
2.181LysAsp: 2.181 ± 0.765
7.361LysGlu: 7.361 ± 1.978
0.545LysPhe: 0.545 ± 0.358
3.544LysGly: 3.544 ± 1.015
1.908LysHis: 1.908 ± 0.817
6.816LysIle: 6.816 ± 1.845
6.543LysLys: 6.543 ± 2.35
5.453LysLeu: 5.453 ± 1.479
0.273LysMet: 0.273 ± 0.196
2.454LysAsn: 2.454 ± 0.825
1.363LysPro: 1.363 ± 0.629
3.817LysGln: 3.817 ± 0.798
2.454LysArg: 2.454 ± 0.602
2.181LysSer: 2.181 ± 0.348
4.089LysThr: 4.089 ± 0.683
4.089LysVal: 4.089 ± 1.079
1.636LysTrp: 1.636 ± 0.635
2.181LysTyr: 2.181 ± 0.533
0.0LysXaa: 0.0 ± 0.0
Leu
3.817LeuAla: 3.817 ± 0.903
0.818LeuCys: 0.818 ± 0.441
4.089LeuAsp: 4.089 ± 0.942
7.088LeuGlu: 7.088 ± 1.891
1.908LeuPhe: 1.908 ± 0.941
6.543LeuGly: 6.543 ± 1.464
1.636LeuHis: 1.636 ± 1.298
3.817LeuIle: 3.817 ± 1.41
6.816LeuLys: 6.816 ± 0.925
8.451LeuLeu: 8.451 ± 3.147
0.818LeuMet: 0.818 ± 0.643
3.817LeuAsn: 3.817 ± 0.986
2.454LeuPro: 2.454 ± 0.647
5.18LeuGln: 5.18 ± 1.063
4.907LeuArg: 4.907 ± 0.659
3.272LeuSer: 3.272 ± 0.834
4.362LeuThr: 4.362 ± 0.713
5.453LeuVal: 5.453 ± 1.106
2.999LeuTrp: 2.999 ± 0.989
2.454LeuTyr: 2.454 ± 0.757
0.0LeuXaa: 0.0 ± 0.0
Met
1.091MetAla: 1.091 ± 0.576
0.0MetCys: 0.0 ± 0.0
0.818MetAsp: 0.818 ± 0.489
1.908MetGlu: 1.908 ± 0.91
0.545MetPhe: 0.545 ± 0.299
1.908MetGly: 1.908 ± 0.838
0.545MetHis: 0.545 ± 0.214
1.091MetIle: 1.091 ± 0.563
0.545MetLys: 0.545 ± 0.299
1.363MetLeu: 1.363 ± 0.458
1.636MetMet: 1.636 ± 0.449
0.545MetAsn: 0.545 ± 0.311
0.0MetPro: 0.0 ± 0.0
1.363MetGln: 1.363 ± 0.686
1.908MetArg: 1.908 ± 0.736
0.818MetSer: 0.818 ± 0.406
2.726MetThr: 2.726 ± 0.597
0.818MetVal: 0.818 ± 0.225
0.545MetTrp: 0.545 ± 0.531
1.091MetTyr: 1.091 ± 0.382
0.0MetXaa: 0.0 ± 0.0
Asn
2.726AsnAla: 2.726 ± 0.959
2.181AsnCys: 2.181 ± 0.54
1.363AsnAsp: 1.363 ± 0.319
2.454AsnGlu: 2.454 ± 1.048
3.272AsnPhe: 3.272 ± 0.99
1.908AsnGly: 1.908 ± 1.093
0.0AsnHis: 0.0 ± 0.0
2.181AsnIle: 2.181 ± 0.709
3.272AsnLys: 3.272 ± 0.616
1.363AsnLeu: 1.363 ± 0.499
1.091AsnMet: 1.091 ± 1.062
4.089AsnAsn: 4.089 ± 2.502
3.272AsnPro: 3.272 ± 1.015
1.636AsnGln: 1.636 ± 0.357
1.363AsnArg: 1.363 ± 0.412
3.272AsnSer: 3.272 ± 1.104
4.089AsnThr: 4.089 ± 0.728
1.091AsnVal: 1.091 ± 0.696
1.908AsnTrp: 1.908 ± 0.495
1.091AsnTyr: 1.091 ± 0.348
0.0AsnXaa: 0.0 ± 0.0
Pro
2.454ProAla: 2.454 ± 0.883
0.818ProCys: 0.818 ± 0.797
2.454ProAsp: 2.454 ± 0.86
4.089ProGlu: 4.089 ± 1.117
1.636ProPhe: 1.636 ± 0.811
5.453ProGly: 5.453 ± 1.408
0.818ProHis: 0.818 ± 0.497
4.635ProIle: 4.635 ± 0.909
2.726ProLys: 2.726 ± 0.993
4.362ProLeu: 4.362 ± 0.866
0.818ProMet: 0.818 ± 0.437
0.818ProAsn: 0.818 ± 0.634
3.817ProPro: 3.817 ± 1.438
3.272ProGln: 3.272 ± 0.923
4.089ProArg: 4.089 ± 1.23
2.181ProSer: 2.181 ± 1.146
3.817ProThr: 3.817 ± 1.113
4.907ProVal: 4.907 ± 1.302
1.091ProTrp: 1.091 ± 0.933
0.545ProTyr: 0.545 ± 0.391
0.0ProXaa: 0.0 ± 0.0
Gln
5.725GlnAla: 5.725 ± 0.976
0.273GlnCys: 0.273 ± 0.266
2.181GlnAsp: 2.181 ± 1.079
3.817GlnGlu: 3.817 ± 0.668
0.545GlnPhe: 0.545 ± 0.531
5.725GlnGly: 5.725 ± 0.759
1.363GlnHis: 1.363 ± 0.505
4.635GlnIle: 4.635 ± 1.019
2.999GlnLys: 2.999 ± 1.239
5.453GlnLeu: 5.453 ± 1.152
3.272GlnMet: 3.272 ± 1.388
3.544GlnAsn: 3.544 ± 0.923
2.454GlnPro: 2.454 ± 1.675
2.454GlnGln: 2.454 ± 1.37
4.362GlnArg: 4.362 ± 1.481
2.181GlnSer: 2.181 ± 0.604
2.181GlnThr: 2.181 ± 0.674
4.362GlnVal: 4.362 ± 1.575
0.545GlnTrp: 0.545 ± 0.391
1.636GlnTyr: 1.636 ± 0.626
0.0GlnXaa: 0.0 ± 0.0
Arg
4.635ArgAla: 4.635 ± 0.593
0.545ArgCys: 0.545 ± 0.452
3.817ArgAsp: 3.817 ± 0.8
4.907ArgGlu: 4.907 ± 1.115
1.363ArgPhe: 1.363 ± 0.638
3.817ArgGly: 3.817 ± 0.795
1.091ArgHis: 1.091 ± 0.979
4.635ArgIle: 4.635 ± 2.416
4.635ArgLys: 4.635 ± 1.257
3.272ArgLeu: 3.272 ± 1.77
1.091ArgMet: 1.091 ± 0.483
1.908ArgAsn: 1.908 ± 0.806
3.544ArgPro: 3.544 ± 1.228
6.27ArgGln: 6.27 ± 1.234
4.907ArgArg: 4.907 ± 3.132
3.272ArgSer: 3.272 ± 1.485
1.636ArgThr: 1.636 ± 0.692
2.454ArgVal: 2.454 ± 0.686
2.999ArgTrp: 2.999 ± 0.732
1.091ArgTyr: 1.091 ± 0.452
0.0ArgXaa: 0.0 ± 0.0
Ser
3.272SerAla: 3.272 ± 0.541
0.545SerCys: 0.545 ± 0.214
2.454SerAsp: 2.454 ± 0.388
4.362SerGlu: 4.362 ± 0.907
1.636SerPhe: 1.636 ± 0.883
4.089SerGly: 4.089 ± 1.414
0.545SerHis: 0.545 ± 0.484
2.999SerIle: 2.999 ± 0.806
2.181SerLys: 2.181 ± 0.686
6.816SerLeu: 6.816 ± 2.347
1.091SerMet: 1.091 ± 0.374
2.181SerAsn: 2.181 ± 0.895
4.362SerPro: 4.362 ± 1.153
5.725SerGln: 5.725 ± 2.102
3.272SerArg: 3.272 ± 1.251
3.817SerSer: 3.817 ± 0.847
3.817SerThr: 3.817 ± 1.757
2.181SerVal: 2.181 ± 0.348
0.545SerTrp: 0.545 ± 0.214
1.363SerTyr: 1.363 ± 0.866
0.0SerXaa: 0.0 ± 0.0
Thr
3.272ThrAla: 3.272 ± 0.801
0.545ThrCys: 0.545 ± 0.698
2.181ThrAsp: 2.181 ± 0.856
4.907ThrGlu: 4.907 ± 1.178
0.818ThrPhe: 0.818 ± 0.367
2.999ThrGly: 2.999 ± 0.583
1.908ThrHis: 1.908 ± 1.088
3.817ThrIle: 3.817 ± 0.817
4.089ThrLys: 4.089 ± 1.093
5.998ThrLeu: 5.998 ± 1.155
1.363ThrMet: 1.363 ± 0.529
3.544ThrAsn: 3.544 ± 0.665
3.817ThrPro: 3.817 ± 0.824
2.454ThrGln: 2.454 ± 0.886
2.454ThrArg: 2.454 ± 0.791
4.635ThrSer: 4.635 ± 1.104
4.635ThrThr: 4.635 ± 1.256
4.907ThrVal: 4.907 ± 0.982
2.181ThrTrp: 2.181 ± 0.704
1.363ThrTyr: 1.363 ± 0.812
0.0ThrXaa: 0.0 ± 0.0
Val
2.999ValAla: 2.999 ± 0.696
0.545ValCys: 0.545 ± 0.698
3.272ValAsp: 3.272 ± 1.044
4.089ValGlu: 4.089 ± 1.354
0.818ValPhe: 0.818 ± 0.312
5.453ValGly: 5.453 ± 0.815
3.272ValHis: 3.272 ± 1.131
3.544ValIle: 3.544 ± 0.763
4.362ValLys: 4.362 ± 1.089
3.817ValLeu: 3.817 ± 0.922
0.273ValMet: 0.273 ± 0.319
2.726ValAsn: 2.726 ± 0.769
2.726ValPro: 2.726 ± 0.829
4.089ValGln: 4.089 ± 1.194
2.726ValArg: 2.726 ± 0.934
4.089ValSer: 4.089 ± 1.385
4.089ValThr: 4.089 ± 0.758
4.362ValVal: 4.362 ± 1.041
2.181ValTrp: 2.181 ± 0.652
1.091ValTyr: 1.091 ± 0.475
0.0ValXaa: 0.0 ± 0.0
Trp
1.908TrpAla: 1.908 ± 0.409
0.273TrpCys: 0.273 ± 0.391
1.636TrpAsp: 1.636 ± 0.7
1.636TrpGlu: 1.636 ± 0.562
0.818TrpPhe: 0.818 ± 0.669
2.181TrpGly: 2.181 ± 0.904
0.273TrpHis: 0.273 ± 0.319
1.091TrpIle: 1.091 ± 0.482
3.272TrpLys: 3.272 ± 0.606
0.818TrpLeu: 0.818 ± 0.668
1.636TrpMet: 1.636 ± 0.508
1.636TrpAsn: 1.636 ± 1.423
1.091TrpPro: 1.091 ± 0.482
1.908TrpGln: 1.908 ± 0.648
1.908TrpArg: 1.908 ± 0.589
1.363TrpSer: 1.363 ± 0.929
1.363TrpThr: 1.363 ± 0.786
1.091TrpVal: 1.091 ± 0.25
0.818TrpTrp: 0.818 ± 0.312
0.545TrpTyr: 0.545 ± 0.214
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.091TyrAla: 1.091 ± 0.428
1.636TyrCys: 1.636 ± 0.73
0.818TyrAsp: 0.818 ± 0.312
0.818TyrGlu: 0.818 ± 0.497
1.363TyrPhe: 1.363 ± 0.779
1.363TyrGly: 1.363 ± 0.776
0.818TyrHis: 0.818 ± 0.367
0.545TyrIle: 0.545 ± 0.214
3.272TyrLys: 3.272 ± 0.954
1.091TyrLeu: 1.091 ± 0.461
0.273TyrMet: 0.273 ± 0.196
1.363TyrAsn: 1.363 ± 0.738
1.363TyrPro: 1.363 ± 0.638
2.181TyrGln: 2.181 ± 0.708
0.818TyrArg: 0.818 ± 0.449
1.363TyrSer: 1.363 ± 0.339
1.091TyrThr: 1.091 ± 0.502
1.636TyrVal: 1.636 ± 0.692
1.091TyrTrp: 1.091 ± 0.563
1.091TyrTyr: 1.091 ± 0.381
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3669 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski