Amino acid dipepetide frequency for Human immunodeficiency virus type 2 subtype A (isolate BEN) (HIV-2)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.83AlaAla: 4.83 ± 1.35
1.271AlaCys: 1.271 ± 0.319
2.542AlaAsp: 2.542 ± 0.83
7.88AlaGlu: 7.88 ± 2.067
2.034AlaPhe: 2.034 ± 0.973
5.084AlaGly: 5.084 ± 0.87
0.763AlaHis: 0.763 ± 0.673
3.05AlaIle: 3.05 ± 0.445
2.542AlaLys: 2.542 ± 0.669
5.338AlaLeu: 5.338 ± 1.198
2.542AlaMet: 2.542 ± 0.605
3.305AlaAsn: 3.305 ± 0.565
4.321AlaPro: 4.321 ± 0.996
4.575AlaGln: 4.575 ± 0.786
4.575AlaArg: 4.575 ± 0.608
2.796AlaSer: 2.796 ± 0.7
1.525AlaThr: 1.525 ± 0.479
3.05AlaVal: 3.05 ± 0.605
2.288AlaTrp: 2.288 ± 0.623
1.017AlaTyr: 1.017 ± 0.452
0.0AlaXaa: 0.0 ± 0.0
Cys
1.779CysAla: 1.779 ± 0.87
0.254CysCys: 0.254 ± 0.346
0.508CysAsp: 0.508 ± 0.431
1.525CysGlu: 1.525 ± 0.524
0.763CysPhe: 0.763 ± 0.636
1.271CysGly: 1.271 ± 0.48
0.508CysHis: 0.508 ± 0.431
0.254CysIle: 0.254 ± 0.216
1.271CysLys: 1.271 ± 0.46
1.525CysLeu: 1.525 ± 0.479
0.254CysMet: 0.254 ± 0.159
1.525CysAsn: 1.525 ± 0.665
0.763CysPro: 0.763 ± 0.337
2.034CysGln: 2.034 ± 0.616
1.779CysArg: 1.779 ± 0.471
1.779CysSer: 1.779 ± 0.612
2.288CysThr: 2.288 ± 1.045
2.034CysVal: 2.034 ± 0.639
0.763CysTrp: 0.763 ± 0.392
1.017CysTyr: 1.017 ± 0.473
0.0CysXaa: 0.0 ± 0.0
Asp
1.017AspAla: 1.017 ± 0.422
0.763AspCys: 0.763 ± 0.365
2.034AspAsp: 2.034 ± 1.612
2.288AspGlu: 2.288 ± 0.891
1.017AspPhe: 1.017 ± 0.635
1.525AspGly: 1.525 ± 0.671
1.271AspHis: 1.271 ± 0.661
3.559AspIle: 3.559 ± 0.64
2.796AspLys: 2.796 ± 0.653
2.796AspLeu: 2.796 ± 1.023
0.763AspMet: 0.763 ± 0.473
1.525AspAsn: 1.525 ± 0.555
3.813AspPro: 3.813 ± 1.142
1.271AspGln: 1.271 ± 0.31
2.542AspArg: 2.542 ± 0.698
3.05AspSer: 3.05 ± 0.62
2.796AspThr: 2.796 ± 0.654
2.542AspVal: 2.542 ± 0.916
1.525AspTrp: 1.525 ± 0.554
1.017AspTyr: 1.017 ± 0.37
0.0AspXaa: 0.0 ± 0.0
Glu
8.134GluAla: 8.134 ± 1.098
0.0GluCys: 0.0 ± 0.0
3.813GluAsp: 3.813 ± 0.983
8.134GluGlu: 8.134 ± 1.359
1.017GluPhe: 1.017 ± 0.646
5.592GluGly: 5.592 ± 1.226
0.763GluHis: 0.763 ± 0.368
3.559GluIle: 3.559 ± 0.947
6.863GluLys: 6.863 ± 1.366
7.372GluLeu: 7.372 ± 0.898
2.034GluMet: 2.034 ± 0.517
1.525GluAsn: 1.525 ± 0.354
3.559GluPro: 3.559 ± 0.839
4.321GluGln: 4.321 ± 0.842
4.067GluArg: 4.067 ± 1.52
3.05GluSer: 3.05 ± 0.537
4.575GluThr: 4.575 ± 0.744
3.559GluVal: 3.559 ± 0.752
1.271GluTrp: 1.271 ± 0.484
0.763GluTyr: 0.763 ± 0.653
0.0GluXaa: 0.0 ± 0.0
Phe
1.525PheAla: 1.525 ± 0.485
0.254PheCys: 0.254 ± 0.216
1.017PheAsp: 1.017 ± 0.646
0.763PheGlu: 0.763 ± 0.479
0.254PhePhe: 0.254 ± 0.159
3.05PheGly: 3.05 ± 0.692
1.017PheHis: 1.017 ± 0.473
0.763PheIle: 0.763 ± 0.368
0.763PheLys: 0.763 ± 0.407
2.288PheLeu: 2.288 ± 0.832
0.254PheMet: 0.254 ± 0.216
1.271PheAsn: 1.271 ± 0.319
1.017PhePro: 1.017 ± 0.785
2.796PheGln: 2.796 ± 0.671
2.288PheArg: 2.288 ± 0.681
2.034PheSer: 2.034 ± 0.58
1.779PheThr: 1.779 ± 0.528
1.017PheVal: 1.017 ± 0.464
0.254PheTrp: 0.254 ± 0.346
1.271PheTyr: 1.271 ± 0.28
0.0PheXaa: 0.0 ± 0.0
Gly
4.321GlyAla: 4.321 ± 0.786
2.796GlyCys: 2.796 ± 0.68
3.559GlyAsp: 3.559 ± 0.776
4.575GlyGlu: 4.575 ± 1.571
3.305GlyPhe: 3.305 ± 0.943
4.83GlyGly: 4.83 ± 1.08
1.525GlyHis: 1.525 ± 0.571
5.084GlyIle: 5.084 ± 1.127
6.609GlyLys: 6.609 ± 1.768
7.117GlyLeu: 7.117 ± 1.344
2.034GlyMet: 2.034 ± 0.695
3.05GlyAsn: 3.05 ± 0.616
4.321GlyPro: 4.321 ± 1.449
2.542GlyGln: 2.542 ± 0.504
4.067GlyArg: 4.067 ± 0.524
2.796GlySer: 2.796 ± 0.416
3.559GlyThr: 3.559 ± 0.739
3.05GlyVal: 3.05 ± 0.557
1.271GlyTrp: 1.271 ± 0.709
2.034GlyTyr: 2.034 ± 0.756
0.0GlyXaa: 0.0 ± 0.0
His
1.017HisAla: 1.017 ± 0.663
1.017HisCys: 1.017 ± 0.473
0.763HisAsp: 0.763 ± 0.473
0.508HisGlu: 0.508 ± 0.318
1.271HisPhe: 1.271 ± 1.054
1.271HisGly: 1.271 ± 0.766
0.508HisHis: 0.508 ± 0.345
1.779HisIle: 1.779 ± 0.559
1.779HisLys: 1.779 ± 0.687
4.067HisLeu: 4.067 ± 0.82
0.0HisMet: 0.0 ± 0.0
0.254HisAsn: 0.254 ± 0.373
1.525HisPro: 1.525 ± 0.309
1.525HisGln: 1.525 ± 0.378
1.271HisArg: 1.271 ± 0.789
1.525HisSer: 1.525 ± 0.333
1.017HisThr: 1.017 ± 0.37
0.763HisVal: 0.763 ± 0.337
0.254HisTrp: 0.254 ± 0.346
0.254HisTyr: 0.254 ± 0.216
0.0HisXaa: 0.0 ± 0.0
Ile
1.779IleAla: 1.779 ± 0.535
0.508IleCys: 0.508 ± 0.185
1.017IleAsp: 1.017 ± 0.422
3.559IleGlu: 3.559 ± 1.088
1.271IlePhe: 1.271 ± 0.804
3.305IleGly: 3.305 ± 0.489
1.779IleHis: 1.779 ± 0.6
4.83IleIle: 4.83 ± 1.0
4.067IleLys: 4.067 ± 1.023
4.575IleLeu: 4.575 ± 0.716
0.763IleMet: 0.763 ± 0.337
3.559IleAsn: 3.559 ± 0.941
4.321IlePro: 4.321 ± 1.069
5.084IleGln: 5.084 ± 0.921
2.796IleArg: 2.796 ± 0.619
2.034IleSer: 2.034 ± 0.836
1.271IleThr: 1.271 ± 0.543
3.559IleVal: 3.559 ± 1.234
1.017IleTrp: 1.017 ± 0.461
2.034IleTyr: 2.034 ± 0.332
0.0IleXaa: 0.0 ± 0.0
Lys
4.575LysAla: 4.575 ± 1.065
1.779LysCys: 1.779 ± 0.505
3.05LysAsp: 3.05 ± 0.455
6.609LysGlu: 6.609 ± 1.839
2.034LysPhe: 2.034 ± 0.752
4.321LysGly: 4.321 ± 0.961
1.525LysHis: 1.525 ± 0.317
3.813LysIle: 3.813 ± 1.873
5.592LysLys: 5.592 ± 1.443
5.084LysLeu: 5.084 ± 1.134
1.271LysMet: 1.271 ± 0.623
4.321LysAsn: 4.321 ± 1.517
2.034LysPro: 2.034 ± 0.536
3.05LysGln: 3.05 ± 0.885
4.067LysArg: 4.067 ± 0.733
2.288LysSer: 2.288 ± 0.564
2.034LysThr: 2.034 ± 1.222
5.084LysVal: 5.084 ± 1.762
0.763LysTrp: 0.763 ± 0.412
3.05LysTyr: 3.05 ± 0.672
0.0LysXaa: 0.0 ± 0.0
Leu
6.863LeuAla: 6.863 ± 1.068
2.034LeuCys: 2.034 ± 0.645
3.305LeuAsp: 3.305 ± 0.843
8.388LeuGlu: 8.388 ± 1.018
2.288LeuPhe: 2.288 ± 0.558
4.575LeuGly: 4.575 ± 1.036
2.288LeuHis: 2.288 ± 0.798
4.321LeuIle: 4.321 ± 0.648
6.101LeuLys: 6.101 ± 1.049
7.626LeuLeu: 7.626 ± 1.138
1.525LeuMet: 1.525 ± 0.328
4.067LeuAsn: 4.067 ± 0.75
3.813LeuPro: 3.813 ± 0.702
4.321LeuGln: 4.321 ± 0.738
5.592LeuArg: 5.592 ± 1.51
3.559LeuSer: 3.559 ± 0.702
4.575LeuThr: 4.575 ± 1.302
6.355LeuVal: 6.355 ± 0.94
1.779LeuTrp: 1.779 ± 0.447
1.271LeuTyr: 1.271 ± 0.893
0.0LeuXaa: 0.0 ± 0.0
Met
2.034MetAla: 2.034 ± 0.921
0.0MetCys: 0.0 ± 0.0
0.763MetAsp: 0.763 ± 0.368
2.034MetGlu: 2.034 ± 0.843
0.254MetPhe: 0.254 ± 0.35
2.542MetGly: 2.542 ± 0.575
0.0MetHis: 0.0 ± 0.0
0.508MetIle: 0.508 ± 0.318
0.254MetLys: 0.254 ± 0.216
1.779MetLeu: 1.779 ± 0.44
0.508MetMet: 0.508 ± 0.431
1.525MetAsn: 1.525 ± 0.309
0.508MetPro: 0.508 ± 0.256
1.525MetGln: 1.525 ± 0.533
1.017MetArg: 1.017 ± 0.576
1.271MetSer: 1.271 ± 0.575
2.542MetThr: 2.542 ± 0.445
0.508MetVal: 0.508 ± 0.318
0.254MetTrp: 0.254 ± 0.216
1.017MetTyr: 1.017 ± 0.334
0.0MetXaa: 0.0 ± 0.0
Asn
1.525AsnAla: 1.525 ± 0.469
2.542AsnCys: 2.542 ± 0.629
1.525AsnAsp: 1.525 ± 0.821
2.034AsnGlu: 2.034 ± 0.536
1.525AsnPhe: 1.525 ± 0.484
1.017AsnGly: 1.017 ± 0.401
0.508AsnHis: 0.508 ± 0.333
3.05AsnIle: 3.05 ± 0.708
2.796AsnLys: 2.796 ± 0.699
2.034AsnLeu: 2.034 ± 0.413
1.779AsnMet: 1.779 ± 0.712
0.763AsnAsn: 0.763 ± 0.369
3.559AsnPro: 3.559 ± 0.986
2.288AsnGln: 2.288 ± 0.536
2.542AsnArg: 2.542 ± 1.008
3.305AsnSer: 3.305 ± 0.638
4.575AsnThr: 4.575 ± 1.183
0.763AsnVal: 0.763 ± 0.369
1.525AsnTrp: 1.525 ± 0.352
3.05AsnTyr: 3.05 ± 0.497
0.0AsnXaa: 0.0 ± 0.0
Pro
4.067ProAla: 4.067 ± 0.967
1.017ProCys: 1.017 ± 0.473
3.05ProAsp: 3.05 ± 0.833
3.05ProGlu: 3.05 ± 0.959
1.525ProPhe: 1.525 ± 0.675
6.355ProGly: 6.355 ± 0.837
1.017ProHis: 1.017 ± 0.395
3.05ProIle: 3.05 ± 0.708
1.525ProLys: 1.525 ± 0.484
4.575ProLeu: 4.575 ± 0.838
0.254ProMet: 0.254 ± 0.35
1.525ProAsn: 1.525 ± 0.484
5.338ProPro: 5.338 ± 2.231
3.05ProGln: 3.05 ± 1.007
7.117ProArg: 7.117 ± 0.978
3.813ProSer: 3.813 ± 1.516
5.846ProThr: 5.846 ± 1.049
3.559ProVal: 3.559 ± 0.896
0.763ProTrp: 0.763 ± 0.521
2.288ProTyr: 2.288 ± 0.715
0.0ProXaa: 0.0 ± 0.0
Gln
4.575GlnAla: 4.575 ± 1.014
1.271GlnCys: 1.271 ± 0.319
0.508GlnAsp: 0.508 ± 0.185
5.338GlnGlu: 5.338 ± 1.073
1.271GlnPhe: 1.271 ± 0.591
6.863GlnGly: 6.863 ± 1.457
1.017GlnHis: 1.017 ± 0.363
3.305GlnIle: 3.305 ± 0.827
4.321GlnLys: 4.321 ± 1.112
4.321GlnLeu: 4.321 ± 1.0
1.271GlnMet: 1.271 ± 0.463
2.288GlnAsn: 2.288 ± 1.017
1.271GlnPro: 1.271 ± 0.426
4.83GlnGln: 4.83 ± 1.097
4.321GlnArg: 4.321 ± 1.286
2.034GlnSer: 2.034 ± 0.508
2.796GlnThr: 2.796 ± 0.449
2.796GlnVal: 2.796 ± 0.883
2.288GlnTrp: 2.288 ± 0.817
2.288GlnTyr: 2.288 ± 0.503
0.0GlnXaa: 0.0 ± 0.0
Arg
3.559ArgAla: 3.559 ± 1.085
1.271ArgCys: 1.271 ± 0.584
3.813ArgAsp: 3.813 ± 0.579
6.355ArgGlu: 6.355 ± 1.782
1.271ArgPhe: 1.271 ± 0.548
6.101ArgGly: 6.101 ± 0.794
2.034ArgHis: 2.034 ± 0.987
2.288ArgIle: 2.288 ± 0.554
4.575ArgLys: 4.575 ± 1.052
6.101ArgLeu: 6.101 ± 1.129
1.271ArgMet: 1.271 ± 0.441
2.796ArgAsn: 2.796 ± 0.468
4.067ArgPro: 4.067 ± 0.736
5.084ArgGln: 5.084 ± 0.784
7.88ArgArg: 7.88 ± 3.263
1.271ArgSer: 1.271 ± 0.588
3.559ArgThr: 3.559 ± 1.318
2.796ArgVal: 2.796 ± 1.002
1.017ArgTrp: 1.017 ± 0.248
3.305ArgTyr: 3.305 ± 0.977
0.0ArgXaa: 0.0 ± 0.0
Ser
2.796SerAla: 2.796 ± 1.123
2.542SerCys: 2.542 ± 0.827
1.525SerAsp: 1.525 ± 0.461
2.796SerGlu: 2.796 ± 0.863
0.763SerPhe: 0.763 ± 0.445
4.321SerGly: 4.321 ± 1.105
0.763SerHis: 0.763 ± 0.775
2.542SerIle: 2.542 ± 0.925
2.288SerLys: 2.288 ± 0.714
4.575SerLeu: 4.575 ± 0.898
0.254SerMet: 0.254 ± 0.35
1.017SerAsn: 1.017 ± 0.638
4.067SerPro: 4.067 ± 0.616
4.575SerGln: 4.575 ± 0.763
3.559SerArg: 3.559 ± 0.911
3.305SerSer: 3.305 ± 1.09
2.796SerThr: 2.796 ± 0.607
0.763SerVal: 0.763 ± 0.381
1.017SerTrp: 1.017 ± 0.655
1.271SerTyr: 1.271 ± 0.595
0.0SerXaa: 0.0 ± 0.0
Thr
5.084ThrAla: 5.084 ± 0.701
1.271ThrCys: 1.271 ± 0.744
2.796ThrAsp: 2.796 ± 0.762
4.067ThrGlu: 4.067 ± 0.732
1.017ThrPhe: 1.017 ± 0.401
2.796ThrGly: 2.796 ± 0.815
1.779ThrHis: 1.779 ± 0.502
2.542ThrIle: 2.542 ± 0.755
3.305ThrLys: 3.305 ± 0.933
4.83ThrLeu: 4.83 ± 1.201
1.017ThrMet: 1.017 ± 0.343
3.05ThrAsn: 3.05 ± 0.564
5.846ThrPro: 5.846 ± 1.309
1.271ThrGln: 1.271 ± 0.432
2.542ThrArg: 2.542 ± 1.419
5.338ThrSer: 5.338 ± 1.75
3.05ThrThr: 3.05 ± 1.019
3.305ThrVal: 3.305 ± 0.986
2.542ThrTrp: 2.542 ± 1.258
1.017ThrTyr: 1.017 ± 0.423
0.0ThrXaa: 0.0 ± 0.0
Val
4.321ValAla: 4.321 ± 0.939
1.017ValCys: 1.017 ± 0.359
2.034ValAsp: 2.034 ± 0.729
2.288ValGlu: 2.288 ± 0.437
1.271ValPhe: 1.271 ± 0.787
4.83ValGly: 4.83 ± 1.345
1.779ValHis: 1.779 ± 0.457
2.288ValIle: 2.288 ± 0.604
3.559ValLys: 3.559 ± 0.654
4.83ValLeu: 4.83 ± 1.599
0.508ValMet: 0.508 ± 0.319
2.034ValAsn: 2.034 ± 0.823
5.592ValPro: 5.592 ± 1.297
2.542ValGln: 2.542 ± 0.812
3.305ValArg: 3.305 ± 0.828
0.763ValSer: 0.763 ± 0.368
4.321ValThr: 4.321 ± 0.77
4.321ValVal: 4.321 ± 1.159
1.779ValTrp: 1.779 ± 0.37
1.271ValTyr: 1.271 ± 0.434
0.0ValXaa: 0.0 ± 0.0
Trp
1.017TrpAla: 1.017 ± 0.395
0.763TrpCys: 0.763 ± 0.225
1.525TrpAsp: 1.525 ± 0.354
0.763TrpGlu: 0.763 ± 0.369
0.763TrpPhe: 0.763 ± 0.647
1.525TrpGly: 1.525 ± 0.759
1.271TrpHis: 1.271 ± 0.766
1.271TrpIle: 1.271 ± 0.306
2.796TrpLys: 2.796 ± 0.624
1.525TrpLeu: 1.525 ± 0.83
1.271TrpMet: 1.271 ± 0.578
1.017TrpAsn: 1.017 ± 0.329
1.271TrpPro: 1.271 ± 0.434
1.525TrpGln: 1.525 ± 0.671
1.779TrpArg: 1.779 ± 1.163
0.0TrpSer: 0.0 ± 0.0
1.525TrpThr: 1.525 ± 0.683
2.034TrpVal: 2.034 ± 0.501
0.763TrpTrp: 0.763 ± 0.412
0.508TrpTyr: 0.508 ± 0.395
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.508TyrAla: 0.508 ± 0.318
1.525TyrCys: 1.525 ± 1.023
0.763TyrAsp: 0.763 ± 0.225
1.017TyrGlu: 1.017 ± 0.675
0.763TyrPhe: 0.763 ± 0.445
1.271TyrGly: 1.271 ± 0.596
0.254TyrHis: 0.254 ± 0.159
1.271TyrIle: 1.271 ± 0.442
2.796TyrLys: 2.796 ± 0.558
2.288TyrLeu: 2.288 ± 1.299
1.017TyrMet: 1.017 ± 0.334
2.288TyrAsn: 2.288 ± 0.459
1.779TyrPro: 1.779 ± 0.754
0.763TyrGln: 0.763 ± 0.307
3.305TyrArg: 3.305 ± 0.549
1.525TyrSer: 1.525 ± 0.998
2.034TyrThr: 2.034 ± 0.792
2.796TyrVal: 2.796 ± 0.499
1.779TyrTrp: 1.779 ± 0.44
1.271TyrTyr: 1.271 ± 0.467
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3935 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski