Amino acid dipepetide frequency for Human immunodeficiency virus type 1 04CD.FR.KZS

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.429AlaAla: 4.429 ± 1.471
1.363AlaCys: 1.363 ± 0.384
1.704AlaAsp: 1.704 ± 0.554
6.133AlaGlu: 6.133 ± 1.486
1.022AlaPhe: 1.022 ± 0.655
4.429AlaGly: 4.429 ± 1.112
1.704AlaHis: 1.704 ± 0.703
6.133AlaIle: 6.133 ± 1.357
3.407AlaLys: 3.407 ± 1.206
6.814AlaLeu: 6.814 ± 1.052
1.704AlaMet: 1.704 ± 0.863
2.726AlaAsn: 2.726 ± 0.904
2.726AlaPro: 2.726 ± 1.296
2.044AlaGln: 2.044 ± 0.661
3.748AlaArg: 3.748 ± 0.86
4.089AlaSer: 4.089 ± 1.088
2.726AlaThr: 2.726 ± 1.214
4.089AlaVal: 4.089 ± 1.139
2.044AlaTrp: 2.044 ± 1.258
1.022AlaTyr: 1.022 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
1.704CysAla: 1.704 ± 0.756
0.0CysCys: 0.0 ± 0.0
1.022CysAsp: 1.022 ± 0.454
0.341CysGlu: 0.341 ± 0.281
1.022CysPhe: 1.022 ± 0.776
1.363CysGly: 1.363 ± 0.607
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.341CysLys: 0.341 ± 0.395
0.681CysLeu: 0.681 ± 0.391
0.0CysMet: 0.0 ± 0.387
1.022CysAsn: 1.022 ± 0.844
0.341CysPro: 0.341 ± 0.281
1.022CysGln: 1.022 ± 0.565
1.022CysArg: 1.022 ± 0.256
1.704CysSer: 1.704 ± 0.74
3.066CysThr: 3.066 ± 0.998
1.704CysVal: 1.704 ± 0.623
0.681CysTrp: 0.681 ± 0.377
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.363AspAla: 1.363 ± 0.387
3.066AspCys: 3.066 ± 0.956
1.704AspAsp: 1.704 ± 0.939
1.022AspGlu: 1.022 ± 0.416
1.363AspPhe: 1.363 ± 1.089
2.385AspGly: 2.385 ± 1.141
0.341AspHis: 0.341 ± 0.262
4.429AspIle: 4.429 ± 0.952
3.748AspLys: 3.748 ± 1.692
4.089AspLeu: 4.089 ± 1.57
0.341AspMet: 0.341 ± 0.223
2.044AspAsn: 2.044 ± 0.798
2.044AspPro: 2.044 ± 0.484
2.044AspGln: 2.044 ± 0.756
3.066AspArg: 3.066 ± 1.373
3.066AspSer: 3.066 ± 1.137
3.066AspThr: 3.066 ± 0.756
1.363AspVal: 1.363 ± 0.384
0.681AspTrp: 0.681 ± 0.678
1.022AspTyr: 1.022 ± 0.519
0.0AspXaa: 0.0 ± 0.0
Glu
5.451GluAla: 5.451 ± 0.973
0.0GluCys: 0.0 ± 0.0
2.385GluAsp: 2.385 ± 1.271
7.836GluGlu: 7.836 ± 2.049
1.704GluPhe: 1.704 ± 0.635
4.77GluGly: 4.77 ± 0.952
0.681GluHis: 0.681 ± 0.524
4.77GluIle: 4.77 ± 1.177
4.429GluLys: 4.429 ± 1.263
7.496GluLeu: 7.496 ± 1.205
1.363GluMet: 1.363 ± 0.921
2.726GluAsn: 2.726 ± 0.931
4.429GluPro: 4.429 ± 1.28
3.066GluGln: 3.066 ± 0.681
4.089GluArg: 4.089 ± 1.439
4.089GluSer: 4.089 ± 0.895
4.429GluThr: 4.429 ± 2.317
5.111GluVal: 5.111 ± 1.327
2.044GluTrp: 2.044 ± 0.643
0.341GluTyr: 0.341 ± 0.564
0.0GluXaa: 0.0 ± 0.0
Phe
1.022PheAla: 1.022 ± 0.256
1.022PheCys: 1.022 ± 0.844
1.022PheAsp: 1.022 ± 0.671
0.681PheGlu: 0.681 ± 0.563
1.022PhePhe: 1.022 ± 0.256
1.022PheGly: 1.022 ± 0.256
0.0PheHis: 0.0 ± 0.0
1.363PheIle: 1.363 ± 0.598
1.704PheLys: 1.704 ± 0.84
3.748PheLeu: 3.748 ± 0.688
0.0PheMet: 0.0 ± 0.0
3.066PheAsn: 3.066 ± 1.397
2.044PhePro: 2.044 ± 1.423
1.022PheGln: 1.022 ± 0.489
3.066PheArg: 3.066 ± 1.294
2.385PheSer: 2.385 ± 0.635
1.363PheThr: 1.363 ± 0.69
0.341PheVal: 0.341 ± 0.262
0.341PheTrp: 0.341 ± 0.262
1.022PheTyr: 1.022 ± 0.416
0.0PheXaa: 0.0 ± 0.0
Gly
5.451GlyAla: 5.451 ± 1.012
1.704GlyCys: 1.704 ± 0.702
2.726GlyAsp: 2.726 ± 0.909
3.748GlyGlu: 3.748 ± 0.791
2.044GlyPhe: 2.044 ± 0.686
6.133GlyGly: 6.133 ± 0.944
2.044GlyHis: 2.044 ± 1.307
7.496GlyIle: 7.496 ± 2.367
5.111GlyLys: 5.111 ± 1.649
5.111GlyLeu: 5.111 ± 1.805
1.363GlyMet: 1.363 ± 0.569
3.066GlyAsn: 3.066 ± 1.456
4.77GlyPro: 4.77 ± 1.386
5.111GlyGln: 5.111 ± 1.328
3.748GlyArg: 3.748 ± 0.743
3.748GlySer: 3.748 ± 0.628
2.044GlyThr: 2.044 ± 0.54
2.726GlyVal: 2.726 ± 0.699
1.704GlyTrp: 1.704 ± 0.948
2.385GlyTyr: 2.385 ± 0.875
0.0GlyXaa: 0.0 ± 0.0
His
0.681HisAla: 0.681 ± 0.563
0.0HisCys: 0.0 ± 0.0
0.681HisAsp: 0.681 ± 0.603
1.022HisGlu: 1.022 ± 0.489
1.363HisPhe: 1.363 ± 1.093
2.385HisGly: 2.385 ± 0.962
0.681HisHis: 0.681 ± 0.92
0.681HisIle: 0.681 ± 0.92
1.022HisLys: 1.022 ± 0.519
3.066HisLeu: 3.066 ± 0.425
0.681HisMet: 0.681 ± 0.497
1.704HisAsn: 1.704 ± 0.433
2.044HisPro: 2.044 ± 1.024
1.704HisGln: 1.704 ± 1.494
1.022HisArg: 1.022 ± 0.454
1.022HisSer: 1.022 ± 0.508
2.044HisThr: 2.044 ± 0.977
0.681HisVal: 0.681 ± 0.461
0.0HisTrp: 0.0 ± 0.0
1.363HisTyr: 1.363 ± 1.366
0.0HisXaa: 0.0 ± 0.0
Ile
3.407IleAla: 3.407 ± 0.846
1.363IleCys: 1.363 ± 0.546
1.363IleAsp: 1.363 ± 0.699
5.111IleGlu: 5.111 ± 1.112
1.022IlePhe: 1.022 ± 0.489
5.451IleGly: 5.451 ± 2.584
3.066IleHis: 3.066 ± 1.256
6.474IleIle: 6.474 ± 1.578
4.77IleLys: 4.77 ± 1.237
5.111IleLeu: 5.111 ± 0.796
1.363IleMet: 1.363 ± 0.782
4.089IleAsn: 4.089 ± 1.417
3.748IlePro: 3.748 ± 0.822
3.407IleGln: 3.407 ± 1.879
4.429IleArg: 4.429 ± 0.865
2.726IleSer: 2.726 ± 1.21
3.066IleThr: 3.066 ± 0.862
6.133IleVal: 6.133 ± 1.555
2.044IleTrp: 2.044 ± 0.364
2.044IleTyr: 2.044 ± 0.488
0.0IleXaa: 0.0 ± 0.0
Lys
4.429LysAla: 4.429 ± 1.41
1.363LysCys: 1.363 ± 0.607
4.429LysAsp: 4.429 ± 1.124
5.111LysGlu: 5.111 ± 1.427
0.681LysPhe: 0.681 ± 0.514
4.77LysGly: 4.77 ± 1.505
1.704LysHis: 1.704 ± 0.96
6.474LysIle: 6.474 ± 2.006
5.111LysLys: 5.111 ± 1.405
4.429LysLeu: 4.429 ± 1.612
1.022LysMet: 1.022 ± 0.416
2.726LysAsn: 2.726 ± 0.949
1.704LysPro: 1.704 ± 0.906
4.429LysGln: 4.429 ± 1.393
3.407LysArg: 3.407 ± 0.659
4.089LysSer: 4.089 ± 1.241
3.748LysThr: 3.748 ± 0.851
4.429LysVal: 4.429 ± 1.466
2.044LysTrp: 2.044 ± 0.851
2.044LysTyr: 2.044 ± 0.358
0.0LysXaa: 0.0 ± 0.0
Leu
5.451LeuAla: 5.451 ± 0.941
0.681LeuCys: 0.681 ± 0.273
5.111LeuAsp: 5.111 ± 0.804
4.089LeuGlu: 4.089 ± 1.73
2.726LeuPhe: 2.726 ± 0.928
7.155LeuGly: 7.155 ± 2.107
1.704LeuHis: 1.704 ± 1.078
3.748LeuIle: 3.748 ± 1.514
8.518LeuLys: 8.518 ± 0.587
8.518LeuLeu: 8.518 ± 2.015
1.022LeuMet: 1.022 ± 0.781
3.748LeuAsn: 3.748 ± 1.092
1.704LeuPro: 1.704 ± 1.078
5.792LeuGln: 5.792 ± 1.038
5.111LeuArg: 5.111 ± 1.199
3.066LeuSer: 3.066 ± 1.265
4.77LeuThr: 4.77 ± 1.376
6.474LeuVal: 6.474 ± 2.045
3.748LeuTrp: 3.748 ± 1.344
1.704LeuTyr: 1.704 ± 0.623
0.0LeuXaa: 0.0 ± 0.0
Met
1.363MetAla: 1.363 ± 0.69
0.0MetCys: 0.0 ± 0.0
0.681MetAsp: 0.681 ± 0.524
1.704MetGlu: 1.704 ± 0.866
0.341MetPhe: 0.341 ± 0.395
2.044MetGly: 2.044 ± 0.727
1.022MetHis: 1.022 ± 0.454
2.385MetIle: 2.385 ± 1.1
1.704MetLys: 1.704 ± 0.607
1.022MetLeu: 1.022 ± 0.733
0.681MetMet: 0.681 ± 0.789
1.022MetAsn: 1.022 ± 0.454
0.0MetPro: 0.0 ± 0.0
1.022MetGln: 1.022 ± 0.726
1.022MetArg: 1.022 ± 0.779
1.022MetSer: 1.022 ± 0.614
2.385MetThr: 2.385 ± 1.472
0.341MetVal: 0.341 ± 0.395
1.022MetTrp: 1.022 ± 0.676
0.341MetTyr: 0.341 ± 0.395
0.0MetXaa: 0.0 ± 0.0
Asn
1.363AsnAla: 1.363 ± 1.213
2.385AsnCys: 2.385 ± 1.335
1.704AsnAsp: 1.704 ± 1.161
3.748AsnGlu: 3.748 ± 1.156
3.066AsnPhe: 3.066 ± 0.864
2.385AsnGly: 2.385 ± 1.119
0.341AsnHis: 0.341 ± 0.281
2.044AsnIle: 2.044 ± 0.977
3.407AsnLys: 3.407 ± 0.904
4.089AsnLeu: 4.089 ± 1.271
1.704AsnMet: 1.704 ± 1.407
3.748AsnAsn: 3.748 ± 1.918
3.066AsnPro: 3.066 ± 0.814
1.363AsnGln: 1.363 ± 0.988
2.385AsnArg: 2.385 ± 0.467
2.385AsnSer: 2.385 ± 0.913
4.77AsnThr: 4.77 ± 1.533
1.704AsnVal: 1.704 ± 1.021
2.726AsnTrp: 2.726 ± 1.049
1.022AsnTyr: 1.022 ± 0.614
0.0AsnXaa: 0.0 ± 0.0
Pro
3.748ProAla: 3.748 ± 1.058
0.681ProCys: 0.681 ± 0.563
2.044ProAsp: 2.044 ± 1.005
3.407ProGlu: 3.407 ± 0.947
2.044ProPhe: 2.044 ± 0.529
5.111ProGly: 5.111 ± 1.428
0.681ProHis: 0.681 ± 0.568
4.089ProIle: 4.089 ± 1.019
2.726ProLys: 2.726 ± 0.943
3.748ProLeu: 3.748 ± 0.771
1.704ProMet: 1.704 ± 1.145
1.363ProAsn: 1.363 ± 0.688
3.407ProPro: 3.407 ± 0.74
3.407ProGln: 3.407 ± 1.108
3.066ProArg: 3.066 ± 0.963
2.385ProSer: 2.385 ± 0.971
1.704ProThr: 1.704 ± 0.352
4.089ProVal: 4.089 ± 1.485
0.681ProTrp: 0.681 ± 0.544
0.681ProTyr: 0.681 ± 0.524
0.0ProXaa: 0.0 ± 0.0
Gln
6.133GlnAla: 6.133 ± 1.866
1.363GlnCys: 1.363 ± 0.749
2.726GlnAsp: 2.726 ± 0.94
4.77GlnGlu: 4.77 ± 1.138
0.681GlnPhe: 0.681 ± 0.305
5.792GlnGly: 5.792 ± 0.594
1.022GlnHis: 1.022 ± 0.508
3.407GlnIle: 3.407 ± 1.129
5.111GlnLys: 5.111 ± 1.745
5.792GlnLeu: 5.792 ± 1.923
3.066GlnMet: 3.066 ± 1.568
2.044GlnAsn: 2.044 ± 1.166
1.022GlnPro: 1.022 ± 0.256
3.407GlnGln: 3.407 ± 1.268
2.726GlnArg: 2.726 ± 0.96
1.022GlnSer: 1.022 ± 0.416
1.363GlnThr: 1.363 ± 0.906
2.385GlnVal: 2.385 ± 1.491
0.681GlnTrp: 0.681 ± 0.524
2.385GlnTyr: 2.385 ± 1.041
0.0GlnXaa: 0.0 ± 0.0
Arg
5.451ArgAla: 5.451 ± 1.108
0.681ArgCys: 0.681 ± 0.494
2.385ArgAsp: 2.385 ± 0.775
7.496ArgGlu: 7.496 ± 1.566
1.363ArgPhe: 1.363 ± 0.866
1.704ArgGly: 1.704 ± 0.63
1.704ArgHis: 1.704 ± 0.946
5.111ArgIle: 5.111 ± 1.892
4.429ArgLys: 4.429 ± 1.133
4.77ArgLeu: 4.77 ± 1.482
1.022ArgMet: 1.022 ± 0.86
2.385ArgAsn: 2.385 ± 0.935
4.429ArgPro: 4.429 ± 1.354
4.429ArgGln: 4.429 ± 1.156
2.044ArgArg: 2.044 ± 0.635
0.341ArgSer: 0.341 ± 0.281
2.044ArgThr: 2.044 ± 0.358
2.385ArgVal: 2.385 ± 0.86
1.704ArgTrp: 1.704 ± 0.811
1.363ArgTyr: 1.363 ± 0.549
0.0ArgXaa: 0.0 ± 0.0
Ser
1.704SerAla: 1.704 ± 0.404
0.341SerCys: 0.341 ± 0.262
2.044SerAsp: 2.044 ± 0.654
3.748SerGlu: 3.748 ± 1.067
1.704SerPhe: 1.704 ± 0.74
3.407SerGly: 3.407 ± 0.731
1.022SerHis: 1.022 ± 0.779
3.407SerIle: 3.407 ± 0.733
1.022SerLys: 1.022 ± 0.637
6.133SerLeu: 6.133 ± 2.14
0.681SerMet: 0.681 ± 0.524
2.726SerAsn: 2.726 ± 1.499
4.089SerPro: 4.089 ± 1.293
3.748SerGln: 3.748 ± 1.48
2.726SerArg: 2.726 ± 0.97
3.066SerSer: 3.066 ± 0.9
3.407SerThr: 3.407 ± 1.657
3.407SerVal: 3.407 ± 1.66
1.022SerTrp: 1.022 ± 0.455
0.681SerTyr: 0.681 ± 0.391
0.0SerXaa: 0.0 ± 0.0
Thr
4.089ThrAla: 4.089 ± 0.627
0.0ThrCys: 0.0 ± 0.0
3.066ThrAsp: 3.066 ± 1.231
4.429ThrGlu: 4.429 ± 0.979
1.022ThrPhe: 1.022 ± 0.489
3.748ThrGly: 3.748 ± 0.909
1.704ThrHis: 1.704 ± 1.021
2.726ThrIle: 2.726 ± 1.287
3.066ThrLys: 3.066 ± 0.83
5.111ThrLeu: 5.111 ± 1.773
1.022ThrMet: 1.022 ± 0.523
2.726ThrAsn: 2.726 ± 0.732
3.748ThrPro: 3.748 ± 0.971
2.385ThrGln: 2.385 ± 0.907
2.385ThrArg: 2.385 ± 1.281
3.066ThrSer: 3.066 ± 0.768
3.748ThrThr: 3.748 ± 1.527
5.111ThrVal: 5.111 ± 1.65
2.385ThrTrp: 2.385 ± 0.862
1.704ThrTyr: 1.704 ± 0.78
0.0ThrXaa: 0.0 ± 0.0
Val
4.089ValAla: 4.089 ± 0.731
0.0ValCys: 0.0 ± 0.0
3.407ValAsp: 3.407 ± 1.435
3.407ValGlu: 3.407 ± 1.065
1.363ValPhe: 1.363 ± 0.546
4.77ValGly: 4.77 ± 1.302
2.726ValHis: 2.726 ± 0.787
3.407ValIle: 3.407 ± 0.795
4.089ValLys: 4.089 ± 1.456
3.066ValLeu: 3.066 ± 0.947
0.681ValMet: 0.681 ± 0.273
2.385ValAsn: 2.385 ± 1.139
3.066ValPro: 3.066 ± 0.945
3.066ValGln: 3.066 ± 0.632
3.066ValArg: 3.066 ± 0.921
4.429ValSer: 4.429 ± 0.818
3.748ValThr: 3.748 ± 0.858
4.77ValVal: 4.77 ± 1.828
2.726ValTrp: 2.726 ± 0.97
2.385ValTyr: 2.385 ± 0.843
0.0ValXaa: 0.0 ± 0.0
Trp
1.704TrpAla: 1.704 ± 0.352
0.341TrpCys: 0.341 ± 0.479
1.363TrpAsp: 1.363 ± 0.789
2.044TrpGlu: 2.044 ± 0.625
0.341TrpPhe: 0.341 ± 0.281
2.385TrpGly: 2.385 ± 0.959
0.681TrpHis: 0.681 ± 0.92
1.022TrpIle: 1.022 ± 0.256
2.385TrpLys: 2.385 ± 0.941
0.681TrpLeu: 0.681 ± 0.544
1.022TrpMet: 1.022 ± 0.726
2.385TrpAsn: 2.385 ± 1.674
1.022TrpPro: 1.022 ± 0.577
2.726TrpGln: 2.726 ± 0.931
2.385TrpArg: 2.385 ± 0.959
1.363TrpSer: 1.363 ± 1.03
2.726TrpThr: 2.726 ± 0.951
1.704TrpVal: 1.704 ± 0.896
0.681TrpTrp: 0.681 ± 0.524
1.022TrpTyr: 1.022 ± 0.489
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.022TyrAla: 1.022 ± 0.455
1.022TyrCys: 1.022 ± 0.555
0.681TyrAsp: 0.681 ± 0.494
1.363TyrGlu: 1.363 ± 0.546
1.363TyrPhe: 1.363 ± 0.7
1.022TyrGly: 1.022 ± 0.508
1.022TyrHis: 1.022 ± 0.454
1.022TyrIle: 1.022 ± 0.543
2.044TyrLys: 2.044 ± 1.162
1.363TyrLeu: 1.363 ± 0.572
0.341TyrMet: 0.341 ± 0.262
1.704TyrAsn: 1.704 ± 0.859
1.363TyrPro: 1.363 ± 0.779
1.704TyrGln: 1.704 ± 1.309
2.726TyrArg: 2.726 ± 1.379
1.022TyrSer: 1.022 ± 0.256
1.022TyrThr: 1.022 ± 0.416
1.704TyrVal: 1.704 ± 0.797
1.022TyrTrp: 1.022 ± 0.454
1.022TyrTyr: 1.022 ± 0.455
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2936 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski