Amino acid dipepetide frequency for Human immunodeficiency virus type 1 group M subtype B (isolate ARV2/SF2) (HIV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.008AlaAla: 6.008 ± 2.022
2.458AlaCys: 2.458 ± 1.547
1.638AlaAsp: 1.638 ± 0.881
5.188AlaGlu: 5.188 ± 1.534
1.638AlaPhe: 1.638 ± 0.417
5.461AlaGly: 5.461 ± 1.428
0.819AlaHis: 0.819 ± 0.338
5.461AlaIle: 5.461 ± 1.74
2.458AlaLys: 2.458 ± 0.751
4.915AlaLeu: 4.915 ± 0.865
2.185AlaMet: 2.185 ± 0.602
2.458AlaAsn: 2.458 ± 1.043
3.004AlaPro: 3.004 ± 1.256
2.458AlaGln: 2.458 ± 0.526
3.55AlaArg: 3.55 ± 0.602
5.188AlaSer: 5.188 ± 0.982
3.823AlaThr: 3.823 ± 0.859
4.642AlaVal: 4.642 ± 1.084
1.365AlaTrp: 1.365 ± 0.561
1.092AlaTyr: 1.092 ± 0.464
0.0AlaXaa: 0.0 ± 0.0
Cys
0.546CysAla: 0.546 ± 0.43
0.546CysCys: 0.546 ± 0.861
0.273CysAsp: 0.273 ± 0.18
0.273CysGlu: 0.273 ± 0.387
2.185CysPhe: 2.185 ± 1.76
1.638CysGly: 1.638 ± 0.723
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.365CysLys: 1.365 ± 0.77
0.273CysLeu: 0.273 ± 0.241
0.273CysMet: 0.273 ± 0.402
1.638CysAsn: 1.638 ± 1.244
0.273CysPro: 0.273 ± 0.241
1.092CysGln: 1.092 ± 0.524
1.638CysArg: 1.638 ± 0.548
1.638CysSer: 1.638 ± 0.974
3.004CysThr: 3.004 ± 1.182
1.638CysVal: 1.638 ± 0.566
0.819CysTrp: 0.819 ± 0.387
1.092CysTyr: 1.092 ± 1.722
0.0CysXaa: 0.0 ± 0.0
Asp
1.092AspAla: 1.092 ± 0.48
3.277AspCys: 3.277 ± 1.451
1.092AspAsp: 1.092 ± 0.588
0.546AspGlu: 0.546 ± 0.597
1.092AspPhe: 1.092 ± 0.719
1.638AspGly: 1.638 ± 0.623
0.0AspHis: 0.0 ± 0.0
3.55AspIle: 3.55 ± 0.796
3.277AspLys: 3.277 ± 1.123
3.277AspLeu: 3.277 ± 0.84
0.819AspMet: 0.819 ± 0.447
2.185AspAsn: 2.185 ± 1.093
2.731AspPro: 2.731 ± 1.495
1.912AspGln: 1.912 ± 0.616
3.823AspArg: 3.823 ± 1.038
2.731AspSer: 2.731 ± 1.857
2.458AspThr: 2.458 ± 0.623
2.185AspVal: 2.185 ± 0.674
0.273AspTrp: 0.273 ± 0.416
0.819AspTyr: 0.819 ± 0.387
0.0AspXaa: 0.0 ± 0.0
Glu
5.461GluAla: 5.461 ± 1.368
0.0GluCys: 0.0 ± 0.0
2.731GluAsp: 2.731 ± 0.979
8.738GluGlu: 8.738 ± 2.024
1.092GluPhe: 1.092 ± 0.489
4.642GluGly: 4.642 ± 0.88
0.546GluHis: 0.546 ± 0.36
3.823GluIle: 3.823 ± 0.995
5.735GluLys: 5.735 ± 1.656
7.1GluLeu: 7.1 ± 1.298
1.638GluMet: 1.638 ± 0.632
1.912GluAsn: 1.912 ± 0.52
6.008GluPro: 6.008 ± 2.004
3.823GluGln: 3.823 ± 0.933
4.096GluArg: 4.096 ± 1.366
3.004GluSer: 3.004 ± 1.031
4.369GluThr: 4.369 ± 1.604
4.369GluVal: 4.369 ± 0.997
1.638GluTrp: 1.638 ± 0.796
1.092GluTyr: 1.092 ± 0.731
0.0GluXaa: 0.0 ± 0.0
Phe
1.365PheAla: 1.365 ± 0.384
0.273PheCys: 0.273 ± 0.241
1.092PheAsp: 1.092 ± 0.971
0.273PheGlu: 0.273 ± 0.241
0.273PhePhe: 0.273 ± 0.241
1.365PheGly: 1.365 ± 0.632
1.092PheHis: 1.092 ± 0.97
1.092PheIle: 1.092 ± 0.407
1.092PheLys: 1.092 ± 0.585
3.004PheLeu: 3.004 ± 0.581
0.0PheMet: 0.0 ± 0.0
2.731PheAsn: 2.731 ± 1.198
1.365PhePro: 1.365 ± 0.946
0.546PheGln: 0.546 ± 0.232
3.277PheArg: 3.277 ± 0.959
2.458PheSer: 2.458 ± 0.588
1.638PheThr: 1.638 ± 0.785
0.273PheVal: 0.273 ± 0.18
0.273PheTrp: 0.273 ± 0.18
1.365PheTyr: 1.365 ± 0.396
0.0PheXaa: 0.0 ± 0.0
Gly
4.642GlyAla: 4.642 ± 0.828
1.912GlyCys: 1.912 ± 0.666
2.458GlyAsp: 2.458 ± 0.958
3.823GlyGlu: 3.823 ± 0.703
1.638GlyPhe: 1.638 ± 0.623
6.827GlyGly: 6.827 ± 1.313
3.823GlyHis: 3.823 ± 1.456
6.281GlyIle: 6.281 ± 1.648
5.188GlyLys: 5.188 ± 1.322
4.369GlyLeu: 4.369 ± 0.965
0.819GlyMet: 0.819 ± 0.375
3.277GlyAsn: 3.277 ± 1.035
4.915GlyPro: 4.915 ± 0.984
4.369GlyGln: 4.369 ± 1.452
3.277GlyArg: 3.277 ± 1.461
3.823GlySer: 3.823 ± 1.16
3.823GlyThr: 3.823 ± 1.032
2.185GlyVal: 2.185 ± 0.782
2.185GlyTrp: 2.185 ± 0.962
1.912GlyTyr: 1.912 ± 0.623
0.0GlyXaa: 0.0 ± 0.0
His
0.819HisAla: 0.819 ± 0.274
1.092HisCys: 1.092 ± 0.955
0.0HisAsp: 0.0 ± 0.0
0.546HisGlu: 0.546 ± 0.36
0.819HisPhe: 0.819 ± 0.957
1.092HisGly: 1.092 ± 0.757
0.819HisHis: 0.819 ± 0.869
1.638HisIle: 1.638 ± 0.596
1.092HisLys: 1.092 ± 0.516
2.458HisLeu: 2.458 ± 0.79
0.819HisMet: 0.819 ± 0.835
0.819HisAsn: 0.819 ± 0.39
2.458HisPro: 2.458 ± 1.099
3.004HisGln: 3.004 ± 1.247
1.092HisArg: 1.092 ± 0.52
1.092HisSer: 1.092 ± 0.719
1.912HisThr: 1.912 ± 0.77
0.273HisVal: 0.273 ± 0.18
0.0HisTrp: 0.0 ± 0.0
0.819HisTyr: 0.819 ± 0.786
0.0HisXaa: 0.0 ± 0.0
Ile
3.277IleAla: 3.277 ± 0.82
1.092IleCys: 1.092 ± 0.464
1.912IleAsp: 1.912 ± 0.657
3.277IleGlu: 3.277 ± 0.706
0.546IlePhe: 0.546 ± 0.232
4.915IleGly: 4.915 ± 1.729
2.185IleHis: 2.185 ± 0.629
4.642IleIle: 4.642 ± 1.177
4.096IleLys: 4.096 ± 1.31
6.008IleLeu: 6.008 ± 1.209
1.092IleMet: 1.092 ± 0.301
1.365IleAsn: 1.365 ± 0.384
3.823IlePro: 3.823 ± 1.479
2.731IleGln: 2.731 ± 1.074
5.188IleArg: 5.188 ± 0.771
3.277IleSer: 3.277 ± 1.117
2.458IleThr: 2.458 ± 0.987
7.1IleVal: 7.1 ± 1.849
1.912IleTrp: 1.912 ± 0.523
2.458IleTyr: 2.458 ± 0.556
0.0IleXaa: 0.0 ± 0.0
Lys
6.281LysAla: 6.281 ± 1.254
1.638LysCys: 1.638 ± 0.709
2.731LysAsp: 2.731 ± 1.131
6.554LysGlu: 6.554 ± 1.501
0.273LysPhe: 0.273 ± 0.18
4.915LysGly: 4.915 ± 0.665
1.912LysHis: 1.912 ± 0.919
6.008LysIle: 6.008 ± 1.726
8.192LysLys: 8.192 ± 2.712
6.008LysLeu: 6.008 ± 1.378
0.273LysMet: 0.273 ± 0.18
2.731LysAsn: 2.731 ± 0.76
1.365LysPro: 1.365 ± 0.738
3.55LysGln: 3.55 ± 1.045
2.731LysArg: 2.731 ± 0.794
1.912LysSer: 1.912 ± 0.454
4.369LysThr: 4.369 ± 1.163
4.369LysVal: 4.369 ± 0.999
1.912LysTrp: 1.912 ± 0.527
2.185LysTyr: 2.185 ± 0.531
0.0LysXaa: 0.0 ± 0.0
Leu
4.369LeuAla: 4.369 ± 1.153
0.819LeuCys: 0.819 ± 0.438
3.823LeuAsp: 3.823 ± 1.106
7.646LeuGlu: 7.646 ± 1.176
2.185LeuPhe: 2.185 ± 1.003
6.827LeuGly: 6.827 ± 1.55
2.185LeuHis: 2.185 ± 1.422
2.731LeuIle: 2.731 ± 1.3
5.735LeuLys: 5.735 ± 1.427
8.738LeuLeu: 8.738 ± 3.087
0.819LeuMet: 0.819 ± 0.664
4.642LeuAsn: 4.642 ± 1.226
2.458LeuPro: 2.458 ± 0.841
5.735LeuGln: 5.735 ± 1.131
5.461LeuArg: 5.461 ± 0.765
3.277LeuSer: 3.277 ± 1.134
3.823LeuThr: 3.823 ± 0.746
5.735LeuVal: 5.735 ± 1.452
3.004LeuTrp: 3.004 ± 1.044
2.185LeuTyr: 2.185 ± 0.799
0.0LeuXaa: 0.0 ± 0.0
Met
1.365MetAla: 1.365 ± 0.769
0.0MetCys: 0.0 ± 0.0
0.546MetAsp: 0.546 ± 0.36
1.912MetGlu: 1.912 ± 1.018
0.819MetPhe: 0.819 ± 0.274
1.912MetGly: 1.912 ± 0.943
0.546MetHis: 0.546 ± 0.232
1.092MetIle: 1.092 ± 0.52
0.819MetLys: 0.819 ± 0.274
1.638MetLeu: 1.638 ± 0.568
1.092MetMet: 1.092 ± 0.625
0.819MetAsn: 0.819 ± 0.463
0.0MetPro: 0.0 ± 0.0
1.912MetGln: 1.912 ± 0.657
1.638MetArg: 1.638 ± 0.44
0.819MetSer: 0.819 ± 0.477
2.458MetThr: 2.458 ± 0.841
0.819MetVal: 0.819 ± 0.274
0.546MetTrp: 0.546 ± 0.482
1.092MetTyr: 1.092 ± 0.417
0.0MetXaa: 0.0 ± 0.0
Asn
3.004AsnAla: 3.004 ± 0.937
2.731AsnCys: 2.731 ± 0.959
1.365AsnAsp: 1.365 ± 0.384
2.731AsnGlu: 2.731 ± 0.719
3.277AsnPhe: 3.277 ± 1.043
1.638AsnGly: 1.638 ± 0.486
0.273AsnHis: 0.273 ± 0.241
1.912AsnIle: 1.912 ± 0.573
3.004AsnLys: 3.004 ± 0.804
2.731AsnLeu: 2.731 ± 0.721
1.092AsnMet: 1.092 ± 0.964
4.642AsnAsn: 4.642 ± 1.74
3.55AsnPro: 3.55 ± 1.157
1.365AsnGln: 1.365 ± 0.384
1.638AsnArg: 1.638 ± 0.73
2.458AsnSer: 2.458 ± 0.488
4.096AsnThr: 4.096 ± 1.008
1.365AsnVal: 1.365 ± 0.903
2.185AsnTrp: 2.185 ± 0.762
1.912AsnTyr: 1.912 ± 0.807
0.0AsnXaa: 0.0 ± 0.0
Pro
3.55ProAla: 3.55 ± 0.905
0.819ProCys: 0.819 ± 0.723
2.458ProAsp: 2.458 ± 0.59
3.277ProGlu: 3.277 ± 1.133
1.638ProPhe: 1.638 ± 0.671
5.461ProGly: 5.461 ± 1.205
0.273ProHis: 0.273 ± 0.18
4.642ProIle: 4.642 ± 1.226
1.912ProLys: 1.912 ± 0.564
4.369ProLeu: 4.369 ± 1.312
1.092ProMet: 1.092 ± 0.839
1.092ProAsn: 1.092 ± 0.955
3.004ProPro: 3.004 ± 1.707
3.277ProGln: 3.277 ± 1.166
3.823ProArg: 3.823 ± 1.143
2.458ProSer: 2.458 ± 1.265
2.458ProThr: 2.458 ± 0.77
4.915ProVal: 4.915 ± 1.13
1.365ProTrp: 1.365 ± 1.196
0.819ProTyr: 0.819 ± 0.449
0.0ProXaa: 0.0 ± 0.0
Gln
6.008GlnAla: 6.008 ± 1.151
0.273GlnCys: 0.273 ± 0.241
2.731GlnAsp: 2.731 ± 0.993
4.369GlnGlu: 4.369 ± 1.065
0.273GlnPhe: 0.273 ± 0.241
4.915GlnGly: 4.915 ± 0.925
1.365GlnHis: 1.365 ± 0.495
4.642GlnIle: 4.642 ± 1.049
3.004GlnLys: 3.004 ± 1.189
5.461GlnLeu: 5.461 ± 1.211
3.277GlnMet: 3.277 ± 1.399
3.55GlnAsn: 3.55 ± 1.058
1.912GlnPro: 1.912 ± 1.102
2.458GlnGln: 2.458 ± 1.243
4.369GlnArg: 4.369 ± 1.53
2.185GlnSer: 2.185 ± 0.851
1.912GlnThr: 1.912 ± 0.686
3.277GlnVal: 3.277 ± 1.318
1.092GlnTrp: 1.092 ± 0.464
1.912GlnTyr: 1.912 ± 0.594
0.0GlnXaa: 0.0 ± 0.0
Arg
5.461ArgAla: 5.461 ± 1.187
0.819ArgCys: 0.819 ± 0.465
3.823ArgAsp: 3.823 ± 1.298
6.281ArgGlu: 6.281 ± 1.076
1.638ArgPhe: 1.638 ± 0.968
3.277ArgGly: 3.277 ± 0.55
0.546ArgHis: 0.546 ± 0.488
3.55ArgIle: 3.55 ± 1.24
5.188ArgLys: 5.188 ± 1.643
3.004ArgLeu: 3.004 ± 1.589
1.365ArgMet: 1.365 ± 0.563
1.912ArgAsn: 1.912 ± 0.562
3.277ArgPro: 3.277 ± 1.192
5.735ArgGln: 5.735 ± 1.839
5.461ArgArg: 5.461 ± 4.422
3.55ArgSer: 3.55 ± 1.893
2.458ArgThr: 2.458 ± 1.134
3.004ArgVal: 3.004 ± 0.924
2.185ArgTrp: 2.185 ± 0.911
0.819ArgTyr: 0.819 ± 0.375
0.0ArgXaa: 0.0 ± 0.0
Ser
2.731SerAla: 2.731 ± 0.699
0.546SerCys: 0.546 ± 0.232
2.185SerAsp: 2.185 ± 0.521
4.096SerGlu: 4.096 ± 1.07
1.638SerPhe: 1.638 ± 0.839
3.277SerGly: 3.277 ± 2.229
0.546SerHis: 0.546 ± 0.561
2.731SerIle: 2.731 ± 0.702
2.731SerLys: 2.731 ± 1.358
6.554SerLeu: 6.554 ± 2.09
1.092SerMet: 1.092 ± 0.592
1.912SerAsn: 1.912 ± 0.746
4.096SerPro: 4.096 ± 1.71
5.735SerGln: 5.735 ± 2.052
2.458SerArg: 2.458 ± 1.053
2.731SerSer: 2.731 ± 0.539
3.277SerThr: 3.277 ± 0.943
2.185SerVal: 2.185 ± 0.569
0.819SerTrp: 0.819 ± 0.438
1.092SerTyr: 1.092 ± 0.871
0.0SerXaa: 0.0 ± 0.0
Thr
3.004ThrAla: 3.004 ± 0.636
0.0ThrCys: 0.0 ± 0.0
2.185ThrAsp: 2.185 ± 0.794
5.461ThrGlu: 5.461 ± 1.168
0.819ThrPhe: 0.819 ± 0.375
3.004ThrGly: 3.004 ± 0.56
1.638ThrHis: 1.638 ± 0.937
3.277ThrIle: 3.277 ± 0.757
3.823ThrLys: 3.823 ± 1.147
6.008ThrLeu: 6.008 ± 1.359
1.092ThrMet: 1.092 ± 0.417
4.642ThrAsn: 4.642 ± 1.9
3.277ThrPro: 3.277 ± 0.654
1.912ThrGln: 1.912 ± 0.638
2.731ThrArg: 2.731 ± 1.469
2.731ThrSer: 2.731 ± 1.12
4.096ThrThr: 4.096 ± 1.519
5.461ThrVal: 5.461 ± 1.205
1.638ThrTrp: 1.638 ± 0.486
1.365ThrTyr: 1.365 ± 0.937
0.0ThrXaa: 0.0 ± 0.0
Val
3.277ValAla: 3.277 ± 0.829
0.0ValCys: 0.0 ± 0.0
3.55ValAsp: 3.55 ± 1.021
3.823ValGlu: 3.823 ± 1.139
1.365ValPhe: 1.365 ± 0.658
5.461ValGly: 5.461 ± 0.557
3.277ValHis: 3.277 ± 1.205
3.277ValIle: 3.277 ± 0.872
5.461ValLys: 5.461 ± 1.505
4.096ValLeu: 4.096 ± 0.552
0.546ValMet: 0.546 ± 0.434
1.912ValAsn: 1.912 ± 0.724
3.277ValPro: 3.277 ± 1.071
3.55ValGln: 3.55 ± 0.99
2.458ValArg: 2.458 ± 0.592
4.369ValSer: 4.369 ± 0.856
3.004ValThr: 3.004 ± 1.006
3.823ValVal: 3.823 ± 1.334
2.458ValTrp: 2.458 ± 0.689
1.092ValTyr: 1.092 ± 0.489
0.0ValXaa: 0.0 ± 0.0
Trp
1.912TrpAla: 1.912 ± 0.487
0.273TrpCys: 0.273 ± 0.416
1.365TrpAsp: 1.365 ± 0.576
2.185TrpGlu: 2.185 ± 0.632
0.546TrpPhe: 0.546 ± 0.434
1.912TrpGly: 1.912 ± 0.693
0.273TrpHis: 0.273 ± 0.387
1.638TrpIle: 1.638 ± 0.574
2.731TrpLys: 2.731 ± 0.898
1.092TrpLeu: 1.092 ± 0.706
1.912TrpMet: 1.912 ± 0.65
1.092TrpAsn: 1.092 ± 0.783
0.819TrpPro: 0.819 ± 0.387
1.912TrpGln: 1.912 ± 0.772
2.731TrpArg: 2.731 ± 0.608
1.365TrpSer: 1.365 ± 1.231
1.365TrpThr: 1.365 ± 0.69
1.092TrpVal: 1.092 ± 0.301
0.819TrpTrp: 0.819 ± 0.338
0.546TrpTyr: 0.546 ± 0.232
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.638TyrAla: 1.638 ± 0.785
1.638TyrCys: 1.638 ± 0.813
0.819TyrAsp: 0.819 ± 0.338
0.819TyrGlu: 0.819 ± 0.45
1.092TyrPhe: 1.092 ± 0.546
1.365TyrGly: 1.365 ± 0.882
0.546TyrHis: 0.546 ± 0.406
1.365TyrIle: 1.365 ± 0.529
3.004TyrLys: 3.004 ± 1.193
1.092TyrLeu: 1.092 ± 0.73
0.273TyrMet: 0.273 ± 0.18
1.638TyrAsn: 1.638 ± 0.707
1.092TyrPro: 1.092 ± 0.715
2.185TyrGln: 2.185 ± 0.885
1.912TyrArg: 1.912 ± 0.96
1.365TyrSer: 1.365 ± 0.396
1.365TyrThr: 1.365 ± 0.664
1.365TyrVal: 1.365 ± 0.669
1.092TyrTrp: 1.092 ± 0.52
1.092TyrTyr: 1.092 ± 0.4
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3663 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski