Amino acid dipepetide frequency for Human immunodeficiency virus type 1 group M subtype B (isolate BH10) (HIV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.487AlaAla: 5.487 ± 1.944
2.195AlaCys: 2.195 ± 0.839
1.646AlaAsp: 1.646 ± 0.534
4.938AlaGlu: 4.938 ± 1.275
1.92AlaPhe: 1.92 ± 0.347
4.664AlaGly: 4.664 ± 1.191
0.823AlaHis: 0.823 ± 0.33
4.39AlaIle: 4.39 ± 1.262
2.195AlaLys: 2.195 ± 0.863
5.761AlaLeu: 5.761 ± 1.358
1.92AlaMet: 1.92 ± 0.607
2.743AlaAsn: 2.743 ± 0.86
3.018AlaPro: 3.018 ± 1.08
1.646AlaGln: 1.646 ± 0.404
3.841AlaArg: 3.841 ± 0.734
4.39AlaSer: 4.39 ± 0.886
4.115AlaThr: 4.115 ± 1.015
4.664AlaVal: 4.664 ± 0.938
1.097AlaTrp: 1.097 ± 0.48
1.097AlaTyr: 1.097 ± 0.438
0.0AlaXaa: 0.0 ± 0.0
Cys
0.823CysAla: 0.823 ± 0.517
0.549CysCys: 0.549 ± 0.911
0.274CysAsp: 0.274 ± 0.186
0.274CysGlu: 0.274 ± 0.366
1.92CysPhe: 1.92 ± 1.899
1.92CysGly: 1.92 ± 0.563
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.372CysLys: 1.372 ± 0.811
0.274CysLeu: 0.274 ± 0.237
0.274CysMet: 0.274 ± 0.341
1.372CysAsn: 1.372 ± 1.039
0.274CysPro: 0.274 ± 0.237
1.372CysGln: 1.372 ± 0.811
1.372CysArg: 1.372 ± 0.468
1.372CysSer: 1.372 ± 0.874
3.292CysThr: 3.292 ± 0.892
1.646CysVal: 1.646 ± 0.535
0.823CysTrp: 0.823 ± 0.371
0.823CysTyr: 0.823 ± 1.064
0.0CysXaa: 0.0 ± 0.0
Asp
0.823AspAla: 0.823 ± 0.387
2.743AspCys: 2.743 ± 0.916
1.646AspAsp: 1.646 ± 0.597
1.097AspGlu: 1.097 ± 0.584
1.097AspPhe: 1.097 ± 0.745
1.372AspGly: 1.372 ± 0.457
0.0AspHis: 0.0 ± 0.0
3.292AspIle: 3.292 ± 0.86
3.018AspLys: 3.018 ± 0.994
3.841AspLeu: 3.841 ± 1.07
0.823AspMet: 0.823 ± 0.428
1.646AspAsn: 1.646 ± 0.657
2.469AspPro: 2.469 ± 1.666
1.646AspGln: 1.646 ± 0.716
4.39AspArg: 4.39 ± 1.05
2.743AspSer: 2.743 ± 0.76
3.018AspThr: 3.018 ± 0.675
0.823AspVal: 0.823 ± 0.547
0.549AspTrp: 0.549 ± 0.516
0.823AspTyr: 0.823 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
5.213GluAla: 5.213 ± 1.2
0.0GluCys: 0.0 ± 0.0
2.469GluAsp: 2.469 ± 1.009
7.133GluGlu: 7.133 ± 1.524
1.097GluPhe: 1.097 ± 0.489
4.938GluGly: 4.938 ± 0.875
0.549GluHis: 0.549 ± 0.372
4.39GluIle: 4.39 ± 1.013
4.938GluLys: 4.938 ± 0.982
7.133GluLeu: 7.133 ± 1.304
2.195GluMet: 2.195 ± 1.251
1.372GluAsn: 1.372 ± 0.409
6.036GluPro: 6.036 ± 2.055
4.115GluGln: 4.115 ± 0.807
4.115GluArg: 4.115 ± 1.457
2.743GluSer: 2.743 ± 0.754
4.39GluThr: 4.39 ± 1.841
4.115GluVal: 4.115 ± 0.666
1.92GluTrp: 1.92 ± 0.642
1.372GluTyr: 1.372 ± 0.585
0.0GluXaa: 0.0 ± 0.0
Phe
1.372PheAla: 1.372 ± 0.353
0.274PheCys: 0.274 ± 0.237
0.549PheAsp: 0.549 ± 0.516
0.274PheGlu: 0.274 ± 0.237
0.549PhePhe: 0.549 ± 0.474
1.097PheGly: 1.097 ± 0.378
0.823PheHis: 0.823 ± 1.064
1.646PheIle: 1.646 ± 1.1
1.646PheLys: 1.646 ± 0.607
3.292PheLeu: 3.292 ± 0.773
0.0PheMet: 0.0 ± 0.0
3.018PheAsn: 3.018 ± 1.374
1.372PhePro: 1.372 ± 0.913
0.823PheGln: 0.823 ± 0.564
2.469PheArg: 2.469 ± 0.857
2.195PheSer: 2.195 ± 0.538
0.823PheThr: 0.823 ± 0.33
0.549PheVal: 0.549 ± 0.219
0.274PheTrp: 0.274 ± 0.186
1.646PheTyr: 1.646 ± 0.416
0.0PheXaa: 0.0 ± 0.0
Gly
4.664GlyAla: 4.664 ± 0.885
1.646GlyCys: 1.646 ± 0.509
2.195GlyAsp: 2.195 ± 1.119
3.567GlyGlu: 3.567 ± 0.406
1.097GlyPhe: 1.097 ± 0.419
6.31GlyGly: 6.31 ± 1.113
4.39GlyHis: 4.39 ± 1.627
5.761GlyIle: 5.761 ± 1.422
5.761GlyLys: 5.761 ± 1.156
4.115GlyLeu: 4.115 ± 0.589
0.823GlyMet: 0.823 ± 0.314
2.469GlyAsn: 2.469 ± 0.811
4.938GlyPro: 4.938 ± 0.876
4.39GlyGln: 4.39 ± 1.385
3.567GlyArg: 3.567 ± 0.813
5.213GlySer: 5.213 ± 1.734
3.018GlyThr: 3.018 ± 1.162
3.292GlyVal: 3.292 ± 0.79
2.195GlyTrp: 2.195 ± 0.653
1.646GlyTyr: 1.646 ± 0.607
0.0GlyXaa: 0.0 ± 0.0
His
1.097HisAla: 1.097 ± 0.448
0.823HisCys: 0.823 ± 0.916
0.0HisAsp: 0.0 ± 0.0
0.549HisGlu: 0.549 ± 0.219
0.823HisPhe: 0.823 ± 1.254
1.646HisGly: 1.646 ± 0.816
1.097HisHis: 1.097 ± 0.912
1.646HisIle: 1.646 ± 0.92
1.097HisLys: 1.097 ± 0.515
2.195HisLeu: 2.195 ± 0.88
0.549HisMet: 0.549 ± 0.709
1.097HisAsn: 1.097 ± 0.694
2.469HisPro: 2.469 ± 1.227
3.018HisGln: 3.018 ± 1.212
0.823HisArg: 0.823 ± 0.387
1.92HisSer: 1.92 ± 0.542
1.92HisThr: 1.92 ± 0.72
0.549HisVal: 0.549 ± 0.367
0.0HisTrp: 0.0 ± 0.0
0.549HisTyr: 0.549 ± 0.452
0.0HisXaa: 0.0 ± 0.0
Ile
2.469IleAla: 2.469 ± 0.964
1.097IleCys: 1.097 ± 0.438
1.646IleAsp: 1.646 ± 0.872
4.664IleGlu: 4.664 ± 0.916
0.823IlePhe: 0.823 ± 0.417
4.664IleGly: 4.664 ± 1.714
2.195IleHis: 2.195 ± 0.802
4.664IleIle: 4.664 ± 1.405
4.39IleLys: 4.39 ± 1.114
5.761IleLeu: 5.761 ± 1.118
1.097IleMet: 1.097 ± 0.389
1.646IleAsn: 1.646 ± 0.535
4.115IlePro: 4.115 ± 0.944
3.018IleGln: 3.018 ± 1.269
5.213IleArg: 5.213 ± 1.351
3.841IleSer: 3.841 ± 0.996
3.292IleThr: 3.292 ± 1.271
6.859IleVal: 6.859 ± 1.303
1.92IleTrp: 1.92 ± 0.634
2.195IleTyr: 2.195 ± 0.907
0.0IleXaa: 0.0 ± 0.0
Lys
6.859LysAla: 6.859 ± 0.955
2.469LysCys: 2.469 ± 0.713
2.195LysAsp: 2.195 ± 0.74
7.407LysGlu: 7.407 ± 1.734
0.549LysPhe: 0.549 ± 0.389
4.115LysGly: 4.115 ± 0.919
1.92LysHis: 1.92 ± 1.068
6.31LysIle: 6.31 ± 1.957
6.859LysLys: 6.859 ± 2.16
5.761LysLeu: 5.761 ± 1.516
0.823LysMet: 0.823 ± 0.371
2.743LysAsn: 2.743 ± 0.919
1.372LysPro: 1.372 ± 0.699
3.841LysGln: 3.841 ± 0.739
2.469LysArg: 2.469 ± 0.803
2.195LysSer: 2.195 ± 0.339
4.115LysThr: 4.115 ± 0.655
4.115LysVal: 4.115 ± 0.882
1.372LysTrp: 1.372 ± 0.391
2.195LysTyr: 2.195 ± 0.595
0.0LysXaa: 0.0 ± 0.0
Leu
3.841LeuAla: 3.841 ± 0.947
0.823LeuCys: 0.823 ± 0.417
4.115LeuAsp: 4.115 ± 1.016
7.133LeuGlu: 7.133 ± 1.93
2.195LeuPhe: 2.195 ± 1.088
6.31LeuGly: 6.31 ± 1.48
1.646LeuHis: 1.646 ± 1.285
3.841LeuIle: 3.841 ± 1.237
7.133LeuLys: 7.133 ± 1.213
8.505LeuLeu: 8.505 ± 2.87
0.823LeuMet: 0.823 ± 0.561
3.841LeuAsn: 3.841 ± 1.034
2.469LeuPro: 2.469 ± 0.723
5.213LeuGln: 5.213 ± 0.999
4.938LeuArg: 4.938 ± 0.859
3.018LeuSer: 3.018 ± 1.049
4.39LeuThr: 4.39 ± 0.808
5.487LeuVal: 5.487 ± 1.157
3.018LeuTrp: 3.018 ± 0.944
2.469LeuTyr: 2.469 ± 0.728
0.0LeuXaa: 0.0 ± 0.0
Met
1.097MetAla: 1.097 ± 0.584
0.0MetCys: 0.0 ± 0.0
0.823MetAsp: 0.823 ± 0.445
1.92MetGlu: 1.92 ± 1.336
0.549MetPhe: 0.549 ± 0.281
1.92MetGly: 1.92 ± 0.932
0.549MetHis: 0.549 ± 0.219
1.372MetIle: 1.372 ± 0.736
0.549MetLys: 0.549 ± 0.281
1.646MetLeu: 1.646 ± 0.486
1.097MetMet: 1.097 ± 0.562
0.549MetAsn: 0.549 ± 0.35
0.0MetPro: 0.0 ± 0.0
1.372MetGln: 1.372 ± 0.691
2.195MetArg: 2.195 ± 0.535
0.823MetSer: 0.823 ± 0.371
2.743MetThr: 2.743 ± 0.692
1.372MetVal: 1.372 ± 0.468
0.549MetTrp: 0.549 ± 0.474
1.097MetTyr: 1.097 ± 0.374
0.0MetXaa: 0.0 ± 0.0
Asn
2.195AsnAla: 2.195 ± 0.586
2.743AsnCys: 2.743 ± 0.946
1.646AsnAsp: 1.646 ± 0.535
2.743AsnGlu: 2.743 ± 1.176
3.292AsnPhe: 3.292 ± 0.989
1.372AsnGly: 1.372 ± 0.736
0.0AsnHis: 0.0 ± 0.0
2.195AsnIle: 2.195 ± 0.72
3.567AsnLys: 3.567 ± 0.591
1.372AsnLeu: 1.372 ± 0.528
1.097AsnMet: 1.097 ± 0.949
4.115AsnAsn: 4.115 ± 1.957
3.567AsnPro: 3.567 ± 1.149
1.646AsnGln: 1.646 ± 0.408
1.372AsnArg: 1.372 ± 0.468
2.743AsnSer: 2.743 ± 1.245
4.938AsnThr: 4.938 ± 0.934
1.097AsnVal: 1.097 ± 0.642
2.195AsnTrp: 2.195 ± 0.6
1.097AsnTyr: 1.097 ± 0.378
0.0AsnXaa: 0.0 ± 0.0
Pro
2.743ProAla: 2.743 ± 0.936
0.823ProCys: 0.823 ± 0.712
2.469ProAsp: 2.469 ± 0.815
4.115ProGlu: 4.115 ± 1.15
1.646ProPhe: 1.646 ± 0.742
5.487ProGly: 5.487 ± 1.328
0.823ProHis: 0.823 ± 0.702
4.938ProIle: 4.938 ± 1.022
2.743ProLys: 2.743 ± 1.338
4.664ProLeu: 4.664 ± 0.978
0.823ProMet: 0.823 ± 0.461
0.823ProAsn: 0.823 ± 0.628
4.115ProPro: 4.115 ± 1.531
3.567ProGln: 3.567 ± 1.222
3.841ProArg: 3.841 ± 1.302
2.195ProSer: 2.195 ± 0.838
3.292ProThr: 3.292 ± 1.119
4.938ProVal: 4.938 ± 1.226
1.097ProTrp: 1.097 ± 1.153
0.549ProTyr: 0.549 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
5.761GlnAla: 5.761 ± 0.96
0.274GlnCys: 0.274 ± 0.237
2.195GlnAsp: 2.195 ± 0.999
3.841GlnGlu: 3.841 ± 0.787
0.274GlnPhe: 0.274 ± 0.237
5.761GlnGly: 5.761 ± 0.81
1.372GlnHis: 1.372 ± 0.618
4.938GlnIle: 4.938 ± 1.109
3.018GlnLys: 3.018 ± 1.176
5.487GlnLeu: 5.487 ± 1.284
3.292GlnMet: 3.292 ± 1.299
3.841GlnAsn: 3.841 ± 0.931
2.195GlnPro: 2.195 ± 1.581
2.195GlnGln: 2.195 ± 1.088
4.115GlnArg: 4.115 ± 1.353
2.469GlnSer: 2.469 ± 0.776
2.195GlnThr: 2.195 ± 0.677
4.39GlnVal: 4.39 ± 2.073
0.549GlnTrp: 0.549 ± 0.372
1.646GlnTyr: 1.646 ± 0.634
0.0GlnXaa: 0.0 ± 0.0
Arg
4.39ArgAla: 4.39 ± 0.723
0.549ArgCys: 0.549 ± 0.452
3.567ArgAsp: 3.567 ± 0.752
4.938ArgGlu: 4.938 ± 1.08
1.372ArgPhe: 1.372 ± 0.665
3.567ArgGly: 3.567 ± 0.907
0.823ArgHis: 0.823 ± 0.76
4.115ArgIle: 4.115 ± 1.884
4.39ArgLys: 4.39 ± 1.402
3.292ArgLeu: 3.292 ± 1.812
1.646ArgMet: 1.646 ± 0.381
1.92ArgAsn: 1.92 ± 0.748
3.567ArgPro: 3.567 ± 1.328
6.31ArgGln: 6.31 ± 1.162
4.664ArgArg: 4.664 ± 3.256
2.743ArgSer: 2.743 ± 1.302
1.372ArgThr: 1.372 ± 0.49
2.469ArgVal: 2.469 ± 0.719
3.292ArgTrp: 3.292 ± 0.641
1.097ArgTyr: 1.097 ± 0.439
0.0ArgXaa: 0.0 ± 0.0
Ser
3.018SerAla: 3.018 ± 0.518
0.549SerCys: 0.549 ± 0.219
2.469SerAsp: 2.469 ± 0.468
4.39SerGlu: 4.39 ± 0.865
1.646SerPhe: 1.646 ± 0.821
4.115SerGly: 4.115 ± 1.37
0.549SerHis: 0.549 ± 0.516
2.743SerIle: 2.743 ± 0.558
2.195SerLys: 2.195 ± 0.795
6.584SerLeu: 6.584 ± 2.371
0.823SerMet: 0.823 ± 0.33
2.195SerAsn: 2.195 ± 0.855
4.664SerPro: 4.664 ± 1.189
5.487SerGln: 5.487 ± 2.164
2.743SerArg: 2.743 ± 1.086
3.567SerSer: 3.567 ± 0.65
3.841SerThr: 3.841 ± 1.739
2.469SerVal: 2.469 ± 0.446
0.549SerTrp: 0.549 ± 0.219
1.372SerTyr: 1.372 ± 0.999
0.0SerXaa: 0.0 ± 0.0
Thr
4.39ThrAla: 4.39 ± 1.21
0.0ThrCys: 0.0 ± 0.0
1.92ThrAsp: 1.92 ± 0.835
4.664ThrGlu: 4.664 ± 1.234
0.823ThrPhe: 0.823 ± 0.314
3.018ThrGly: 3.018 ± 0.533
1.92ThrHis: 1.92 ± 1.112
3.841ThrIle: 3.841 ± 0.923
4.115ThrLys: 4.115 ± 1.051
5.761ThrLeu: 5.761 ± 1.226
1.646ThrMet: 1.646 ± 0.699
3.841ThrAsn: 3.841 ± 0.596
3.567ThrPro: 3.567 ± 0.954
2.195ThrGln: 2.195 ± 0.721
2.195ThrArg: 2.195 ± 0.863
4.664ThrSer: 4.664 ± 1.179
3.292ThrThr: 3.292 ± 0.757
4.664ThrVal: 4.664 ± 1.266
2.195ThrTrp: 2.195 ± 0.799
1.372ThrTyr: 1.372 ± 0.893
0.0ThrXaa: 0.0 ± 0.0
Val
3.292ValAla: 3.292 ± 1.015
0.549ValCys: 0.549 ± 0.911
3.567ValAsp: 3.567 ± 1.322
4.115ValGlu: 4.115 ± 1.246
0.823ValPhe: 0.823 ± 0.33
5.213ValGly: 5.213 ± 0.73
3.292ValHis: 3.292 ± 1.023
3.841ValIle: 3.841 ± 0.689
4.39ValLys: 4.39 ± 1.066
3.567ValLeu: 3.567 ± 0.842
0.274ValMet: 0.274 ± 0.366
2.743ValAsn: 2.743 ± 0.836
3.018ValPro: 3.018 ± 0.842
4.115ValGln: 4.115 ± 1.245
2.469ValArg: 2.469 ± 0.714
4.39ValSer: 4.39 ± 1.764
3.567ValThr: 3.567 ± 0.752
4.39ValVal: 4.39 ± 1.323
2.195ValTrp: 2.195 ± 0.585
1.097ValTyr: 1.097 ± 0.489
0.0ValXaa: 0.0 ± 0.0
Trp
1.92TrpAla: 1.92 ± 0.529
0.274TrpCys: 0.274 ± 0.375
1.646TrpAsp: 1.646 ± 0.643
1.646TrpGlu: 1.646 ± 0.561
0.823TrpPhe: 0.823 ± 0.622
2.195TrpGly: 2.195 ± 0.865
0.274TrpHis: 0.274 ± 0.366
1.097TrpIle: 1.097 ± 0.48
3.018TrpLys: 3.018 ± 0.72
0.823TrpLeu: 0.823 ± 0.687
1.646TrpMet: 1.646 ± 0.509
1.646TrpAsn: 1.646 ± 1.269
1.097TrpPro: 1.097 ± 0.48
1.92TrpGln: 1.92 ± 0.693
2.195TrpArg: 2.195 ± 0.721
1.372TrpSer: 1.372 ± 0.793
1.372TrpThr: 1.372 ± 0.86
1.372TrpVal: 1.372 ± 0.518
0.823TrpTrp: 0.823 ± 0.33
0.549TrpTyr: 0.549 ± 0.219
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.097TyrAla: 1.097 ± 0.438
1.646TyrCys: 1.646 ± 0.859
0.823TyrAsp: 0.823 ± 0.33
0.823TyrGlu: 0.823 ± 0.702
1.372TyrPhe: 1.372 ± 0.752
1.372TyrGly: 1.372 ± 1.163
0.823TyrHis: 0.823 ± 0.314
0.549TyrIle: 0.549 ± 0.219
3.292TyrLys: 3.292 ± 1.101
1.372TyrLeu: 1.372 ± 0.468
0.274TyrMet: 0.274 ± 0.186
1.372TyrAsn: 1.372 ± 0.681
1.372TyrPro: 1.372 ± 0.665
2.195TyrGln: 2.195 ± 0.741
1.097TyrArg: 1.097 ± 0.499
1.372TyrSer: 1.372 ± 0.331
1.097TyrThr: 1.097 ± 0.419
1.646TyrVal: 1.646 ± 0.626
1.097TyrTrp: 1.097 ± 0.538
1.097TyrTyr: 1.097 ± 0.406
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3646 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski