Amino acid dipeptide frequency for HIV-1 M_02CD.LBTB032

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
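As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides and reports their frequencies in per mille next to the uniform expectation. The function name `dipeptide_frequencies`, the toy sequences, the mapping of every non-standard letter to `X`, and the choice to count pairs within each protein separately are all assumptions made for illustration; they are not taken from the Proteome-pI pipeline.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: overlapping pairs are counted within each protein, and any
    non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1/400 of all dipeptides (1/441 if Xaa is included).
EXPECTED_20 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
EXPECTED_21 = 1000.0 / 441  # roughly 2.27 per mille

# Toy input (hypothetical sequences, not the HIV-1 proteome).
freqs = dipeptide_frequencies(["MGARASVL", "AAGAB"])
overrepresented = sorted(p for p, f in freqs.items() if f > EXPECTED_20)
```

With so few residues every observed dipeptide exceeds the uniform expectation, so the comparison only becomes meaningful on a full proteome.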

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
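Because the expected frequency is quoted in percent while the table uses per mille, a small conversion helper can make the two comparable. This is only a sketch; the function names are hypothetical and the value 4.883 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 4.883  # AlaAla entry from the table, in per mille
ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_percent = per_mille_to_percent(ala_ala)
```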

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.883 ± 0.935
AlaCys: 1.395 ± 0.491
AlaAsp: 1.744 ± 1.189
AlaGlu: 5.581 ± 1.571
AlaPhe: 1.744 ± 0.534
AlaGly: 6.278 ± 1.204
AlaHis: 1.744 ± 0.534
AlaIle: 5.581 ± 1.005
AlaLys: 3.139 ± 0.66
AlaLeu: 5.232 ± 1.262
AlaMet: 1.046 ± 0.675
AlaAsn: 2.093 ± 1.068
AlaPro: 2.093 ± 1.236
AlaGln: 2.442 ± 0.578
AlaArg: 3.139 ± 0.497
AlaSer: 4.186 ± 1.016
AlaThr: 2.79 ± 0.897
AlaVal: 3.488 ± 0.808
AlaTrp: 1.744 ± 0.673
AlaTyr: 1.046 ± 0.347
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.046 ± 0.808
CysCys: 0.0 ± 0.0
CysAsp: 0.349 ± 0.225
CysGlu: 0.349 ± 0.591
CysPhe: 0.698 ± 0.541
CysGly: 1.046 ± 0.675
CysHis: 0.0 ± 0.0
CysIle: 0.349 ± 0.318
CysLys: 1.046 ± 0.347
CysLeu: 1.046 ± 0.808
CysMet: 0.0 ± 0.574
CysAsn: 1.395 ± 1.144
CysPro: 0.349 ± 0.318
CysGln: 1.046 ± 0.675
CysArg: 1.395 ± 0.83
CysSer: 1.395 ± 0.629
CysThr: 2.79 ± 0.983
CysVal: 2.093 ± 1.045
CysTrp: 0.698 ± 0.45
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.093 ± 1.045
AspCys: 3.139 ± 1.18
AspAsp: 1.395 ± 0.9
AspGlu: 0.698 ± 0.627
AspPhe: 1.046 ± 0.675
AspGly: 2.79 ± 1.107
AspHis: 0.0 ± 0.0
AspIle: 4.883 ± 1.585
AspLys: 4.534 ± 1.015
AspLeu: 3.837 ± 1.599
AspMet: 0.698 ± 0.269
AspAsn: 2.442 ± 0.992
AspPro: 2.442 ± 0.503
AspGln: 2.093 ± 0.801
AspArg: 2.79 ± 0.968
AspSer: 3.139 ± 0.981
AspThr: 2.442 ± 1.25
AspVal: 1.395 ± 0.751
AspTrp: 1.046 ± 1.011
AspTyr: 1.046 ± 0.621
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.232 ± 1.747
GluCys: 0.0 ± 0.0
GluAsp: 2.442 ± 1.012
GluGlu: 6.278 ± 1.61
GluPhe: 1.395 ± 0.531
GluGly: 5.232 ± 1.601
GluHis: 1.046 ± 0.654
GluIle: 3.837 ± 0.89
GluLys: 4.883 ± 0.906
GluLeu: 6.627 ± 1.698
GluMet: 1.046 ± 0.347
GluAsn: 2.79 ± 0.566
GluPro: 3.837 ± 1.03
GluGln: 3.488 ± 1.212
GluArg: 3.837 ± 1.017
GluSer: 2.093 ± 1.16
GluThr: 5.232 ± 1.435
GluVal: 3.837 ± 0.989
GluTrp: 1.744 ± 0.79
GluTyr: 1.395 ± 0.786
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.744 ± 0.752
PheCys: 0.349 ± 0.318
PheAsp: 1.395 ± 1.515
PheGlu: 0.349 ± 0.318
PhePhe: 2.093 ± 1.09
PheGly: 1.395 ± 1.205
PheHis: 0.0 ± 0.0
PheIle: 1.395 ± 0.531
PheLys: 1.395 ± 0.531
PheLeu: 1.744 ± 0.505
PheMet: 0.0 ± 0.0
PheAsn: 2.79 ± 1.366
PhePro: 1.744 ± 1.515
PheGln: 1.046 ± 0.347
PheArg: 3.488 ± 1.429
PheSer: 2.093 ± 0.76
PheThr: 1.744 ± 0.738
PheVal: 1.046 ± 1.094
PheTrp: 0.349 ± 0.225
PheTyr: 1.395 ± 1.041
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.93 ± 1.41
GlyCys: 1.744 ± 0.61
GlyAsp: 2.442 ± 1.06
GlyGlu: 3.488 ± 0.361
GlyPhe: 2.79 ± 1.745
GlyGly: 6.976 ± 1.42
GlyHis: 3.488 ± 2.283
GlyIle: 6.976 ± 1.214
GlyLys: 6.627 ± 2.509
GlyLeu: 5.581 ± 1.9
GlyMet: 0.698 ± 0.627
GlyAsn: 3.139 ± 1.159
GlyPro: 4.534 ± 1.241
GlyGln: 4.186 ± 1.455
GlyArg: 3.488 ± 1.255
GlySer: 3.488 ± 0.78
GlyThr: 1.744 ± 0.768
GlyVal: 4.186 ± 1.213
GlyTrp: 1.744 ± 1.383
GlyTyr: 1.744 ± 0.738
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.698 ± 0.246
HisCys: 0.349 ± 0.318
HisAsp: 0.0 ± 0.0
HisGlu: 0.698 ± 0.669
HisPhe: 0.698 ± 1.454
HisGly: 2.093 ± 1.054
HisHis: 1.395 ± 2.027
HisIle: 1.046 ± 1.041
HisLys: 1.046 ± 0.654
HisLeu: 3.139 ± 1.166
HisMet: 1.046 ± 1.085
HisAsn: 0.698 ± 0.246
HisPro: 2.442 ± 1.658
HisGln: 1.744 ± 1.125
HisArg: 1.395 ± 1.06
HisSer: 2.442 ± 1.916
HisThr: 1.395 ± 0.49
HisVal: 0.698 ± 0.627
HisTrp: 0.0 ± 0.0
HisTyr: 1.046 ± 0.798
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.488 ± 1.504
IleCys: 1.046 ± 0.347
IleAsp: 1.395 ± 1.164
IleGlu: 4.186 ± 0.934
IlePhe: 1.395 ± 0.491
IleGly: 5.232 ± 1.565
IleHis: 2.442 ± 0.908
IleIle: 8.72 ± 3.399
IleLys: 4.534 ± 1.557
IleLeu: 5.581 ± 1.031
IleMet: 1.046 ± 0.522
IleAsn: 3.488 ± 1.448
IlePro: 4.534 ± 1.304
IleGln: 4.186 ± 1.876
IleArg: 6.278 ± 1.616
IleSer: 1.744 ± 0.558
IleThr: 3.837 ± 1.693
IleVal: 8.022 ± 1.465
IleTrp: 1.395 ± 0.491
IleTyr: 3.139 ± 0.656
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.232 ± 1.531
LysCys: 1.395 ± 0.531
LysAsp: 3.837 ± 1.175
LysGlu: 5.232 ± 2.084
LysPhe: 0.698 ± 0.627
LysGly: 4.534 ± 0.852
LysHis: 1.744 ± 0.79
LysIle: 6.627 ± 1.949
LysLys: 5.581 ± 2.308
LysLeu: 6.278 ± 1.155
LysMet: 0.349 ± 0.225
LysAsn: 3.139 ± 1.972
LysPro: 2.093 ± 0.76
LysGln: 3.837 ± 1.398
LysArg: 3.139 ± 0.621
LysSer: 2.442 ± 0.578
LysThr: 3.837 ± 1.47
LysVal: 4.883 ± 1.624
LysTrp: 2.79 ± 0.98
LysTyr: 2.093 ± 0.467
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.534 ± 0.695
LeuCys: 1.046 ± 0.522
LeuAsp: 4.883 ± 1.525
LeuGlu: 6.278 ± 1.132
LeuPhe: 2.442 ± 1.518
LeuGly: 5.232 ± 1.241
LeuHis: 2.79 ± 1.236
LeuIle: 3.139 ± 1.221
LeuLys: 8.022 ± 0.498
LeuLeu: 6.976 ± 2.341
LeuMet: 0.349 ± 0.287
LeuAsn: 3.837 ± 1.398
LeuPro: 2.79 ± 0.668
LeuGln: 3.837 ± 1.56
LeuArg: 4.883 ± 0.733
LeuSer: 3.488 ± 0.794
LeuThr: 5.232 ± 0.656
LeuVal: 6.278 ± 0.889
LeuTrp: 3.488 ± 0.987
LeuTyr: 1.395 ± 0.531
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.744 ± 0.558
MetCys: 0.0 ± 0.0
MetAsp: 0.698 ± 0.45
MetGlu: 1.744 ± 1.185
MetPhe: 0.698 ± 0.246
MetGly: 1.395 ± 0.49
MetHis: 0.698 ± 0.637
MetIle: 1.046 ± 0.493
MetLys: 1.744 ± 0.528
MetLeu: 1.744 ± 0.558
MetMet: 0.698 ± 0.45
MetAsn: 0.698 ± 0.541
MetPro: 0.0 ± 0.0
MetGln: 0.698 ± 0.45
MetArg: 1.046 ± 0.94
MetSer: 0.698 ± 0.627
MetThr: 2.442 ± 0.512
MetVal: 0.698 ± 0.246
MetTrp: 0.698 ± 0.637
MetTyr: 0.698 ± 0.541
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.744 ± 0.534
AsnCys: 2.79 ± 1.232
AsnAsp: 1.744 ± 1.125
AsnGlu: 3.139 ± 0.872
AsnPhe: 3.139 ± 1.042
AsnGly: 2.442 ± 1.439
AsnHis: 0.698 ± 1.042
AsnIle: 3.488 ± 2.286
AsnLys: 2.093 ± 0.695
AsnLeu: 2.79 ± 0.983
AsnMet: 2.442 ± 1.775
AsnAsn: 5.232 ± 2.779
AsnPro: 2.442 ± 1.576
AsnGln: 1.744 ± 0.752
AsnArg: 1.395 ± 0.441
AsnSer: 2.093 ± 1.045
AsnThr: 5.232 ± 1.207
AsnVal: 2.442 ± 0.992
AsnTrp: 1.744 ± 0.558
AsnTyr: 1.046 ± 0.545
AsnXaa: 0.349 ± 0.627
Pro
ProAla: 2.79 ± 1.013
ProCys: 1.046 ± 0.955
ProAsp: 2.79 ± 1.171
ProGlu: 3.139 ± 0.976
ProPhe: 1.744 ± 0.902
ProGly: 5.581 ± 0.782
ProHis: 0.349 ± 0.225
ProIle: 5.581 ± 1.625
ProLys: 2.093 ± 0.635
ProLeu: 4.883 ± 0.97
ProMet: 1.046 ± 1.286
ProAsn: 0.698 ± 0.637
ProPro: 2.79 ± 0.859
ProGln: 3.139 ± 0.743
ProArg: 2.79 ± 0.907
ProSer: 2.093 ± 0.639
ProThr: 1.395 ± 0.645
ProVal: 3.837 ± 1.175
ProTrp: 1.046 ± 0.643
ProTyr: 1.046 ± 0.654
ProXaa: 0.349 ± 0.225
Gln
GlnAla: 5.581 ± 1.35
GlnCys: 0.698 ± 0.637
GlnAsp: 3.139 ± 1.027
GlnGlu: 3.488 ± 0.914
GlnPhe: 0.349 ± 0.318
GlnGly: 5.581 ± 0.354
GlnHis: 1.395 ± 0.938
GlnIle: 4.186 ± 1.907
GlnLys: 3.488 ± 1.866
GlnLeu: 5.93 ± 1.091
GlnMet: 2.093 ± 0.954
GlnAsn: 3.488 ± 1.424
GlnPro: 1.744 ± 0.498
GlnGln: 3.488 ± 1.138
GlnArg: 1.744 ± 0.956
GlnSer: 1.395 ± 0.531
GlnThr: 1.395 ± 0.83
GlnVal: 3.139 ± 1.756
GlnTrp: 1.395 ± 0.491
GlnTyr: 2.093 ± 1.16
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.534 ± 0.765
ArgCys: 0.349 ± 0.591
ArgAsp: 3.139 ± 0.891
ArgGlu: 5.581 ± 2.566
ArgPhe: 0.698 ± 0.45
ArgGly: 4.883 ± 1.759
ArgHis: 1.046 ± 1.53
ArgIle: 4.883 ± 3.328
ArgLys: 3.837 ± 1.119
ArgLeu: 3.139 ± 0.804
ArgMet: 1.046 ± 0.88
ArgAsn: 1.395 ± 0.531
ArgPro: 2.79 ± 1.376
ArgGln: 3.139 ± 0.73
ArgArg: 3.139 ± 2.851
ArgSer: 3.488 ± 1.211
ArgThr: 2.442 ± 0.908
ArgVal: 2.442 ± 0.989
ArgTrp: 1.395 ± 1.081
ArgTyr: 1.395 ± 0.49
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.744 ± 0.536
SerCys: 0.698 ± 0.246
SerAsp: 2.442 ± 0.635
SerGlu: 3.837 ± 0.912
SerPhe: 1.395 ± 0.83
SerGly: 2.442 ± 0.789
SerHis: 0.0 ± 0.0
SerIle: 3.837 ± 0.933
SerLys: 2.442 ± 2.331
SerLeu: 4.186 ± 1.617
SerMet: 1.395 ± 0.773
SerAsn: 3.139 ± 0.834
SerPro: 3.139 ± 0.821
SerGln: 3.139 ± 1.362
SerArg: 2.093 ± 0.969
SerSer: 4.534 ± 1.574
SerThr: 4.186 ± 1.469
SerVal: 2.79 ± 0.668
SerTrp: 1.395 ± 0.491
SerTyr: 0.349 ± 0.318
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.093 ± 1.064
ThrCys: 0.0 ± 0.0
ThrAsp: 3.488 ± 1.229
ThrGlu: 5.93 ± 1.367
ThrPhe: 1.046 ± 0.808
ThrGly: 3.488 ± 1.116
ThrHis: 1.046 ± 0.522
ThrIle: 4.186 ± 1.306
ThrLys: 3.837 ± 0.842
ThrLeu: 5.232 ± 1.035
ThrMet: 1.395 ± 0.83
ThrAsn: 2.442 ± 0.578
ThrPro: 3.488 ± 1.389
ThrGln: 2.79 ± 0.963
ThrArg: 2.093 ± 1.145
ThrSer: 2.79 ± 0.646
ThrThr: 3.139 ± 0.805
ThrVal: 4.883 ± 1.585
ThrTrp: 2.442 ± 0.755
ThrTyr: 2.093 ± 1.054
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.79 ± 1.261
ValCys: 0.349 ± 0.591
ValAsp: 3.837 ± 1.233
ValGlu: 3.139 ± 1.447
ValPhe: 1.395 ± 0.871
ValGly: 5.232 ± 1.432
ValHis: 2.442 ± 1.075
ValIle: 4.186 ± 1.532
ValLys: 4.883 ± 1.523
ValLeu: 3.488 ± 1.016
ValMet: 1.395 ± 0.858
ValAsn: 3.488 ± 1.229
ValPro: 4.186 ± 1.057
ValGln: 5.232 ± 1.754
ValArg: 2.79 ± 1.485
ValSer: 2.442 ± 0.625
ValThr: 3.837 ± 1.472
ValVal: 3.837 ± 1.766
ValTrp: 2.442 ± 0.877
ValTyr: 1.744 ± 0.505
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.093 ± 0.598
TrpCys: 0.0 ± 0.0
TrpAsp: 2.093 ± 0.737
TrpGlu: 1.744 ± 0.558
TrpPhe: 1.046 ± 0.998
TrpGly: 2.093 ± 0.721
TrpHis: 0.698 ± 1.183
TrpIle: 0.698 ± 0.45
TrpLys: 2.093 ± 0.954
TrpLeu: 1.744 ± 1.51
TrpMet: 1.395 ± 0.9
TrpAsn: 2.093 ± 1.609
TrpPro: 1.395 ± 1.254
TrpGln: 2.442 ± 1.093
TrpArg: 1.744 ± 0.752
TrpSer: 1.046 ± 0.94
TrpThr: 1.744 ± 0.61
TrpVal: 1.744 ± 1.131
TrpTrp: 0.698 ± 0.45
TrpTyr: 0.698 ± 0.246
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.395 ± 0.491
TyrCys: 1.046 ± 0.522
TyrAsp: 0.698 ± 0.45
TyrGlu: 1.395 ± 0.896
TyrPhe: 1.046 ± 0.58
TyrGly: 1.046 ± 0.643
TyrHis: 1.046 ± 0.545
TyrIle: 0.698 ± 0.246
TyrLys: 2.442 ± 1.876
TyrLeu: 1.744 ± 0.867
TyrMet: 0.349 ± 0.225
TyrAsn: 1.744 ± 0.738
TyrPro: 1.395 ± 0.49
TyrGln: 2.093 ± 1.037
TyrArg: 2.093 ± 1.283
TyrSer: 1.744 ± 0.673
TyrThr: 1.046 ± 0.545
TyrVal: 1.395 ± 0.9
TyrTrp: 1.046 ± 0.493
TyrTyr: 1.744 ± 0.534
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.349 ± 0.225
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.349 ± 0.627
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2868 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
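To give an idea of what a protein-level bootstrap can look like, the sketch below resamples whole proteins with replacement 100 times and takes the spread of the recomputed frequency as the error. The function names, the toy sequences, and the use of the standard deviation as the reported error are assumptions for illustration; the source only states that bootstrapping (×100) was done at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the error is the standard deviation of the resampled
    estimates; the database may define it differently.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the HIV-1 proteome).
toy = ["MGARASVL", "AAGAAK", "KKLLAA"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so with only a handful of proteins the estimated errors can be large relative to the values.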

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski