Amino acid dipepetide frequency for HIV-1 M_97CD.KFE267

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.128AlaAla: 3.128 ± 2.268
1.738AlaCys: 1.738 ± 0.721
1.043AlaAsp: 1.043 ± 0.568
5.909AlaGlu: 5.909 ± 1.935
1.738AlaPhe: 1.738 ± 0.483
4.866AlaGly: 4.866 ± 1.666
1.39AlaHis: 1.39 ± 0.545
5.561AlaIle: 5.561 ± 2.174
3.128AlaLys: 3.128 ± 0.895
5.909AlaLeu: 5.909 ± 2.144
1.39AlaMet: 1.39 ± 0.545
2.781AlaAsn: 2.781 ± 0.617
2.433AlaPro: 2.433 ± 1.133
2.433AlaGln: 2.433 ± 0.587
4.171AlaArg: 4.171 ± 1.557
3.823AlaSer: 3.823 ± 1.158
2.781AlaThr: 2.781 ± 1.09
5.909AlaVal: 5.909 ± 1.585
1.39AlaTrp: 1.39 ± 0.467
1.738AlaTyr: 1.738 ± 0.583
0.0AlaXaa: 0.0 ± 0.0
Cys
1.39CysAla: 1.39 ± 0.996
0.0CysCys: 0.0 ± 0.0
0.348CysAsp: 0.348 ± 0.216
0.348CysGlu: 0.348 ± 0.535
0.695CysPhe: 0.695 ± 0.508
1.043CysGly: 1.043 ± 0.648
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.39CysLys: 1.39 ± 0.493
0.695CysLeu: 0.695 ± 0.575
0.348CysMet: 0.348 ± 0.743
1.738CysAsn: 1.738 ± 1.047
0.348CysPro: 0.348 ± 0.288
1.738CysGln: 1.738 ± 0.573
2.086CysArg: 2.086 ± 0.74
1.043CysSer: 1.043 ± 0.49
2.086CysThr: 2.086 ± 0.74
1.738CysVal: 1.738 ± 0.721
0.695CysTrp: 0.695 ± 0.432
0.348CysTyr: 0.348 ± 0.673
0.0CysXaa: 0.0 ± 0.0
Asp
1.738AspAla: 1.738 ± 1.042
3.128AspCys: 3.128 ± 1.284
1.043AspAsp: 1.043 ± 0.648
1.39AspGlu: 1.39 ± 0.467
1.39AspPhe: 1.39 ± 0.545
2.781AspGly: 2.781 ± 1.505
0.695AspHis: 0.695 ± 1.164
5.214AspIle: 5.214 ± 1.235
3.823AspLys: 3.823 ± 1.444
3.823AspLeu: 3.823 ± 1.538
0.695AspMet: 0.695 ± 0.253
3.128AspAsn: 3.128 ± 1.471
2.781AspPro: 2.781 ± 0.585
2.086AspGln: 2.086 ± 0.722
3.128AspArg: 3.128 ± 3.536
2.086AspSer: 2.086 ± 1.3
2.781AspThr: 2.781 ± 0.853
1.043AspVal: 1.043 ± 0.648
0.695AspTrp: 0.695 ± 0.891
1.043AspTyr: 1.043 ± 0.648
0.0AspXaa: 0.0 ± 0.0
Glu
5.561GluAla: 5.561 ± 1.485
0.0GluCys: 0.0 ± 0.0
3.128GluAsp: 3.128 ± 1.3
6.604GluGlu: 6.604 ± 4.061
1.738GluPhe: 1.738 ± 0.51
5.561GluGly: 5.561 ± 0.979
0.348GluHis: 0.348 ± 0.216
4.866GluIle: 4.866 ± 1.318
4.866GluLys: 4.866 ± 0.876
7.299GluLeu: 7.299 ± 1.661
1.043GluMet: 1.043 ± 1.145
2.433GluAsn: 2.433 ± 0.978
3.128GluPro: 3.128 ± 0.849
3.823GluGln: 3.823 ± 0.941
3.476GluArg: 3.476 ± 0.966
2.433GluSer: 2.433 ± 1.457
4.866GluThr: 4.866 ± 1.503
4.171GluVal: 4.171 ± 2.139
1.738GluTrp: 1.738 ± 0.747
1.39GluTyr: 1.39 ± 1.337
0.0GluXaa: 0.0 ± 0.0
Phe
1.39PheAla: 1.39 ± 0.493
0.695PheCys: 0.695 ± 0.575
1.738PheAsp: 1.738 ± 1.478
0.0PheGlu: 0.0 ± 0.0
1.39PhePhe: 1.39 ± 0.467
0.695PheGly: 0.695 ± 0.592
0.0PheHis: 0.0 ± 0.0
1.738PheIle: 1.738 ± 0.583
1.043PheLys: 1.043 ± 0.648
2.433PheLeu: 2.433 ± 0.587
0.0PheMet: 0.0 ± 0.0
3.128PheAsn: 3.128 ± 1.471
2.086PhePro: 2.086 ± 1.483
0.695PheGln: 0.695 ± 0.247
3.128PheArg: 3.128 ± 1.184
1.39PheSer: 1.39 ± 0.478
1.738PheThr: 1.738 ± 0.745
0.695PheVal: 0.695 ± 0.668
0.348PheTrp: 0.348 ± 0.216
2.086PheTyr: 2.086 ± 0.911
0.0PheXaa: 0.0 ± 0.0
Gly
4.171GlyAla: 4.171 ± 1.17
2.086GlyCys: 2.086 ± 0.719
2.781GlyAsp: 2.781 ± 2.486
3.128GlyGlu: 3.128 ± 0.618
2.433GlyPhe: 2.433 ± 1.15
6.257GlyGly: 6.257 ± 1.235
3.128GlyHis: 3.128 ± 1.496
5.909GlyIle: 5.909 ± 1.452
4.866GlyLys: 4.866 ± 1.773
6.257GlyLeu: 6.257 ± 2.231
1.738GlyMet: 1.738 ± 0.826
1.738GlyAsn: 1.738 ± 1.042
4.171GlyPro: 4.171 ± 1.272
5.214GlyGln: 5.214 ± 1.087
4.171GlyArg: 4.171 ± 1.796
3.823GlySer: 3.823 ± 1.111
2.433GlyThr: 2.433 ± 0.587
4.171GlyVal: 4.171 ± 0.619
2.086GlyTrp: 2.086 ± 1.19
2.433GlyTyr: 2.433 ± 1.134
0.0GlyXaa: 0.0 ± 0.0
His
1.043HisAla: 1.043 ± 0.49
0.348HisCys: 0.348 ± 0.288
0.348HisAsp: 0.348 ± 0.216
1.043HisGlu: 1.043 ± 0.594
1.043HisPhe: 1.043 ± 1.41
2.086HisGly: 2.086 ± 1.054
0.348HisHis: 0.348 ± 0.535
1.043HisIle: 1.043 ± 0.958
1.39HisLys: 1.39 ± 1.015
2.433HisLeu: 2.433 ± 1.232
0.695HisMet: 0.695 ± 0.573
1.738HisAsn: 1.738 ± 1.042
2.086HisPro: 2.086 ± 1.228
1.738HisGln: 1.738 ± 1.081
1.39HisArg: 1.39 ± 0.722
1.738HisSer: 1.738 ± 0.851
1.043HisThr: 1.043 ± 0.568
0.695HisVal: 0.695 ± 0.247
0.348HisTrp: 0.348 ± 0.288
1.043HisTyr: 1.043 ± 0.482
0.0HisXaa: 0.0 ± 0.0
Ile
3.476IleAla: 3.476 ± 1.06
1.39IleCys: 1.39 ± 0.545
1.738IleAsp: 1.738 ± 1.208
4.519IleGlu: 4.519 ± 0.983
1.043IlePhe: 1.043 ± 0.364
5.909IleGly: 5.909 ± 1.829
2.433IleHis: 2.433 ± 0.66
5.909IleIle: 5.909 ± 2.126
3.823IleLys: 3.823 ± 1.3
6.257IleLeu: 6.257 ± 0.741
1.043IleMet: 1.043 ± 0.49
2.086IleAsn: 2.086 ± 0.74
4.519IlePro: 4.519 ± 1.473
3.476IleGln: 3.476 ± 1.491
5.561IleArg: 5.561 ± 1.578
2.433IleSer: 2.433 ± 1.233
2.086IleThr: 2.086 ± 1.357
7.299IleVal: 7.299 ± 1.652
1.39IleTrp: 1.39 ± 0.493
2.781IleTyr: 2.781 ± 0.908
0.0IleXaa: 0.0 ± 0.0
Lys
4.171LysAla: 4.171 ± 1.299
1.738LysCys: 1.738 ± 0.565
5.214LysAsp: 5.214 ± 1.253
6.257LysGlu: 6.257 ± 1.864
0.348LysPhe: 0.348 ± 0.216
4.171LysGly: 4.171 ± 0.987
1.39LysHis: 1.39 ± 1.012
5.561LysIle: 5.561 ± 2.823
5.214LysLys: 5.214 ± 2.522
4.866LysLeu: 4.866 ± 1.06
0.695LysMet: 0.695 ± 0.432
2.781LysAsn: 2.781 ± 1.53
2.086LysPro: 2.086 ± 0.964
3.476LysGln: 3.476 ± 1.042
2.086LysArg: 2.086 ± 0.567
2.781LysSer: 2.781 ± 0.806
4.171LysThr: 4.171 ± 1.286
5.909LysVal: 5.909 ± 1.996
2.433LysTrp: 2.433 ± 0.99
3.128LysTyr: 3.128 ± 0.489
0.0LysXaa: 0.0 ± 0.0
Leu
5.909LeuAla: 5.909 ± 1.246
1.043LeuCys: 1.043 ± 0.49
4.519LeuAsp: 4.519 ± 1.157
6.257LeuGlu: 6.257 ± 1.465
2.781LeuPhe: 2.781 ± 1.139
5.909LeuGly: 5.909 ± 1.984
2.781LeuHis: 2.781 ± 2.075
3.476LeuIle: 3.476 ± 1.564
9.385LeuLys: 9.385 ± 1.428
7.299LeuLeu: 7.299 ± 2.336
0.695LeuMet: 0.695 ± 1.153
3.476LeuAsn: 3.476 ± 1.491
2.433LeuPro: 2.433 ± 0.883
4.866LeuGln: 4.866 ± 1.814
5.214LeuArg: 5.214 ± 0.876
3.128LeuSer: 3.128 ± 1.078
4.866LeuThr: 4.866 ± 0.756
5.909LeuVal: 5.909 ± 2.139
3.823LeuTrp: 3.823 ± 0.938
1.043LeuTyr: 1.043 ± 0.364
0.0LeuXaa: 0.0 ± 0.0
Met
1.043MetAla: 1.043 ± 0.648
0.0MetCys: 0.0 ± 0.0
1.043MetAsp: 1.043 ± 0.584
1.043MetGlu: 1.043 ± 0.908
0.695MetPhe: 0.695 ± 0.247
3.128MetGly: 3.128 ± 1.459
0.695MetHis: 0.695 ± 0.575
1.39MetIle: 1.39 ± 0.61
0.695MetLys: 0.695 ± 0.247
2.086MetLeu: 2.086 ± 1.192
0.695MetMet: 0.695 ± 0.432
0.695MetAsn: 0.695 ± 0.508
0.0MetPro: 0.0 ± 0.0
1.043MetGln: 1.043 ± 0.364
2.086MetArg: 2.086 ± 1.19
1.043MetSer: 1.043 ± 1.411
2.781MetThr: 2.781 ± 0.563
1.043MetVal: 1.043 ± 0.49
0.695MetTrp: 0.695 ± 0.575
1.043MetTyr: 1.043 ± 0.482
0.0MetXaa: 0.0 ± 0.0
Asn
1.738AsnAla: 1.738 ± 0.745
2.781AsnCys: 2.781 ± 1.53
1.39AsnAsp: 1.39 ± 0.545
1.738AsnGlu: 1.738 ± 1.393
2.433AsnPhe: 2.433 ± 1.163
2.433AsnGly: 2.433 ± 0.969
0.348AsnHis: 0.348 ± 0.535
2.781AsnIle: 2.781 ± 1.208
2.781AsnLys: 2.781 ± 0.457
2.781AsnLeu: 2.781 ± 0.986
2.433AsnMet: 2.433 ± 1.616
3.476AsnAsn: 3.476 ± 1.718
3.476AsnPro: 3.476 ± 1.272
1.043AsnGln: 1.043 ± 0.364
1.39AsnArg: 1.39 ± 0.478
4.171AsnSer: 4.171 ± 1.876
4.519AsnThr: 4.519 ± 1.01
1.738AsnVal: 1.738 ± 1.213
2.086AsnTrp: 2.086 ± 0.74
1.043AsnTyr: 1.043 ± 0.544
0.0AsnXaa: 0.0 ± 0.0
Pro
3.128ProAla: 3.128 ± 0.779
1.043ProCys: 1.043 ± 0.863
3.476ProAsp: 3.476 ± 1.605
3.823ProGlu: 3.823 ± 1.136
1.39ProPhe: 1.39 ± 0.865
4.519ProGly: 4.519 ± 1.211
0.348ProHis: 0.348 ± 0.216
5.561ProIle: 5.561 ± 2.066
2.433ProLys: 2.433 ± 1.01
4.519ProLeu: 4.519 ± 1.207
1.043ProMet: 1.043 ± 1.25
0.695ProAsn: 0.695 ± 0.575
2.433ProPro: 2.433 ± 0.883
2.781ProGln: 2.781 ± 0.908
2.086ProArg: 2.086 ± 0.675
1.738ProSer: 1.738 ± 0.607
1.738ProThr: 1.738 ± 0.721
4.171ProVal: 4.171 ± 0.908
1.043ProTrp: 1.043 ± 1.588
1.043ProTyr: 1.043 ± 0.675
0.0ProXaa: 0.0 ± 0.0
Gln
5.561GlnAla: 5.561 ± 1.077
0.695GlnCys: 0.695 ± 0.575
2.086GlnAsp: 2.086 ± 0.953
3.128GlnGlu: 3.128 ± 0.618
0.348GlnPhe: 0.348 ± 0.673
4.866GlnGly: 4.866 ± 0.71
1.738GlnHis: 1.738 ± 1.292
4.171GlnIle: 4.171 ± 1.418
3.823GlnLys: 3.823 ± 2.052
7.647GlnLeu: 7.647 ± 0.941
2.433GlnMet: 2.433 ± 1.163
2.433GlnAsn: 2.433 ± 1.113
1.043GlnPro: 1.043 ± 0.648
3.476GlnGln: 3.476 ± 1.064
2.781GlnArg: 2.781 ± 2.18
2.086GlnSer: 2.086 ± 0.98
1.043GlnThr: 1.043 ± 0.482
3.823GlnVal: 3.823 ± 1.691
1.39GlnTrp: 1.39 ± 0.493
1.738GlnTyr: 1.738 ± 1.056
0.0GlnXaa: 0.0 ± 0.0
Arg
5.561ArgAla: 5.561 ± 0.925
0.0ArgCys: 0.0 ± 0.0
3.476ArgAsp: 3.476 ± 1.116
6.604ArgGlu: 6.604 ± 1.531
0.695ArgPhe: 0.695 ± 0.432
3.476ArgGly: 3.476 ± 1.404
1.043ArgHis: 1.043 ± 1.417
3.823ArgIle: 3.823 ± 2.036
5.214ArgLys: 5.214 ± 2.482
4.171ArgLeu: 4.171 ± 1.066
1.738ArgMet: 1.738 ± 1.81
1.39ArgAsn: 1.39 ± 0.493
3.128ArgPro: 3.128 ± 2.066
3.128ArgGln: 3.128 ± 1.057
2.781ArgArg: 2.781 ± 2.463
2.086ArgSer: 2.086 ± 1.083
3.128ArgThr: 3.128 ± 1.402
2.433ArgVal: 2.433 ± 1.252
1.738ArgTrp: 1.738 ± 1.056
1.043ArgTyr: 1.043 ± 0.785
0.0ArgXaa: 0.0 ± 0.0
Ser
2.433SerAla: 2.433 ± 2.525
0.348SerCys: 0.348 ± 0.216
2.086SerAsp: 2.086 ± 1.081
3.823SerGlu: 3.823 ± 1.094
2.086SerPhe: 2.086 ± 0.98
2.433SerGly: 2.433 ± 1.233
0.695SerHis: 0.695 ± 0.247
2.781SerIle: 2.781 ± 1.092
2.086SerLys: 2.086 ± 1.299
5.214SerLeu: 5.214 ± 2.04
1.39SerMet: 1.39 ± 0.546
4.519SerAsn: 4.519 ± 1.774
3.476SerPro: 3.476 ± 1.255
3.128SerGln: 3.128 ± 1.284
2.086SerArg: 2.086 ± 1.26
5.214SerSer: 5.214 ± 2.562
3.823SerThr: 3.823 ± 1.233
2.086SerVal: 2.086 ± 0.955
1.043SerTrp: 1.043 ± 0.364
0.348SerTyr: 0.348 ± 0.288
0.0SerXaa: 0.0 ± 0.0
Thr
3.476ThrAla: 3.476 ± 1.262
0.0ThrCys: 0.0 ± 0.0
3.476ThrAsp: 3.476 ± 1.165
5.214ThrGlu: 5.214 ± 0.914
1.39ThrPhe: 1.39 ± 0.637
3.128ThrGly: 3.128 ± 1.078
1.043ThrHis: 1.043 ± 0.49
2.781ThrIle: 2.781 ± 0.939
2.781ThrLys: 2.781 ± 1.432
4.519ThrLeu: 4.519 ± 1.52
1.738ThrMet: 1.738 ± 0.823
2.086ThrAsn: 2.086 ± 0.585
4.171ThrPro: 4.171 ± 1.17
2.781ThrGln: 2.781 ± 0.457
2.086ThrArg: 2.086 ± 1.476
2.781ThrSer: 2.781 ± 0.563
3.476ThrThr: 3.476 ± 1.441
4.866ThrVal: 4.866 ± 1.662
2.433ThrTrp: 2.433 ± 0.642
1.39ThrTyr: 1.39 ± 1.264
0.0ThrXaa: 0.0 ± 0.0
Val
5.214ValAla: 5.214 ± 0.945
0.0ValCys: 0.0 ± 0.0
2.781ValAsp: 2.781 ± 1.063
4.171ValGlu: 4.171 ± 1.348
0.695ValPhe: 0.695 ± 0.247
5.214ValGly: 5.214 ± 1.3
3.128ValHis: 3.128 ± 0.689
5.214ValIle: 5.214 ± 1.526
5.214ValLys: 5.214 ± 2.664
3.476ValLeu: 3.476 ± 1.272
1.043ValMet: 1.043 ± 0.733
2.781ValAsn: 2.781 ± 0.986
3.128ValPro: 3.128 ± 1.134
4.519ValGln: 4.519 ± 1.52
3.128ValArg: 3.128 ± 1.221
4.171ValSer: 4.171 ± 1.588
3.476ValThr: 3.476 ± 1.165
3.128ValVal: 3.128 ± 2.323
2.433ValTrp: 2.433 ± 1.13
1.39ValTyr: 1.39 ± 0.467
0.0ValXaa: 0.0 ± 0.0
Trp
1.738TrpAla: 1.738 ± 0.583
0.348TrpCys: 0.348 ± 0.673
1.738TrpAsp: 1.738 ± 0.721
2.433TrpGlu: 2.433 ± 0.567
0.695TrpPhe: 0.695 ± 0.573
2.781TrpGly: 2.781 ± 1.24
0.348TrpHis: 0.348 ± 0.535
0.695TrpIle: 0.695 ± 0.432
2.433TrpLys: 2.433 ± 0.901
1.39TrpLeu: 1.39 ± 0.975
1.39TrpMet: 1.39 ± 0.748
1.39TrpAsn: 1.39 ± 0.955
1.043TrpPro: 1.043 ± 0.584
3.128TrpGln: 3.128 ± 1.784
1.39TrpArg: 1.39 ± 0.493
1.738TrpSer: 1.738 ± 1.578
1.738TrpThr: 1.738 ± 0.745
1.738TrpVal: 1.738 ± 1.1
1.043TrpTrp: 1.043 ± 0.364
0.695TrpTyr: 0.695 ± 0.247
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.39TyrAla: 1.39 ± 0.493
1.043TyrCys: 1.043 ± 0.49
1.043TyrAsp: 1.043 ± 0.364
1.043TyrGlu: 1.043 ± 0.908
1.39TyrPhe: 1.39 ± 0.722
1.738TyrGly: 1.738 ± 1.002
1.738TyrHis: 1.738 ± 0.947
0.695TyrIle: 0.695 ± 0.247
2.086TyrLys: 2.086 ± 1.86
1.738TyrLeu: 1.738 ± 0.548
0.348TyrMet: 0.348 ± 0.216
2.086TyrAsn: 2.086 ± 1.706
1.043TyrPro: 1.043 ± 0.584
1.738TyrGln: 1.738 ± 0.857
2.433TyrArg: 2.433 ± 1.359
1.39TyrSer: 1.39 ± 0.467
1.043TyrThr: 1.043 ± 0.544
1.738TyrVal: 1.738 ± 0.857
1.043TyrTrp: 1.043 ± 0.482
1.043TyrTyr: 1.043 ± 0.364
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2878 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski