Amino acid dipeptide frequency for Murrumbidgee virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
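As a minimal sketch of how such frequencies can be obtained, the Python snippet below counts overlapping pairs of adjacent residues within each protein and normalizes by the total number of pairs. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice not to count pairs across protein boundaries are assumptions made for illustration; the page does not state how Proteome-pI handles these details.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the Murrumbidgee virus proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

In the toy input there are six pairs in total and "KK" occurs twice, so it accounts for one third of all dipeptides.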

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
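The baseline arithmetic can be written out as a short Python sketch. The helper names `expected_uniform_permille` and `classify` are hypothetical, values are expressed in per mille, and comparing against the 400-dipeptide baseline is an assumption; the page does not say exactly which threshold drives the red and blue marking.

```python
def expected_uniform_permille(n_residue_types=20):
    """Frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform baseline."""
    if observed_permille > expected_permille:
        return "over"   # marked in red in the original table
    if observed_permille < expected_permille:
        return "under"  # marked in blue in the original table
    return "as expected"

baseline_400 = expected_uniform_permille(20)  # 2.5 per mille, i.e. 0.25%
baseline_441 = expected_uniform_permille(21)  # roughly 2.27 per mille

# Two values taken from the table on this page.
label_ile_leu = classify(10.133, baseline_400)  # IleLeu
label_cys_cys = classify(0.26, baseline_400)    # CysCys
```

Under this assumed threshold, IleLeu counts as overrepresented and CysCys as underrepresented.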

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
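For readers who want to work with the table values programmatically, the following Python sketch parses one cell and applies the 10⁻³ conversion. The regular expression and the name `parse_cell` are illustrative assumptions based on how the cells appear on this page (two three-letter codes, a value, and an error separated by "±").

```python
import re

# Matches e.g. "IleLeu: 10.133 ± 0.898" (two three-letter codes, value, error).
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")

def parse_cell(text):
    """Return (first, second, fraction, fraction_error) from one table cell."""
    m = CELL.search(text)
    if m is None:
        raise ValueError(f"not a dipeptide cell: {text!r}")
    first, second, value, error = m.groups()
    # Per mille -> fraction: multiply by 10**-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3

first, second, frac, err = parse_cell("IleLeu: 10.133 ± 0.898")
percent = frac * 100  # the same value expressed in percent
```

Because the pattern is searched rather than anchored, a cell that still carries a leading copy of the value in front of the dipeptide name is parsed the same way.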

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.598 ± 4.059
AlaCys: 1.819 ± 0.535
AlaAsp: 2.598 ± 2.802
AlaGlu: 2.598 ± 1.713
AlaPhe: 1.299 ± 0.368
AlaGly: 0.52 ± 2.211
AlaHis: 2.078 ± 0.682
AlaIle: 5.456 ± 0.399
AlaLys: 5.196 ± 4.543
AlaLeu: 4.417 ± 0.058
AlaMet: 1.299 ± 0.856
AlaAsn: 2.858 ± 1.194
AlaPro: 0.779 ± 0.22
AlaGln: 2.858 ± 2.912
AlaArg: 2.338 ± 0.691
AlaSer: 3.378 ± 0.983
AlaThr: 2.338 ± 0.708
AlaVal: 2.338 ± 1.873
AlaTrp: 0.52 ± 0.179
AlaTyr: 1.299 ± 0.856
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.039 ± 0.341
CysCys: 0.26 ± 0.161
CysAsp: 0.52 ± 0.322
CysGlu: 1.819 ± 1.824
CysPhe: 1.299 ± 1.303
CysGly: 2.598 ± 2.229
CysHis: 1.039 ± 1.042
CysIle: 2.858 ± 1.42
CysLys: 2.078 ± 1.003
CysLeu: 2.858 ± 1.42
CysMet: 0.52 ± 0.179
CysAsn: 1.819 ± 0.762
CysPro: 1.299 ± 0.587
CysGln: 1.299 ± 0.587
CysArg: 0.26 ± 0.261
CysSer: 1.559 ± 0.44
CysThr: 2.598 ± 1.505
CysVal: 0.779 ± 0.782
CysTrp: 0.0 ± 0.0
CysTyr: 0.779 ± 0.417
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.039 ± 0.644
AspCys: 1.299 ± 0.93
AspAsp: 2.858 ± 0.83
AspGlu: 3.637 ± 0.792
AspPhe: 4.417 ± 1.606
AspGly: 1.559 ± 0.638
AspHis: 0.779 ± 0.417
AspIle: 6.755 ± 0.716
AspLys: 4.157 ± 1.364
AspLeu: 5.196 ± 1.203
AspMet: 1.039 ± 0.644
AspAsn: 5.196 ± 1.47
AspPro: 1.819 ± 2.048
AspGln: 1.299 ± 0.368
AspArg: 1.299 ± 0.805
AspSer: 2.598 ± 0.766
AspThr: 3.637 ± 1.376
AspVal: 2.858 ± 0.512
AspTrp: 0.0 ± 0.0
AspTyr: 2.858 ± 1.429
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.598 ± 1.713
GluCys: 1.559 ± 1.563
GluAsp: 2.598 ± 0.766
GluGlu: 2.078 ± 0.67
GluPhe: 3.637 ± 1.101
GluGly: 2.078 ± 0.718
GluHis: 2.338 ± 0.527
GluIle: 7.794 ± 2.312
GluLys: 4.677 ± 1.439
GluLeu: 6.755 ± 0.478
GluMet: 1.559 ± 0.848
GluAsn: 3.118 ± 0.879
GluPro: 2.858 ± 1.123
GluGln: 2.858 ± 1.194
GluArg: 2.598 ± 0.766
GluSer: 3.637 ± 1.07
GluThr: 3.378 ± 1.293
GluVal: 2.078 ± 4.222
GluTrp: 0.26 ± 0.161
GluTyr: 2.858 ± 0.921
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.819 ± 0.535
PheCys: 1.299 ± 0.587
PheAsp: 2.338 ± 1.872
PheGlu: 4.677 ± 1.439
PhePhe: 2.338 ± 0.993
PheGly: 2.338 ± 1.744
PheHis: 0.779 ± 0.417
PheIle: 4.677 ± 1.356
PheLys: 4.417 ± 0.421
PheLeu: 4.417 ± 0.421
PheMet: 0.52 ± 2.211
PheAsn: 1.819 ± 0.794
PhePro: 1.559 ± 0.638
PheGln: 0.779 ± 0.22
PheArg: 2.078 ± 1.288
PheSer: 2.338 ± 1.11
PheThr: 4.417 ± 0.766
PheVal: 1.819 ± 0.708
PheTrp: 0.52 ± 0.322
PheTyr: 2.598 ± 0.743
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.779 ± 0.958
GlyCys: 2.338 ± 1.252
GlyAsp: 2.598 ± 1.633
GlyGlu: 2.338 ± 0.659
GlyPhe: 2.078 ± 0.579
GlyGly: 1.039 ± 0.359
GlyHis: 0.26 ± 0.161
GlyIle: 2.858 ± 1.558
GlyLys: 1.819 ± 0.762
GlyLeu: 3.378 ± 0.334
GlyMet: 1.299 ± 0.587
GlyAsn: 3.378 ± 1.293
GlyPro: 1.039 ± 0.359
GlyGln: 1.039 ± 0.926
GlyArg: 0.52 ± 0.322
GlySer: 2.858 ± 1.291
GlyThr: 2.338 ± 1.602
GlyVal: 2.338 ± 1.609
GlyTrp: 1.039 ± 0.672
GlyTyr: 1.559 ± 3.128
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.299 ± 0.587
HisCys: 0.26 ± 0.261
HisAsp: 1.039 ± 0.359
HisGlu: 1.299 ± 0.587
HisPhe: 1.299 ± 0.486
HisGly: 2.338 ± 1.699
HisHis: 1.299 ± 0.368
HisIle: 2.858 ± 0.794
HisLys: 2.598 ± 0.897
HisLeu: 2.078 ± 0.579
HisMet: 0.52 ± 0.521
HisAsn: 2.338 ± 0.691
HisPro: 0.779 ± 0.22
HisGln: 0.26 ± 1.106
HisArg: 0.779 ± 0.958
HisSer: 2.858 ± 1.42
HisThr: 1.559 ± 0.44
HisVal: 1.559 ± 0.538
HisTrp: 0.0 ± 0.0
HisTyr: 0.779 ± 0.483
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.196 ± 1.532
IleCys: 3.378 ± 1.59
IleAsp: 6.755 ± 0.881
IleGlu: 7.275 ± 2.078
IlePhe: 3.378 ± 1.293
IleGly: 3.897 ± 1.449
IleHis: 4.417 ± 0.766
IleIle: 8.574 ± 3.939
IleLys: 8.314 ± 0.243
IleLeu: 10.133 ± 0.898
IleMet: 2.078 ± 0.377
IleAsn: 5.196 ± 1.449
IlePro: 2.338 ± 0.659
IleGln: 3.378 ± 0.944
IleArg: 2.338 ± 0.708
IleSer: 6.235 ± 1.172
IleThr: 5.456 ± 1.539
IleVal: 4.157 ± 1.699
IleTrp: 0.779 ± 0.483
IleTyr: 3.378 ± 0.283
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.417 ± 5.891
LysCys: 2.338 ± 1.602
LysAsp: 4.417 ± 0.766
LysGlu: 5.456 ± 1.429
LysPhe: 2.858 ± 1.697
LysGly: 4.417 ± 0.374
LysHis: 2.078 ± 0.67
LysIle: 8.314 ± 1.275
LysLys: 6.495 ± 1.004
LysLeu: 8.574 ± 1.259
LysMet: 1.559 ± 0.575
LysAsn: 4.936 ± 0.893
LysPro: 2.598 ± 1.175
LysGln: 3.637 ± 0.439
LysArg: 3.118 ± 1.312
LysSer: 5.456 ± 0.399
LysThr: 5.456 ± 1.539
LysVal: 2.858 ± 0.889
LysTrp: 0.52 ± 0.322
LysTyr: 4.157 ± 1.697
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.456 ± 1.051
LeuCys: 2.598 ± 1.505
LeuAsp: 6.235 ± 2.28
LeuGlu: 8.574 ± 2.018
LeuPhe: 4.677 ± 1.439
LeuGly: 2.338 ± 0.624
LeuHis: 1.819 ± 0.688
LeuIle: 9.613 ± 0.774
LeuLys: 8.054 ± 1.958
LeuLeu: 9.353 ± 2.638
LeuMet: 2.858 ± 0.794
LeuAsn: 6.495 ± 0.905
LeuPro: 3.637 ± 1.376
LeuGln: 4.157 ± 1.435
LeuArg: 3.897 ± 1.783
LeuSer: 9.093 ± 2.193
LeuThr: 5.196 ± 0.529
LeuVal: 2.598 ± 0.972
LeuTrp: 0.26 ± 0.161
LeuTyr: 4.936 ± 0.214
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.078 ± 0.603
MetCys: 0.26 ± 0.161
MetAsp: 1.299 ± 0.856
MetGlu: 1.819 ± 0.794
MetPhe: 1.039 ± 2.111
MetGly: 1.039 ± 0.359
MetHis: 1.299 ± 0.587
MetIle: 1.559 ± 0.538
MetLys: 1.819 ± 2.048
MetLeu: 2.338 ± 0.708
MetMet: 1.299 ± 0.486
MetAsn: 0.779 ± 0.22
MetPro: 1.299 ± 0.486
MetGln: 1.039 ± 0.341
MetArg: 1.819 ± 1.162
MetSer: 2.598 ± 1.713
MetThr: 0.779 ± 0.22
MetVal: 1.559 ± 1.069
MetTrp: 0.0 ± 0.0
MetTyr: 0.779 ± 0.22
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.378 ± 0.983
AsnCys: 2.078 ± 1.709
AsnAsp: 3.637 ± 1.309
AsnGlu: 3.637 ± 0.164
AsnPhe: 3.637 ± 1.012
AsnGly: 1.559 ± 0.538
AsnHis: 1.559 ± 0.638
AsnIle: 5.456 ± 1.539
AsnLys: 3.378 ± 1.061
AsnLeu: 8.054 ± 1.16
AsnMet: 1.299 ± 0.486
AsnAsn: 3.637 ± 3.691
AsnPro: 1.819 ± 0.551
AsnGln: 3.118 ± 1.277
AsnArg: 2.078 ± 0.603
AsnSer: 3.637 ± 1.012
AsnThr: 2.078 ± 0.603
AsnVal: 2.338 ± 1.609
AsnTrp: 0.52 ± 0.179
AsnTyr: 4.157 ± 1.839
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.338 ± 0.824
ProCys: 0.0 ± 0.0
ProAsp: 2.078 ± 1.288
ProGlu: 2.598 ± 0.494
ProPhe: 1.039 ± 0.359
ProGly: 2.078 ± 1.851
ProHis: 0.779 ± 0.417
ProIle: 2.598 ± 0.735
ProLys: 1.559 ± 0.741
ProLeu: 3.637 ± 0.414
ProMet: 0.779 ± 0.22
ProAsn: 2.598 ± 0.735
ProPro: 0.0 ± 0.0
ProGln: 0.26 ± 0.261
ProArg: 0.52 ± 0.179
ProSer: 0.52 ± 0.322
ProThr: 1.559 ± 0.538
ProVal: 2.338 ± 0.624
ProTrp: 0.26 ± 1.106
ProTyr: 2.338 ± 1.11
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.559 ± 3.104
GlnCys: 0.52 ± 0.179
GlnAsp: 1.039 ± 1.026
GlnGlu: 1.819 ± 1.088
GlnPhe: 2.338 ± 0.993
GlnGly: 0.779 ± 0.483
GlnHis: 0.779 ± 0.417
GlnIle: 3.378 ± 1.061
GlnLys: 3.897 ± 1.453
GlnLeu: 3.118 ± 0.761
GlnMet: 0.779 ± 0.417
GlnAsn: 2.078 ± 0.67
GlnPro: 0.779 ± 0.417
GlnGln: 1.299 ± 0.368
GlnArg: 2.598 ± 1.086
GlnSer: 1.819 ± 0.762
GlnThr: 3.118 ± 1.023
GlnVal: 1.819 ± 0.87
GlnTrp: 0.779 ± 0.22
GlnTyr: 2.078 ± 0.767
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.819 ± 3.131
ArgCys: 0.779 ± 0.22
ArgAsp: 1.819 ± 1.127
ArgGlu: 2.338 ± 0.824
ArgPhe: 1.299 ± 0.805
ArgGly: 0.26 ± 0.161
ArgHis: 1.559 ± 0.44
ArgIle: 2.078 ± 0.682
ArgLys: 1.819 ± 0.551
ArgLeu: 3.637 ± 0.414
ArgMet: 1.299 ± 0.856
ArgAsn: 2.338 ± 1.11
ArgPro: 0.26 ± 0.261
ArgGln: 1.819 ± 3.037
ArgArg: 1.299 ± 0.93
ArgSer: 3.378 ± 1.749
ArgThr: 2.338 ± 1.11
ArgVal: 2.338 ± 0.708
ArgTrp: 0.52 ± 0.179
ArgTyr: 1.819 ± 0.87
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.637 ± 0.439
SerCys: 3.118 ± 2.017
SerAsp: 5.456 ± 1.844
SerGlu: 2.858 ± 1.697
SerPhe: 2.598 ± 0.743
SerGly: 2.338 ± 1.252
SerHis: 1.299 ± 0.856
SerIle: 7.794 ± 2.667
SerLys: 6.495 ± 0.631
SerLeu: 7.275 ± 1.201
SerMet: 1.559 ± 0.774
SerAsn: 2.338 ± 0.527
SerPro: 1.299 ± 0.805
SerGln: 1.559 ± 0.44
SerArg: 3.378 ± 0.944
SerSer: 4.417 ± 0.814
SerThr: 3.637 ± 0.414
SerVal: 3.637 ± 0.414
SerTrp: 0.26 ± 0.161
SerTyr: 3.378 ± 0.944
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.819 ± 0.688
ThrCys: 1.299 ± 0.587
ThrAsp: 3.897 ± 1.103
ThrGlu: 1.559 ± 0.44
ThrPhe: 3.637 ± 2.324
ThrGly: 2.598 ± 1.861
ThrHis: 1.039 ± 0.359
ThrIle: 6.495 ± 1.806
ThrLys: 6.235 ± 0.847
ThrLeu: 4.936 ± 0.214
ThrMet: 2.858 ± 0.83
ThrAsn: 4.417 ± 1.23
ThrPro: 2.598 ± 0.492
ThrGln: 1.559 ± 0.835
ThrArg: 1.299 ± 0.368
ThrSer: 4.157 ± 1.213
ThrThr: 3.118 ± 0.431
ThrVal: 2.078 ± 1.345
ThrTrp: 1.039 ± 0.894
ThrTyr: 2.858 ± 1.115
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.299 ± 1.999
ValCys: 1.039 ± 0.359
ValAsp: 1.819 ± 0.535
ValGlu: 1.819 ± 0.688
ValPhe: 2.338 ± 0.527
ValGly: 2.078 ± 0.67
ValHis: 2.078 ± 0.718
ValIle: 2.858 ± 1.291
ValLys: 4.417 ± 1.653
ValLeu: 5.456 ± 1.522
ValMet: 1.039 ± 0.359
ValAsn: 2.858 ± 1.115
ValPro: 1.299 ± 0.368
ValGln: 1.819 ± 0.916
ValArg: 0.779 ± 0.22
ValSer: 4.157 ± 0.275
ValThr: 2.598 ± 1.75
ValVal: 3.637 ± 1.59
ValTrp: 0.52 ± 2.211
ValTyr: 1.299 ± 0.587
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.039 ± 0.359
TrpCys: 0.0 ± 0.0
TrpAsp: 0.26 ± 0.161
TrpGlu: 1.039 ± 0.894
TrpPhe: 0.52 ± 0.179
TrpGly: 0.26 ± 0.261
TrpHis: 0.0 ± 0.0
TrpIle: 0.26 ± 0.261
TrpLys: 0.26 ± 1.106
TrpLeu: 1.299 ± 0.368
TrpMet: 0.26 ± 1.106
TrpAsn: 0.52 ± 0.322
TrpPro: 0.0 ± 0.0
TrpGln: 0.26 ± 0.161
TrpArg: 0.0 ± 0.0
TrpSer: 1.039 ± 0.644
TrpThr: 0.52 ± 1.056
TrpVal: 0.26 ± 0.161
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.378 ± 1.426
TyrCys: 1.039 ± 0.672
TyrAsp: 1.039 ± 0.341
TyrGlu: 2.078 ± 0.767
TyrPhe: 1.819 ± 0.688
TyrGly: 0.779 ± 0.417
TyrHis: 0.52 ± 0.179
TyrIle: 4.417 ± 0.058
TyrLys: 6.235 ± 3.009
TyrLeu: 4.936 ± 1.173
TyrMet: 1.819 ± 0.87
TyrAsn: 2.338 ± 0.708
TyrPro: 1.819 ± 1.866
TyrGln: 2.078 ± 0.603
TyrArg: 1.819 ± 1.162
TyrSer: 2.598 ± 0.735
TyrThr: 3.378 ± 0.673
TyrVal: 1.819 ± 0.794
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.779 ± 0.417
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3850 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
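One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the fixed seed, the use of the population standard deviation, and the toy sequences are all assumptions; the page does not describe the exact procedure used by Proteome-pI.

```python
import random
from collections import Counter
from statistics import pstdev

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled sets of whole proteins."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return pstdev(estimates)

# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MKKLAKK", "MLLKKA", "MAAAKL"]
err_kk = bootstrap_error(toy, "KK")
```

With only a handful of proteins, individual resamples can differ strongly from one another, which may be why some errors in the table are as large as the values themselves.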

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski