Amino acid dipepetide frequency for Streptococcus phage Javan64

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.901AlaAla: 4.901 ± 1.11
0.085AlaCys: 0.085 ± 0.093
3.718AlaAsp: 3.718 ± 0.476
6.253AlaGlu: 6.253 ± 0.826
2.535AlaPhe: 2.535 ± 0.509
5.662AlaGly: 5.662 ± 0.75
1.014AlaHis: 1.014 ± 0.292
4.901AlaIle: 4.901 ± 0.786
4.901AlaLys: 4.901 ± 0.861
6.76AlaLeu: 6.76 ± 1.149
1.69AlaMet: 1.69 ± 0.482
4.394AlaAsn: 4.394 ± 0.934
1.69AlaPro: 1.69 ± 0.408
2.704AlaGln: 2.704 ± 0.962
3.718AlaArg: 3.718 ± 0.808
5.577AlaSer: 5.577 ± 0.625
4.141AlaThr: 4.141 ± 0.964
4.056AlaVal: 4.056 ± 0.885
1.014AlaTrp: 1.014 ± 0.382
2.535AlaTyr: 2.535 ± 0.471
0.0AlaXaa: 0.0 ± 0.0
Cys
0.676CysAla: 0.676 ± 0.266
0.0CysCys: 0.0 ± 0.0
0.085CysAsp: 0.085 ± 0.082
0.676CysGlu: 0.676 ± 0.225
0.254CysPhe: 0.254 ± 0.14
0.338CysGly: 0.338 ± 0.172
0.0CysHis: 0.0 ± 0.0
0.169CysIle: 0.169 ± 0.127
0.423CysLys: 0.423 ± 0.184
0.169CysLeu: 0.169 ± 0.126
0.169CysMet: 0.169 ± 0.121
0.423CysAsn: 0.423 ± 0.237
0.169CysPro: 0.169 ± 0.126
0.254CysGln: 0.254 ± 0.147
0.085CysArg: 0.085 ± 0.066
0.592CysSer: 0.592 ± 0.233
0.254CysThr: 0.254 ± 0.135
0.338CysVal: 0.338 ± 0.167
0.169CysTrp: 0.169 ± 0.112
0.507CysTyr: 0.507 ± 0.267
0.0CysXaa: 0.0 ± 0.0
Asp
4.901AspAla: 4.901 ± 1.068
0.507AspCys: 0.507 ± 0.202
3.296AspAsp: 3.296 ± 0.874
4.141AspGlu: 4.141 ± 0.626
3.549AspPhe: 3.549 ± 0.55
3.803AspGly: 3.803 ± 0.669
0.507AspHis: 0.507 ± 0.185
3.972AspIle: 3.972 ± 0.655
5.493AspLys: 5.493 ± 0.742
4.563AspLeu: 4.563 ± 0.482
1.606AspMet: 1.606 ± 0.387
3.718AspAsn: 3.718 ± 0.577
1.099AspPro: 1.099 ± 0.309
1.437AspGln: 1.437 ± 0.347
2.535AspArg: 2.535 ± 0.424
2.62AspSer: 2.62 ± 0.495
3.972AspThr: 3.972 ± 0.546
2.197AspVal: 2.197 ± 0.408
1.437AspTrp: 1.437 ± 0.298
2.958AspTyr: 2.958 ± 0.613
0.0AspXaa: 0.0 ± 0.0
Glu
5.155GluAla: 5.155 ± 0.821
0.338GluCys: 0.338 ± 0.247
3.296GluAsp: 3.296 ± 0.557
5.324GluGlu: 5.324 ± 0.781
2.789GluPhe: 2.789 ± 0.441
3.549GluGly: 3.549 ± 0.467
1.014GluHis: 1.014 ± 0.314
6.338GluIle: 6.338 ± 0.731
6.929GluLys: 6.929 ± 0.928
7.267GluLeu: 7.267 ± 0.763
2.366GluMet: 2.366 ± 0.45
3.803GluAsn: 3.803 ± 0.456
1.606GluPro: 1.606 ± 0.512
3.042GluGln: 3.042 ± 0.584
3.718GluArg: 3.718 ± 0.687
3.211GluSer: 3.211 ± 0.565
3.887GluThr: 3.887 ± 0.558
4.901GluVal: 4.901 ± 0.835
0.845GluTrp: 0.845 ± 0.218
1.775GluTyr: 1.775 ± 0.365
0.0GluXaa: 0.0 ± 0.0
Phe
2.282PheAla: 2.282 ± 0.52
0.676PheCys: 0.676 ± 0.256
2.535PheAsp: 2.535 ± 0.508
3.296PheGlu: 3.296 ± 0.591
1.521PhePhe: 1.521 ± 0.347
2.958PheGly: 2.958 ± 0.567
0.592PheHis: 0.592 ± 0.218
2.62PheIle: 2.62 ± 0.454
3.634PheLys: 3.634 ± 0.625
3.972PheLeu: 3.972 ± 0.648
1.099PheMet: 1.099 ± 0.391
2.535PheAsn: 2.535 ± 0.448
1.521PhePro: 1.521 ± 0.408
1.521PheGln: 1.521 ± 0.453
1.521PheArg: 1.521 ± 0.407
2.789PheSer: 2.789 ± 0.481
2.789PheThr: 2.789 ± 0.608
2.366PheVal: 2.366 ± 0.499
0.507PheTrp: 0.507 ± 0.235
2.113PheTyr: 2.113 ± 0.461
0.0PheXaa: 0.0 ± 0.0
Gly
4.31GlyAla: 4.31 ± 1.017
0.423GlyCys: 0.423 ± 0.234
4.732GlyAsp: 4.732 ± 1.127
3.887GlyGlu: 3.887 ± 0.518
3.465GlyPhe: 3.465 ± 0.709
3.887GlyGly: 3.887 ± 0.698
0.845GlyHis: 0.845 ± 0.361
4.901GlyIle: 4.901 ± 0.798
5.662GlyLys: 5.662 ± 0.569
4.817GlyLeu: 4.817 ± 0.832
1.859GlyMet: 1.859 ± 0.459
4.732GlyAsn: 4.732 ± 0.928
0.338GlyPro: 0.338 ± 0.233
3.549GlyGln: 3.549 ± 0.525
2.113GlyArg: 2.113 ± 0.53
3.549GlySer: 3.549 ± 0.616
4.141GlyThr: 4.141 ± 0.559
3.549GlyVal: 3.549 ± 0.549
1.352GlyTrp: 1.352 ± 0.321
3.296GlyTyr: 3.296 ± 0.467
0.0GlyXaa: 0.0 ± 0.0
His
1.014HisAla: 1.014 ± 0.288
0.085HisCys: 0.085 ± 0.083
1.268HisAsp: 1.268 ± 0.313
0.507HisGlu: 0.507 ± 0.264
0.507HisPhe: 0.507 ± 0.265
0.93HisGly: 0.93 ± 0.233
0.169HisHis: 0.169 ± 0.11
0.93HisIle: 0.93 ± 0.335
0.761HisLys: 0.761 ± 0.209
1.099HisLeu: 1.099 ± 0.288
0.423HisMet: 0.423 ± 0.197
1.099HisAsn: 1.099 ± 0.397
0.676HisPro: 0.676 ± 0.25
0.676HisGln: 0.676 ± 0.22
0.507HisArg: 0.507 ± 0.167
1.183HisSer: 1.183 ± 0.351
0.592HisThr: 0.592 ± 0.292
0.845HisVal: 0.845 ± 0.243
0.085HisTrp: 0.085 ± 0.1
0.423HisTyr: 0.423 ± 0.172
0.0HisXaa: 0.0 ± 0.0
Ile
5.408IleAla: 5.408 ± 0.728
0.423IleCys: 0.423 ± 0.164
5.408IleAsp: 5.408 ± 0.794
4.817IleGlu: 4.817 ± 0.649
2.028IlePhe: 2.028 ± 0.522
4.479IleGly: 4.479 ± 0.467
1.183IleHis: 1.183 ± 0.383
4.056IleIle: 4.056 ± 0.724
7.267IleLys: 7.267 ± 1.023
4.563IleLeu: 4.563 ± 0.647
1.099IleMet: 1.099 ± 0.267
4.479IleAsn: 4.479 ± 0.599
2.535IlePro: 2.535 ± 0.421
2.535IleGln: 2.535 ± 0.411
2.282IleArg: 2.282 ± 0.334
4.225IleSer: 4.225 ± 0.633
3.887IleThr: 3.887 ± 0.725
4.31IleVal: 4.31 ± 0.762
0.93IleTrp: 0.93 ± 0.307
3.042IleTyr: 3.042 ± 0.643
0.0IleXaa: 0.0 ± 0.0
Lys
7.521LysAla: 7.521 ± 1.29
0.169LysCys: 0.169 ± 0.119
5.155LysAsp: 5.155 ± 0.628
7.352LysGlu: 7.352 ± 0.855
3.211LysPhe: 3.211 ± 0.517
5.324LysGly: 5.324 ± 0.589
1.606LysHis: 1.606 ± 0.358
5.239LysIle: 5.239 ± 0.644
7.521LysLys: 7.521 ± 0.967
6.76LysLeu: 6.76 ± 0.845
2.366LysMet: 2.366 ± 0.469
4.394LysAsn: 4.394 ± 0.601
2.451LysPro: 2.451 ± 0.452
4.056LysGln: 4.056 ± 0.528
4.901LysArg: 4.901 ± 0.726
6.338LysSer: 6.338 ± 0.702
5.831LysThr: 5.831 ± 0.723
4.732LysVal: 4.732 ± 0.825
1.099LysTrp: 1.099 ± 0.265
2.113LysTyr: 2.113 ± 0.397
0.0LysXaa: 0.0 ± 0.0
Leu
8.028LeuAla: 8.028 ± 1.07
0.423LeuCys: 0.423 ± 0.201
4.479LeuAsp: 4.479 ± 0.604
6.422LeuGlu: 6.422 ± 0.958
4.056LeuPhe: 4.056 ± 0.548
5.155LeuGly: 5.155 ± 0.818
0.761LeuHis: 0.761 ± 0.3
4.817LeuIle: 4.817 ± 0.629
7.183LeuLys: 7.183 ± 0.959
6.422LeuLeu: 6.422 ± 0.779
1.69LeuMet: 1.69 ± 0.404
5.577LeuAsn: 5.577 ± 0.745
2.451LeuPro: 2.451 ± 0.428
2.535LeuGln: 2.535 ± 0.541
3.127LeuArg: 3.127 ± 0.543
5.746LeuSer: 5.746 ± 0.642
6.253LeuThr: 6.253 ± 0.751
3.972LeuVal: 3.972 ± 0.433
0.676LeuTrp: 0.676 ± 0.185
2.028LeuTyr: 2.028 ± 0.477
0.0LeuXaa: 0.0 ± 0.0
Met
0.93MetAla: 0.93 ± 0.243
0.169MetCys: 0.169 ± 0.112
1.183MetAsp: 1.183 ± 0.288
2.197MetGlu: 2.197 ± 0.418
0.93MetPhe: 0.93 ± 0.279
1.014MetGly: 1.014 ± 0.28
0.423MetHis: 0.423 ± 0.181
1.606MetIle: 1.606 ± 0.46
2.62MetLys: 2.62 ± 0.502
2.535MetLeu: 2.535 ± 0.509
0.676MetMet: 0.676 ± 0.229
1.099MetAsn: 1.099 ± 0.199
1.014MetPro: 1.014 ± 0.295
0.845MetGln: 0.845 ± 0.263
0.93MetArg: 0.93 ± 0.272
2.113MetSer: 2.113 ± 0.359
1.69MetThr: 1.69 ± 0.459
0.93MetVal: 0.93 ± 0.277
0.254MetTrp: 0.254 ± 0.132
0.676MetTyr: 0.676 ± 0.248
0.0MetXaa: 0.0 ± 0.0
Asn
4.732AsnAla: 4.732 ± 1.133
0.169AsnCys: 0.169 ± 0.117
2.958AsnAsp: 2.958 ± 0.744
3.972AsnGlu: 3.972 ± 0.539
2.958AsnPhe: 2.958 ± 0.612
5.408AsnGly: 5.408 ± 0.812
0.93AsnHis: 0.93 ± 0.25
3.718AsnIle: 3.718 ± 0.753
4.648AsnLys: 4.648 ± 0.577
4.563AsnLeu: 4.563 ± 0.657
0.845AsnMet: 0.845 ± 0.301
3.465AsnAsn: 3.465 ± 0.735
2.451AsnPro: 2.451 ± 0.543
3.211AsnGln: 3.211 ± 0.646
2.028AsnArg: 2.028 ± 0.478
3.803AsnSer: 3.803 ± 0.595
3.296AsnThr: 3.296 ± 0.445
4.225AsnVal: 4.225 ± 0.382
0.507AsnTrp: 0.507 ± 0.244
2.535AsnTyr: 2.535 ± 0.599
0.0AsnXaa: 0.0 ± 0.0
Pro
1.352ProAla: 1.352 ± 0.341
0.254ProCys: 0.254 ± 0.18
1.775ProAsp: 1.775 ± 0.524
1.606ProGlu: 1.606 ± 0.346
1.521ProPhe: 1.521 ± 0.363
0.93ProGly: 0.93 ± 0.274
0.507ProHis: 0.507 ± 0.223
2.451ProIle: 2.451 ± 0.39
3.803ProLys: 3.803 ± 0.89
1.775ProLeu: 1.775 ± 0.236
0.507ProMet: 0.507 ± 0.292
2.028ProAsn: 2.028 ± 0.51
0.676ProPro: 0.676 ± 0.249
1.183ProGln: 1.183 ± 0.31
1.859ProArg: 1.859 ± 0.468
1.183ProSer: 1.183 ± 0.407
1.775ProThr: 1.775 ± 0.326
1.606ProVal: 1.606 ± 0.496
0.169ProTrp: 0.169 ± 0.123
1.268ProTyr: 1.268 ± 0.377
0.0ProXaa: 0.0 ± 0.0
Gln
3.042GlnAla: 3.042 ± 0.648
0.338GlnCys: 0.338 ± 0.228
1.859GlnAsp: 1.859 ± 0.354
2.958GlnGlu: 2.958 ± 0.52
1.69GlnPhe: 1.69 ± 0.291
2.535GlnGly: 2.535 ± 0.536
0.592GlnHis: 0.592 ± 0.186
2.535GlnIle: 2.535 ± 0.43
4.394GlnLys: 4.394 ± 0.653
2.704GlnLeu: 2.704 ± 0.375
1.099GlnMet: 1.099 ± 0.392
1.944GlnAsn: 1.944 ± 0.361
1.352GlnPro: 1.352 ± 0.545
2.958GlnGln: 2.958 ± 0.544
1.944GlnArg: 1.944 ± 0.399
2.535GlnSer: 2.535 ± 0.609
2.451GlnThr: 2.451 ± 0.405
2.789GlnVal: 2.789 ± 0.588
0.592GlnTrp: 0.592 ± 0.174
1.69GlnTyr: 1.69 ± 0.437
0.0GlnXaa: 0.0 ± 0.0
Arg
2.704ArgAla: 2.704 ± 0.493
0.423ArgCys: 0.423 ± 0.246
2.704ArgAsp: 2.704 ± 0.558
3.211ArgGlu: 3.211 ± 0.5
1.352ArgPhe: 1.352 ± 0.35
2.366ArgGly: 2.366 ± 0.636
0.507ArgHis: 0.507 ± 0.189
3.38ArgIle: 3.38 ± 0.607
3.549ArgLys: 3.549 ± 0.688
4.732ArgLeu: 4.732 ± 0.584
1.014ArgMet: 1.014 ± 0.276
2.535ArgAsn: 2.535 ± 0.385
1.521ArgPro: 1.521 ± 0.53
2.451ArgGln: 2.451 ± 0.448
2.282ArgArg: 2.282 ± 0.35
2.113ArgSer: 2.113 ± 0.399
1.775ArgThr: 1.775 ± 0.436
2.704ArgVal: 2.704 ± 0.614
0.845ArgTrp: 0.845 ± 0.313
1.859ArgTyr: 1.859 ± 0.465
0.0ArgXaa: 0.0 ± 0.0
Ser
3.803SerAla: 3.803 ± 0.676
0.338SerCys: 0.338 ± 0.251
4.394SerAsp: 4.394 ± 0.599
3.887SerGlu: 3.887 ± 0.519
2.704SerPhe: 2.704 ± 0.461
4.225SerGly: 4.225 ± 0.86
0.845SerHis: 0.845 ± 0.281
4.817SerIle: 4.817 ± 0.81
5.831SerLys: 5.831 ± 0.912
4.479SerLeu: 4.479 ± 0.567
1.859SerMet: 1.859 ± 0.389
3.887SerAsn: 3.887 ± 0.607
1.352SerPro: 1.352 ± 0.393
2.113SerGln: 2.113 ± 0.536
2.789SerArg: 2.789 ± 0.402
4.648SerSer: 4.648 ± 0.744
2.789SerThr: 2.789 ± 0.412
3.803SerVal: 3.803 ± 0.767
0.93SerTrp: 0.93 ± 0.304
2.789SerTyr: 2.789 ± 0.52
0.0SerXaa: 0.0 ± 0.0
Thr
4.394ThrAla: 4.394 ± 0.952
0.169ThrCys: 0.169 ± 0.127
2.535ThrAsp: 2.535 ± 0.569
3.042ThrGlu: 3.042 ± 0.56
2.535ThrPhe: 2.535 ± 0.654
4.479ThrGly: 4.479 ± 0.923
1.014ThrHis: 1.014 ± 0.361
4.648ThrIle: 4.648 ± 0.552
4.986ThrLys: 4.986 ± 0.666
6.253ThrLeu: 6.253 ± 0.605
0.845ThrMet: 0.845 ± 0.231
3.465ThrAsn: 3.465 ± 0.657
2.451ThrPro: 2.451 ± 0.635
2.789ThrGln: 2.789 ± 0.554
1.775ThrArg: 1.775 ± 0.364
3.634ThrSer: 3.634 ± 0.558
3.803ThrThr: 3.803 ± 0.777
3.634ThrVal: 3.634 ± 0.684
0.676ThrTrp: 0.676 ± 0.232
2.704ThrTyr: 2.704 ± 0.623
0.0ThrXaa: 0.0 ± 0.0
Val
4.31ValAla: 4.31 ± 0.587
0.423ValCys: 0.423 ± 0.197
3.296ValAsp: 3.296 ± 0.537
4.141ValGlu: 4.141 ± 0.584
3.296ValPhe: 3.296 ± 0.549
3.465ValGly: 3.465 ± 0.41
0.676ValHis: 0.676 ± 0.18
3.803ValIle: 3.803 ± 0.552
4.648ValLys: 4.648 ± 0.742
4.563ValLeu: 4.563 ± 0.696
1.437ValMet: 1.437 ± 0.318
3.803ValAsn: 3.803 ± 0.523
1.859ValPro: 1.859 ± 0.386
1.775ValGln: 1.775 ± 0.455
2.028ValArg: 2.028 ± 0.548
3.718ValSer: 3.718 ± 0.709
4.31ValThr: 4.31 ± 0.58
4.817ValVal: 4.817 ± 0.553
0.423ValTrp: 0.423 ± 0.157
2.62ValTyr: 2.62 ± 0.676
0.0ValXaa: 0.0 ± 0.0
Trp
0.592TrpAla: 0.592 ± 0.178
0.085TrpCys: 0.085 ± 0.082
0.676TrpAsp: 0.676 ± 0.265
0.845TrpGlu: 0.845 ± 0.217
0.592TrpPhe: 0.592 ± 0.257
1.352TrpGly: 1.352 ± 0.412
0.0TrpHis: 0.0 ± 0.0
1.099TrpIle: 1.099 ± 0.283
1.268TrpLys: 1.268 ± 0.362
1.268TrpLeu: 1.268 ± 0.34
0.0TrpMet: 0.0 ± 0.0
0.845TrpAsn: 0.845 ± 0.305
0.085TrpPro: 0.085 ± 0.092
0.592TrpGln: 0.592 ± 0.223
0.845TrpArg: 0.845 ± 0.255
0.761TrpSer: 0.761 ± 0.261
0.423TrpThr: 0.423 ± 0.233
1.014TrpVal: 1.014 ± 0.305
0.254TrpTrp: 0.254 ± 0.166
0.507TrpTyr: 0.507 ± 0.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.197TyrAla: 2.197 ± 0.452
0.254TyrCys: 0.254 ± 0.147
2.958TyrAsp: 2.958 ± 0.573
2.62TyrGlu: 2.62 ± 0.611
1.437TyrPhe: 1.437 ± 0.396
3.549TyrGly: 3.549 ± 0.973
0.592TyrHis: 0.592 ± 0.222
2.958TyrIle: 2.958 ± 0.659
2.62TyrLys: 2.62 ± 0.553
2.535TyrLeu: 2.535 ± 0.557
1.099TyrMet: 1.099 ± 0.298
2.197TyrAsn: 2.197 ± 0.469
0.93TyrPro: 0.93 ± 0.331
1.775TyrGln: 1.775 ± 0.388
3.042TyrArg: 3.042 ± 0.504
1.944TyrSer: 1.944 ± 0.463
1.775TyrThr: 1.775 ± 0.461
2.62TyrVal: 2.62 ± 0.413
0.338TyrTrp: 0.338 ± 0.157
1.099TyrTyr: 1.099 ± 0.382
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (11835 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski