Amino acid dipepetide frequency for Streptococcus phage SpGS-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.392AlaAla: 4.392 ± 1.212
0.253AlaCys: 0.253 ± 0.143
4.984AlaAsp: 4.984 ± 0.953
6.166AlaGlu: 6.166 ± 0.848
2.956AlaPhe: 2.956 ± 0.414
4.139AlaGly: 4.139 ± 0.655
0.676AlaHis: 0.676 ± 0.208
5.659AlaIle: 5.659 ± 0.696
4.984AlaLys: 4.984 ± 0.633
5.744AlaLeu: 5.744 ± 0.973
2.196AlaMet: 2.196 ± 0.561
3.97AlaAsn: 3.97 ± 0.545
1.183AlaPro: 1.183 ± 0.297
2.787AlaGln: 2.787 ± 0.549
3.379AlaArg: 3.379 ± 0.602
4.308AlaSer: 4.308 ± 0.729
4.223AlaThr: 4.223 ± 0.761
4.054AlaVal: 4.054 ± 0.635
0.76AlaTrp: 0.76 ± 0.245
2.281AlaTyr: 2.281 ± 0.591
0.0AlaXaa: 0.0 ± 0.0
Cys
0.422CysAla: 0.422 ± 0.2
0.169CysCys: 0.169 ± 0.128
0.169CysAsp: 0.169 ± 0.122
0.845CysGlu: 0.845 ± 0.245
0.253CysPhe: 0.253 ± 0.144
0.422CysGly: 0.422 ± 0.148
0.0CysHis: 0.0 ± 0.0
0.253CysIle: 0.253 ± 0.145
0.422CysLys: 0.422 ± 0.196
0.253CysLeu: 0.253 ± 0.144
0.084CysMet: 0.084 ± 0.073
0.0CysAsn: 0.0 ± 0.0
0.169CysPro: 0.169 ± 0.127
0.253CysGln: 0.253 ± 0.135
0.0CysArg: 0.0 ± 0.0
0.253CysSer: 0.253 ± 0.154
0.084CysThr: 0.084 ± 0.089
0.422CysVal: 0.422 ± 0.161
0.253CysTrp: 0.253 ± 0.14
0.253CysTyr: 0.253 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
3.885AspAla: 3.885 ± 0.67
0.338AspCys: 0.338 ± 0.159
4.054AspAsp: 4.054 ± 0.796
3.97AspGlu: 3.97 ± 0.565
4.477AspPhe: 4.477 ± 0.535
6.757AspGly: 6.757 ± 0.892
0.591AspHis: 0.591 ± 0.183
5.152AspIle: 5.152 ± 0.533
6.166AspLys: 6.166 ± 0.713
4.308AspLeu: 4.308 ± 0.678
1.267AspMet: 1.267 ± 0.339
3.21AspAsn: 3.21 ± 0.531
1.183AspPro: 1.183 ± 0.328
1.689AspGln: 1.689 ± 0.317
3.21AspArg: 3.21 ± 0.472
2.787AspSer: 2.787 ± 0.524
3.632AspThr: 3.632 ± 0.439
3.801AspVal: 3.801 ± 0.711
0.676AspTrp: 0.676 ± 0.223
3.548AspTyr: 3.548 ± 0.549
0.0AspXaa: 0.0 ± 0.0
Glu
4.139GluAla: 4.139 ± 0.547
0.253GluCys: 0.253 ± 0.203
3.801GluAsp: 3.801 ± 0.616
6.335GluGlu: 6.335 ± 0.924
2.956GluPhe: 2.956 ± 0.493
2.787GluGly: 2.787 ± 0.48
1.098GluHis: 1.098 ± 0.278
6.588GluIle: 6.588 ± 0.999
6.419GluLys: 6.419 ± 0.855
7.94GluLeu: 7.94 ± 0.915
2.534GluMet: 2.534 ± 0.602
4.054GluAsn: 4.054 ± 0.588
2.365GluPro: 2.365 ± 0.594
4.392GluGln: 4.392 ± 0.552
3.21GluArg: 3.21 ± 0.565
3.041GluSer: 3.041 ± 0.715
3.548GluThr: 3.548 ± 0.445
4.477GluVal: 4.477 ± 0.634
0.507GluTrp: 0.507 ± 0.191
2.787GluTyr: 2.787 ± 0.567
0.0GluXaa: 0.0 ± 0.0
Phe
2.365PheAla: 2.365 ± 0.416
0.169PheCys: 0.169 ± 0.111
4.561PheAsp: 4.561 ± 0.461
4.646PheGlu: 4.646 ± 0.745
1.52PhePhe: 1.52 ± 0.392
2.281PheGly: 2.281 ± 0.476
0.845PheHis: 0.845 ± 0.262
2.365PheIle: 2.365 ± 0.366
2.618PheLys: 2.618 ± 0.502
2.196PheLeu: 2.196 ± 0.478
0.929PheMet: 0.929 ± 0.305
2.365PheAsn: 2.365 ± 0.425
1.014PhePro: 1.014 ± 0.351
1.943PheGln: 1.943 ± 0.462
1.605PheArg: 1.605 ± 0.386
3.294PheSer: 3.294 ± 0.38
3.294PheThr: 3.294 ± 0.59
2.365PheVal: 2.365 ± 0.448
0.169PheTrp: 0.169 ± 0.144
1.774PheTyr: 1.774 ± 0.467
0.0PheXaa: 0.0 ± 0.0
Gly
4.308GlyAla: 4.308 ± 0.754
0.253GlyCys: 0.253 ± 0.154
3.041GlyAsp: 3.041 ± 0.424
3.632GlyGlu: 3.632 ± 0.399
3.21GlyPhe: 3.21 ± 0.67
5.237GlyGly: 5.237 ± 0.596
1.183GlyHis: 1.183 ± 0.283
4.646GlyIle: 4.646 ± 0.508
4.815GlyLys: 4.815 ± 0.664
4.815GlyLeu: 4.815 ± 0.816
2.281GlyMet: 2.281 ± 0.482
4.223GlyAsn: 4.223 ± 0.687
0.338GlyPro: 0.338 ± 0.177
2.45GlyGln: 2.45 ± 0.489
3.885GlyArg: 3.885 ± 0.604
5.237GlySer: 5.237 ± 0.87
4.815GlyThr: 4.815 ± 0.681
5.406GlyVal: 5.406 ± 0.816
1.014GlyTrp: 1.014 ± 0.539
2.365GlyTyr: 2.365 ± 0.467
0.0GlyXaa: 0.0 ± 0.0
His
0.845HisAla: 0.845 ± 0.233
0.169HisCys: 0.169 ± 0.134
1.098HisAsp: 1.098 ± 0.323
0.929HisGlu: 0.929 ± 0.263
0.676HisPhe: 0.676 ± 0.21
1.014HisGly: 1.014 ± 0.333
0.338HisHis: 0.338 ± 0.159
1.183HisIle: 1.183 ± 0.371
1.014HisLys: 1.014 ± 0.358
1.098HisLeu: 1.098 ± 0.335
0.253HisMet: 0.253 ± 0.143
0.676HisAsn: 0.676 ± 0.292
0.676HisPro: 0.676 ± 0.279
0.422HisGln: 0.422 ± 0.159
0.422HisArg: 0.422 ± 0.17
1.689HisSer: 1.689 ± 0.476
1.098HisThr: 1.098 ± 0.284
0.591HisVal: 0.591 ± 0.237
0.338HisTrp: 0.338 ± 0.138
0.591HisTyr: 0.591 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
5.237IleAla: 5.237 ± 0.688
0.338IleCys: 0.338 ± 0.158
4.73IleAsp: 4.73 ± 0.602
6.504IleGlu: 6.504 ± 0.868
2.281IlePhe: 2.281 ± 0.546
4.984IleGly: 4.984 ± 0.704
1.267IleHis: 1.267 ± 0.268
4.392IleIle: 4.392 ± 0.707
7.518IleLys: 7.518 ± 0.758
3.885IleLeu: 3.885 ± 0.53
1.267IleMet: 1.267 ± 0.319
3.885IleAsn: 3.885 ± 0.46
2.281IlePro: 2.281 ± 0.578
2.618IleGln: 2.618 ± 0.424
2.112IleArg: 2.112 ± 0.404
4.899IleSer: 4.899 ± 0.533
3.717IleThr: 3.717 ± 0.622
4.139IleVal: 4.139 ± 0.674
0.76IleTrp: 0.76 ± 0.279
2.196IleTyr: 2.196 ± 0.446
0.0IleXaa: 0.0 ± 0.0
Lys
6.251LysAla: 6.251 ± 0.863
0.507LysCys: 0.507 ± 0.205
3.801LysAsp: 3.801 ± 0.566
6.757LysGlu: 6.757 ± 0.779
2.45LysPhe: 2.45 ± 0.4
5.997LysGly: 5.997 ± 0.812
1.351LysHis: 1.351 ± 0.397
5.321LysIle: 5.321 ± 0.776
6.926LysLys: 6.926 ± 1.056
6.842LysLeu: 6.842 ± 0.814
1.351LysMet: 1.351 ± 0.309
6.166LysAsn: 6.166 ± 0.695
1.943LysPro: 1.943 ± 0.396
3.885LysGln: 3.885 ± 0.774
3.294LysArg: 3.294 ± 0.534
4.899LysSer: 4.899 ± 0.662
5.575LysThr: 5.575 ± 0.645
5.49LysVal: 5.49 ± 0.564
1.351LysTrp: 1.351 ± 0.265
3.463LysTyr: 3.463 ± 0.559
0.0LysXaa: 0.0 ± 0.0
Leu
6.251LeuAla: 6.251 ± 0.911
0.253LeuCys: 0.253 ± 0.144
5.575LeuAsp: 5.575 ± 0.602
6.335LeuGlu: 6.335 ± 0.866
2.956LeuPhe: 2.956 ± 0.525
4.477LeuGly: 4.477 ± 0.529
1.098LeuHis: 1.098 ± 0.341
2.787LeuIle: 2.787 ± 0.582
7.433LeuLys: 7.433 ± 0.724
5.49LeuLeu: 5.49 ± 0.805
1.436LeuMet: 1.436 ± 0.298
5.321LeuAsn: 5.321 ± 0.698
2.956LeuPro: 2.956 ± 0.421
3.21LeuGln: 3.21 ± 0.749
2.787LeuArg: 2.787 ± 0.517
4.223LeuSer: 4.223 ± 0.609
5.575LeuThr: 5.575 ± 0.645
4.392LeuVal: 4.392 ± 0.645
0.76LeuTrp: 0.76 ± 0.281
2.365LeuTyr: 2.365 ± 0.362
0.0LeuXaa: 0.0 ± 0.0
Met
2.112MetAla: 2.112 ± 0.5
0.169MetCys: 0.169 ± 0.13
1.267MetAsp: 1.267 ± 0.338
1.52MetGlu: 1.52 ± 0.321
0.676MetPhe: 0.676 ± 0.247
0.929MetGly: 0.929 ± 0.254
0.169MetHis: 0.169 ± 0.135
1.52MetIle: 1.52 ± 0.339
2.196MetLys: 2.196 ± 0.363
1.014MetLeu: 1.014 ± 0.218
0.422MetMet: 0.422 ± 0.142
1.351MetAsn: 1.351 ± 0.343
0.929MetPro: 0.929 ± 0.262
1.351MetGln: 1.351 ± 0.321
1.774MetArg: 1.774 ± 0.445
1.52MetSer: 1.52 ± 0.329
2.281MetThr: 2.281 ± 0.328
0.845MetVal: 0.845 ± 0.249
0.253MetTrp: 0.253 ± 0.141
0.507MetTyr: 0.507 ± 0.222
0.0MetXaa: 0.0 ± 0.0
Asn
4.308AsnAla: 4.308 ± 0.606
0.338AsnCys: 0.338 ± 0.215
3.97AsnAsp: 3.97 ± 0.545
3.801AsnGlu: 3.801 ± 0.615
2.45AsnPhe: 2.45 ± 0.569
5.406AsnGly: 5.406 ± 0.883
0.929AsnHis: 0.929 ± 0.227
3.379AsnIle: 3.379 ± 0.542
4.561AsnLys: 4.561 ± 0.66
4.561AsnLeu: 4.561 ± 0.827
1.436AsnMet: 1.436 ± 0.359
3.885AsnAsn: 3.885 ± 0.811
1.943AsnPro: 1.943 ± 0.42
2.703AsnGln: 2.703 ± 0.548
2.45AsnArg: 2.45 ± 0.511
4.054AsnSer: 4.054 ± 0.535
2.872AsnThr: 2.872 ± 0.337
3.379AsnVal: 3.379 ± 0.581
0.929AsnTrp: 0.929 ± 0.248
2.027AsnTyr: 2.027 ± 0.479
0.0AsnXaa: 0.0 ± 0.0
Pro
1.436ProAla: 1.436 ± 0.384
0.253ProCys: 0.253 ± 0.16
2.534ProAsp: 2.534 ± 0.561
1.858ProGlu: 1.858 ± 0.352
1.267ProPhe: 1.267 ± 0.429
1.098ProGly: 1.098 ± 0.432
0.253ProHis: 0.253 ± 0.168
2.281ProIle: 2.281 ± 0.485
2.112ProLys: 2.112 ± 0.555
1.351ProLeu: 1.351 ± 0.367
0.422ProMet: 0.422 ± 0.236
2.196ProAsn: 2.196 ± 0.477
0.507ProPro: 0.507 ± 0.211
1.267ProGln: 1.267 ± 0.36
0.845ProArg: 0.845 ± 0.278
2.027ProSer: 2.027 ± 0.387
1.52ProThr: 1.52 ± 0.4
2.112ProVal: 2.112 ± 0.346
0.169ProTrp: 0.169 ± 0.153
1.436ProTyr: 1.436 ± 0.346
0.0ProXaa: 0.0 ± 0.0
Gln
3.125GlnAla: 3.125 ± 0.533
0.084GlnCys: 0.084 ± 0.086
1.689GlnAsp: 1.689 ± 0.39
3.21GlnGlu: 3.21 ± 0.627
1.52GlnPhe: 1.52 ± 0.401
2.956GlnGly: 2.956 ± 0.713
0.591GlnHis: 0.591 ± 0.223
2.787GlnIle: 2.787 ± 0.386
4.477GlnLys: 4.477 ± 0.815
3.379GlnLeu: 3.379 ± 0.442
1.014GlnMet: 1.014 ± 0.346
2.618GlnAsn: 2.618 ± 0.484
0.76GlnPro: 0.76 ± 0.253
2.45GlnGln: 2.45 ± 0.513
2.027GlnArg: 2.027 ± 0.418
2.872GlnSer: 2.872 ± 0.53
1.774GlnThr: 1.774 ± 0.488
3.21GlnVal: 3.21 ± 0.695
0.507GlnTrp: 0.507 ± 0.146
1.943GlnTyr: 1.943 ± 0.374
0.0GlnXaa: 0.0 ± 0.0
Arg
3.463ArgAla: 3.463 ± 0.556
0.084ArgCys: 0.084 ± 0.087
2.365ArgAsp: 2.365 ± 0.453
2.281ArgGlu: 2.281 ± 0.52
2.027ArgPhe: 2.027 ± 0.363
2.618ArgGly: 2.618 ± 0.544
0.76ArgHis: 0.76 ± 0.242
2.872ArgIle: 2.872 ± 0.44
3.125ArgLys: 3.125 ± 0.534
3.885ArgLeu: 3.885 ± 0.624
1.183ArgMet: 1.183 ± 0.288
2.534ArgAsn: 2.534 ± 0.522
1.774ArgPro: 1.774 ± 0.538
2.027ArgGln: 2.027 ± 0.335
1.858ArgArg: 1.858 ± 0.414
2.112ArgSer: 2.112 ± 0.407
2.365ArgThr: 2.365 ± 0.454
2.618ArgVal: 2.618 ± 0.4
0.591ArgTrp: 0.591 ± 0.208
1.858ArgTyr: 1.858 ± 0.356
0.0ArgXaa: 0.0 ± 0.0
Ser
4.392SerAla: 4.392 ± 0.728
0.253SerCys: 0.253 ± 0.105
5.321SerAsp: 5.321 ± 0.803
3.801SerGlu: 3.801 ± 0.698
2.45SerPhe: 2.45 ± 0.515
3.632SerGly: 3.632 ± 0.789
1.267SerHis: 1.267 ± 0.422
3.548SerIle: 3.548 ± 0.519
3.97SerLys: 3.97 ± 0.691
5.068SerLeu: 5.068 ± 0.587
1.943SerMet: 1.943 ± 0.319
3.294SerAsn: 3.294 ± 0.481
1.689SerPro: 1.689 ± 0.427
2.787SerGln: 2.787 ± 0.413
2.196SerArg: 2.196 ± 0.357
3.885SerSer: 3.885 ± 0.7
4.815SerThr: 4.815 ± 0.813
4.392SerVal: 4.392 ± 0.646
1.014SerTrp: 1.014 ± 0.25
1.774SerTyr: 1.774 ± 0.354
0.0SerXaa: 0.0 ± 0.0
Thr
4.223ThrAla: 4.223 ± 0.947
0.338ThrCys: 0.338 ± 0.156
4.223ThrAsp: 4.223 ± 0.614
4.054ThrGlu: 4.054 ± 0.542
3.21ThrPhe: 3.21 ± 0.453
4.984ThrGly: 4.984 ± 0.751
0.591ThrHis: 0.591 ± 0.241
5.913ThrIle: 5.913 ± 0.639
4.984ThrLys: 4.984 ± 0.527
5.997ThrLeu: 5.997 ± 0.812
0.676ThrMet: 0.676 ± 0.181
3.717ThrAsn: 3.717 ± 0.531
2.787ThrPro: 2.787 ± 0.463
2.281ThrGln: 2.281 ± 0.377
2.45ThrArg: 2.45 ± 0.502
2.787ThrSer: 2.787 ± 0.411
4.561ThrThr: 4.561 ± 0.747
4.223ThrVal: 4.223 ± 0.681
1.351ThrTrp: 1.351 ± 0.345
1.689ThrTyr: 1.689 ± 0.324
0.0ThrXaa: 0.0 ± 0.0
Val
4.815ValAla: 4.815 ± 0.485
0.338ValCys: 0.338 ± 0.164
4.054ValAsp: 4.054 ± 0.623
3.885ValGlu: 3.885 ± 0.493
2.112ValPhe: 2.112 ± 0.368
4.392ValGly: 4.392 ± 0.734
1.267ValHis: 1.267 ± 0.336
5.321ValIle: 5.321 ± 0.627
5.49ValLys: 5.49 ± 0.654
3.21ValLeu: 3.21 ± 0.449
0.929ValMet: 0.929 ± 0.227
3.379ValAsn: 3.379 ± 0.51
0.76ValPro: 0.76 ± 0.231
2.196ValGln: 2.196 ± 0.397
2.196ValArg: 2.196 ± 0.365
4.392ValSer: 4.392 ± 0.59
5.913ValThr: 5.913 ± 0.559
4.308ValVal: 4.308 ± 0.647
0.676ValTrp: 0.676 ± 0.377
2.787ValTyr: 2.787 ± 0.576
0.0ValXaa: 0.0 ± 0.0
Trp
1.52TrpAla: 1.52 ± 0.275
0.0TrpCys: 0.0 ± 0.0
0.591TrpAsp: 0.591 ± 0.232
0.591TrpGlu: 0.591 ± 0.185
0.507TrpPhe: 0.507 ± 0.171
0.338TrpGly: 0.338 ± 0.138
0.338TrpHis: 0.338 ± 0.19
0.845TrpIle: 0.845 ± 0.297
1.098TrpLys: 1.098 ± 0.296
0.845TrpLeu: 0.845 ± 0.251
0.253TrpMet: 0.253 ± 0.123
0.591TrpAsn: 0.591 ± 0.206
0.169TrpPro: 0.169 ± 0.168
0.845TrpGln: 0.845 ± 0.285
0.507TrpArg: 0.507 ± 0.212
0.422TrpSer: 0.422 ± 0.134
0.929TrpThr: 0.929 ± 0.337
0.845TrpVal: 0.845 ± 0.217
0.169TrpTrp: 0.169 ± 0.119
1.098TrpTyr: 1.098 ± 0.546
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.605TyrAla: 1.605 ± 0.307
0.422TyrCys: 0.422 ± 0.226
2.872TyrAsp: 2.872 ± 0.514
2.112TyrGlu: 2.112 ± 0.496
2.365TyrPhe: 2.365 ± 0.458
2.703TyrGly: 2.703 ± 0.452
0.507TyrHis: 0.507 ± 0.232
2.281TyrIle: 2.281 ± 0.421
3.294TyrLys: 3.294 ± 0.581
3.801TyrLeu: 3.801 ± 0.662
0.845TyrMet: 0.845 ± 0.293
1.943TyrAsn: 1.943 ± 0.43
1.52TyrPro: 1.52 ± 0.343
1.436TyrGln: 1.436 ± 0.31
2.196TyrArg: 2.196 ± 0.498
2.787TyrSer: 2.787 ± 0.447
2.45TyrThr: 2.45 ± 0.53
1.351TyrVal: 1.351 ± 0.341
0.253TyrTrp: 0.253 ± 0.168
2.365TyrTyr: 2.365 ± 0.662
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11840 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski