Amino acid dipeptide frequency for Streptococcus phage Javan501

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
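The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_frequencies`, the one-letter input format, the mapping of any non-standard letter to `X` (Xaa), and the normalisation by the total number of overlapping pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Hypothetical helper: pairs are overlapping and never span two proteins;
    the frequency is normalised by the total number of pairs (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        residues = [r if r in STANDARD else "X" for r in seq.upper()]
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

# Toy input (not real phage proteins): 4 + 3 = 7 overlapping pairs in total.
toy = ["MKKLA", "MKLL"]
freqs = dipeptide_frequencies(toy)
print(len(freqs))             # number of dipeptide keys, including Xaa pairs
print(round(freqs["MK"], 3))  # MK occurs in 2 of the 7 pairs
```

With only 400 standard dipeptides, a uniform distribution would give each one 1000/400 = 2.5‰, which is the baseline the page compares against.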

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
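As a small worked example of the unit conversion and of the comparison with the random expectation, the sketch below converts a few values from the table to fractions and labels them relative to 2.5‰. The names `per_mille_to_fraction` and `classify` are hypothetical, and using the uniform expectation as the red/blue threshold is an assumption based on the description above.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed threshold)."""
    if value > expected:
        return "over"    # shown in red on the page
    if value < expected:
        return "under"   # shown in blue on the page
    return "as expected"

# Values taken from the table for this proteome.
examples = {"AlaAla": 3.988, "CysCys": 0.077, "LeuLys": 8.283}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```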

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency table (values in ‰, given as value ± error). Each row lists the dipeptides starting with the named residue; the second residue follows the column order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala: AlaAla 3.988 ± 0.973; AlaCys 0.537 ± 0.231; AlaAsp 3.911 ± 0.519; AlaGlu 4.525 ± 0.865; AlaPhe 2.147 ± 0.334; AlaGly 4.832 ± 0.704; AlaHis 0.767 ± 0.212; AlaIle 5.292 ± 0.648; AlaLys 6.902 ± 0.75; AlaLeu 5.752 ± 1.018; AlaMet 2.147 ± 0.349; AlaAsn 3.758 ± 0.511; AlaPro 1.534 ± 0.309; AlaGln 2.838 ± 0.614; AlaArg 2.301 ± 0.455; AlaSer 4.678 ± 0.777; AlaThr 4.448 ± 0.715; AlaVal 4.295 ± 0.685; AlaTrp 0.92 ± 0.222; AlaTyr 2.147 ± 0.374; AlaXaa 0.0 ± 0.0
Cys: CysAla 0.23 ± 0.138; CysCys 0.077 ± 0.082; CysAsp 0.383 ± 0.154; CysGlu 0.69 ± 0.22; CysPhe 0.307 ± 0.138; CysGly 0.92 ± 0.277; CysHis 0.077 ± 0.065; CysIle 0.23 ± 0.131; CysLys 0.537 ± 0.214; CysLeu 0.383 ± 0.164; CysMet 0.153 ± 0.121; CysAsn 0.307 ± 0.164; CysPro 0.153 ± 0.119; CysGln 0.23 ± 0.121; CysArg 0.23 ± 0.105; CysSer 0.23 ± 0.136; CysThr 0.307 ± 0.155; CysVal 0.537 ± 0.248; CysTrp 0.153 ± 0.102; CysTyr 0.307 ± 0.127; CysXaa 0.0 ± 0.0
Asp: AspAla 3.681 ± 0.493; AspCys 0.537 ± 0.195; AspAsp 4.448 ± 0.5; AspGlu 5.062 ± 0.712; AspPhe 3.221 ± 0.527; AspGly 5.138 ± 0.656; AspHis 1.074 ± 0.298; AspIle 4.295 ± 0.675; AspLys 6.135 ± 0.684; AspLeu 6.519 ± 0.627; AspMet 1.841 ± 0.34; AspAsn 4.448 ± 0.469; AspPro 1.841 ± 0.423; AspGln 1.074 ± 0.319; AspArg 2.147 ± 0.302; AspSer 3.605 ± 0.504; AspThr 3.835 ± 0.571; AspVal 4.218 ± 0.511; AspTrp 0.92 ± 0.333; AspTyr 3.374 ± 0.551; AspXaa 0.0 ± 0.0
Glu: GluAla 4.678 ± 0.728; GluCys 0.537 ± 0.24; GluAsp 3.528 ± 0.545; GluGlu 5.752 ± 0.766; GluPhe 3.221 ± 0.387; GluGly 2.454 ± 0.381; GluHis 1.15 ± 0.347; GluIle 6.902 ± 0.775; GluLys 6.366 ± 0.78; GluLeu 7.439 ± 0.79; GluMet 2.071 ± 0.42; GluAsn 3.068 ± 0.518; GluPro 1.994 ± 0.39; GluGln 3.298 ± 0.525; GluArg 2.914 ± 0.553; GluSer 4.218 ± 0.541; GluThr 4.295 ± 0.519; GluVal 4.525 ± 0.733; GluTrp 0.997 ± 0.191; GluTyr 2.608 ± 0.345; GluXaa 0.0 ± 0.0
Phe: PheAla 2.454 ± 0.341; PheCys 0.307 ± 0.157; PheAsp 3.221 ± 0.487; PheGlu 3.144 ± 0.556; PhePhe 1.38 ± 0.399; PheGly 2.838 ± 0.416; PheHis 0.307 ± 0.154; PheIle 2.531 ± 0.502; PheLys 4.065 ± 0.602; PheLeu 1.994 ± 0.433; PheMet 1.15 ± 0.232; PheAsn 2.147 ± 0.382; PhePro 0.92 ± 0.228; PheGln 1.074 ± 0.238; PheArg 1.764 ± 0.423; PheSer 3.221 ± 0.398; PheThr 2.531 ± 0.445; PheVal 2.224 ± 0.368; PheTrp 0.307 ± 0.123; PheTyr 1.687 ± 0.382; PheXaa 0.0 ± 0.0
Gly: GlyAla 3.528 ± 0.549; GlyCys 0.307 ± 0.137; GlyAsp 4.448 ± 0.538; GlyGlu 3.144 ± 0.5; GlyPhe 2.761 ± 0.519; GlyGly 4.832 ± 0.672; GlyHis 1.227 ± 0.276; GlyIle 4.832 ± 0.62; GlyLys 6.902 ± 0.591; GlyLeu 5.138 ± 0.774; GlyMet 1.764 ± 0.407; GlyAsn 3.835 ± 0.457; GlyPro 0.614 ± 0.284; GlyGln 2.224 ± 0.38; GlyArg 2.991 ± 0.524; GlySer 3.068 ± 0.394; GlyThr 2.838 ± 0.466; GlyVal 4.908 ± 0.536; GlyTrp 1.227 ± 0.368; GlyTyr 2.684 ± 0.531; GlyXaa 0.0 ± 0.0
His: HisAla 1.304 ± 0.337; HisCys 0.0 ± 0.0; HisAsp 0.92 ± 0.335; HisGlu 0.767 ± 0.218; HisPhe 0.92 ± 0.219; HisGly 0.69 ± 0.231; HisHis 0.077 ± 0.072; HisIle 1.074 ± 0.277; HisLys 0.767 ± 0.22; HisLeu 0.92 ± 0.217; HisMet 0.23 ± 0.148; HisAsn 0.767 ± 0.205; HisPro 0.614 ± 0.198; HisGln 0.46 ± 0.199; HisArg 0.69 ± 0.208; HisSer 0.92 ± 0.31; HisThr 0.92 ± 0.267; HisVal 0.92 ± 0.206; HisTrp 0.383 ± 0.168; HisTyr 0.767 ± 0.253; HisXaa 0.0 ± 0.0
Ile: IleAla 4.985 ± 0.554; IleCys 0.46 ± 0.181; IleAsp 6.366 ± 0.965; IleGlu 5.982 ± 0.544; IlePhe 1.841 ± 0.353; IleGly 3.528 ± 0.554; IleHis 1.074 ± 0.243; IleIle 5.369 ± 0.723; IleLys 6.902 ± 0.678; IleLeu 6.212 ± 0.955; IleMet 0.997 ± 0.255; IleAsn 4.908 ± 0.644; IlePro 2.914 ± 0.477; IleGln 1.304 ± 0.297; IleArg 2.531 ± 0.413; IleSer 3.911 ± 0.628; IleThr 4.908 ± 0.639; IleVal 3.605 ± 0.519; IleTrp 0.69 ± 0.253; IleTyr 3.144 ± 0.495; IleXaa 0.0 ± 0.0
Lys: LysAla 6.596 ± 0.776; LysCys 0.767 ± 0.211; LysAsp 5.522 ± 0.814; LysGlu 6.826 ± 0.671; LysPhe 2.991 ± 0.376; LysGly 4.525 ± 0.584; LysHis 1.457 ± 0.266; LysIle 6.979 ± 0.962; LysLys 7.899 ± 0.93; LysLeu 7.132 ± 0.738; LysMet 2.224 ± 0.366; LysAsn 6.059 ± 0.66; LysPro 3.144 ± 0.459; LysGln 5.062 ± 0.738; LysArg 3.221 ± 0.596; LysSer 5.445 ± 0.559; LysThr 5.905 ± 0.708; LysVal 4.755 ± 0.677; LysTrp 0.844 ± 0.215; LysTyr 3.451 ± 0.492; LysXaa 0.0 ± 0.0
Leu: LeuAla 5.982 ± 0.936; LeuCys 0.307 ± 0.144; LeuAsp 6.596 ± 0.668; LeuGlu 6.979 ± 0.794; LeuPhe 3.144 ± 0.635; LeuGly 4.985 ± 0.659; LeuHis 0.92 ± 0.294; LeuIle 5.982 ± 0.802; LeuLys 8.283 ± 0.992; LeuLeu 6.289 ± 0.747; LeuMet 1.917 ± 0.375; LeuAsn 5.599 ± 0.571; LeuPro 2.991 ± 0.349; LeuGln 4.218 ± 0.644; LeuArg 3.605 ± 0.582; LeuSer 5.138 ± 0.718; LeuThr 5.062 ± 0.645; LeuVal 4.525 ± 0.615; LeuTrp 0.69 ± 0.291; LeuTyr 2.914 ± 0.501; LeuXaa 0.0 ± 0.0
Met: MetAla 1.917 ± 0.341; MetCys 0.23 ± 0.126; MetAsp 1.687 ± 0.404; MetGlu 1.38 ± 0.357; MetPhe 1.15 ± 0.269; MetGly 1.074 ± 0.274; MetHis 0.383 ± 0.165; MetIle 2.147 ± 0.447; MetLys 1.687 ± 0.337; MetLeu 2.071 ± 0.419; MetMet 0.383 ± 0.187; MetAsn 1.15 ± 0.291; MetPro 0.537 ± 0.184; MetGln 0.69 ± 0.23; MetArg 1.917 ± 0.337; MetSer 1.764 ± 0.424; MetThr 1.917 ± 0.472; MetVal 1.764 ± 0.451; MetTrp 0.307 ± 0.145; MetTyr 0.69 ± 0.238; MetXaa 0.0 ± 0.0
Asn: AsnAla 3.681 ± 0.641; AsnCys 0.153 ± 0.144; AsnAsp 3.911 ± 0.454; AsnGlu 3.988 ± 0.46; AsnPhe 2.071 ± 0.364; AsnGly 5.215 ± 0.761; AsnHis 0.767 ± 0.215; AsnIle 3.681 ± 0.595; AsnLys 4.065 ± 0.51; AsnLeu 5.599 ± 0.558; AsnMet 1.611 ± 0.314; AsnAsn 3.758 ± 0.551; AsnPro 2.531 ± 0.423; AsnGln 2.377 ± 0.518; AsnArg 2.761 ± 0.422; AsnSer 3.605 ± 0.567; AsnThr 2.684 ± 0.397; AsnVal 2.991 ± 0.385; AsnTrp 0.767 ± 0.224; AsnTyr 2.301 ± 0.437; AsnXaa 0.0 ± 0.0
Pro: ProAla 1.611 ± 0.451; ProCys 0.077 ± 0.065; ProAsp 1.994 ± 0.298; ProGlu 2.147 ± 0.368; ProPhe 1.304 ± 0.303; ProGly 0.767 ± 0.214; ProHis 0.537 ± 0.213; ProIle 1.534 ± 0.414; ProLys 3.298 ± 0.508; ProLeu 2.914 ± 0.536; ProMet 0.307 ± 0.206; ProAsn 1.841 ± 0.458; ProPro 0.614 ± 0.25; ProGln 1.227 ± 0.295; ProArg 0.767 ± 0.242; ProSer 1.611 ± 0.347; ProThr 2.301 ± 0.404; ProVal 1.841 ± 0.416; ProTrp 0.23 ± 0.124; ProTyr 1.611 ± 0.467; ProXaa 0.0 ± 0.0
Gln: GlnAla 2.838 ± 0.364; GlnCys 0.23 ± 0.147; GlnAsp 1.611 ± 0.311; GlnGlu 2.838 ± 0.497; GlnPhe 2.224 ± 0.383; GlnGly 2.224 ± 0.356; GlnHis 0.46 ± 0.165; GlnIle 2.608 ± 0.454; GlnLys 3.605 ± 0.439; GlnLeu 4.295 ± 0.935; GlnMet 1.534 ± 0.322; GlnAsn 2.071 ± 0.546; GlnPro 0.614 ± 0.199; GlnGln 1.917 ± 0.61; GlnArg 1.227 ± 0.358; GlnSer 3.681 ± 0.499; GlnThr 2.071 ± 0.421; GlnVal 1.534 ± 0.371; GlnTrp 0.383 ± 0.203; GlnTyr 1.227 ± 0.291; GlnXaa 0.0 ± 0.0
Arg: ArgAla 2.608 ± 0.36; ArgCys 0.23 ± 0.172; ArgAsp 2.761 ± 0.509; ArgGlu 2.684 ± 0.404; ArgPhe 1.457 ± 0.397; ArgGly 2.377 ± 0.472; ArgHis 0.614 ± 0.266; ArgIle 3.068 ± 0.347; ArgLys 3.988 ± 0.578; ArgLeu 3.681 ± 0.533; ArgMet 0.767 ± 0.236; ArgAsn 2.454 ± 0.422; ArgPro 0.767 ± 0.328; ArgGln 1.764 ± 0.281; ArgArg 2.531 ± 0.515; ArgSer 1.917 ± 0.372; ArgThr 2.531 ± 0.538; ArgVal 2.608 ± 0.499; ArgTrp 0.307 ± 0.159; ArgTyr 1.841 ± 0.406; ArgXaa 0.0 ± 0.0
Ser: SerAla 4.372 ± 0.855; SerCys 0.153 ± 0.105; SerAsp 4.755 ± 0.582; SerGlu 4.218 ± 0.534; SerPhe 2.608 ± 0.427; SerGly 4.985 ± 0.677; SerHis 0.997 ± 0.308; SerIle 3.681 ± 0.544; SerLys 4.678 ± 0.617; SerLeu 4.141 ± 0.544; SerMet 1.917 ± 0.346; SerAsn 3.298 ± 0.521; SerPro 1.534 ± 0.311; SerGln 2.761 ± 0.486; SerArg 2.147 ± 0.442; SerSer 3.835 ± 0.693; SerThr 3.298 ± 0.594; SerVal 3.835 ± 0.506; SerTrp 0.92 ± 0.295; SerTyr 2.608 ± 0.501; SerXaa 0.0 ± 0.0
Thr: ThrAla 5.675 ± 0.837; ThrCys 0.23 ± 0.133; ThrAsp 3.221 ± 0.481; ThrGlu 3.605 ± 0.511; ThrPhe 2.684 ± 0.442; ThrGly 4.678 ± 0.577; ThrHis 0.844 ± 0.273; ThrIle 4.448 ± 0.456; ThrLys 4.678 ± 0.509; ThrLeu 4.908 ± 0.604; ThrMet 1.457 ± 0.306; ThrAsn 2.914 ± 0.415; ThrPro 2.224 ± 0.399; ThrGln 1.994 ± 0.418; ThrArg 2.147 ± 0.292; ThrSer 3.681 ± 0.493; ThrThr 3.528 ± 0.469; ThrVal 3.605 ± 0.6; ThrTrp 0.767 ± 0.202; ThrTyr 1.841 ± 0.379; ThrXaa 0.0 ± 0.0
Val: ValAla 4.525 ± 0.845; ValCys 0.614 ± 0.258; ValAsp 4.295 ± 0.592; ValGlu 4.908 ± 0.543; ValPhe 2.224 ± 0.438; ValGly 4.295 ± 0.541; ValHis 0.844 ± 0.319; ValIle 3.988 ± 0.532; ValLys 5.675 ± 0.859; ValLeu 6.135 ± 0.742; ValMet 1.304 ± 0.353; ValAsn 3.605 ± 0.56; ValPro 1.15 ± 0.216; ValGln 1.38 ± 0.257; ValArg 2.147 ± 0.366; ValSer 3.605 ± 0.485; ValThr 3.144 ± 0.543; ValVal 2.838 ± 0.294; ValTrp 0.383 ± 0.142; ValTyr 1.534 ± 0.385; ValXaa 0.0 ± 0.0
Trp: TrpAla 0.383 ± 0.15; TrpCys 0.23 ± 0.118; TrpAsp 0.767 ± 0.233; TrpGlu 0.844 ± 0.246; TrpPhe 0.69 ± 0.239; TrpGly 0.69 ± 0.257; TrpHis 0.153 ± 0.106; TrpIle 0.844 ± 0.271; TrpLys 1.304 ± 0.308; TrpLeu 1.227 ± 0.341; TrpMet 0.23 ± 0.141; TrpAsn 0.383 ± 0.19; TrpPro 0.307 ± 0.153; TrpGln 0.69 ± 0.183; TrpArg 0.69 ± 0.277; TrpSer 0.614 ± 0.194; TrpThr 0.537 ± 0.172; TrpVal 0.537 ± 0.177; TrpTrp 0.077 ± 0.066; TrpTyr 0.383 ± 0.175; TrpXaa 0.0 ± 0.0
Tyr: TyrAla 2.684 ± 0.371; TyrCys 0.46 ± 0.213; TyrAsp 3.068 ± 0.485; TyrGlu 2.531 ± 0.415; TyrPhe 0.92 ± 0.306; TyrGly 2.224 ± 0.358; TyrHis 0.383 ± 0.164; TyrIle 2.147 ± 0.486; TyrLys 3.144 ± 0.526; TyrLeu 3.374 ± 0.504; TyrMet 0.767 ± 0.204; TyrAsn 2.147 ± 0.499; TyrPro 1.534 ± 0.431; TyrGln 2.608 ± 0.444; TyrArg 2.224 ± 0.426; TyrSer 1.994 ± 0.368; TyrThr 2.071 ± 0.414; TyrVal 2.531 ± 0.42; TyrTrp 0.307 ± 0.115; TyrTyr 1.994 ± 0.318; TyrXaa 0.0 ± 0.0
Xaa: XaaAla 0.0 ± 0.0; XaaCys 0.0 ± 0.0; XaaAsp 0.0 ± 0.0; XaaGlu 0.0 ± 0.0; XaaPhe 0.0 ± 0.0; XaaGly 0.0 ± 0.0; XaaHis 0.0 ± 0.0; XaaIle 0.0 ± 0.0; XaaLys 0.0 ± 0.0; XaaLeu 0.0 ± 0.0; XaaMet 0.0 ± 0.0; XaaAsn 0.0 ± 0.0; XaaPro 0.0 ± 0.0; XaaGln 0.0 ± 0.0; XaaArg 0.0 ± 0.0; XaaSer 0.0 ± 0.0; XaaThr 0.0 ± 0.0; XaaVal 0.0 ± 0.0; XaaTrp 0.0 ± 0.0; XaaTyr 0.0 ± 0.0; XaaXaa 0.0 ± 0.0

Statistics based on 63 proteins (13040 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
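One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those replicates is reported as the error. The sketch below follows that reading; it is an assumption, not the database's documented procedure, and the function names, the use of the standard deviation as the error measure, and the fixed seed are illustrative choices.

```python
import random
import statistics

def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(replicates)

# Toy sequences, not the 63 proteins of this proteome.
toy = ["MKKLAEK", "MKLLDK", "MAAKK", "MEEL"]
err = bootstrap_error(toy, "KK")
print(dipeptide_per_mille(toy, "KK"), err)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the proteome.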

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski