Amino acid dipepetide frequency for Streptococcus phage Javan246

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.863AlaAla: 2.863 ± 0.921
0.089AlaCys: 0.089 ± 0.092
4.742AlaAsp: 4.742 ± 0.639
6.8AlaGlu: 6.8 ± 0.937
2.326AlaPhe: 2.326 ± 0.476
6.084AlaGly: 6.084 ± 1.656
0.895AlaHis: 0.895 ± 0.326
4.831AlaIle: 4.831 ± 0.568
7.158AlaLys: 7.158 ± 1.034
6.442AlaLeu: 6.442 ± 1.169
2.147AlaMet: 2.147 ± 0.469
3.221AlaAsn: 3.221 ± 0.453
2.147AlaPro: 2.147 ± 0.49
2.416AlaGln: 2.416 ± 0.745
4.473AlaArg: 4.473 ± 0.923
6.084AlaSer: 6.084 ± 1.642
3.579AlaThr: 3.579 ± 0.473
4.831AlaVal: 4.831 ± 0.725
0.805AlaTrp: 0.805 ± 0.306
2.505AlaTyr: 2.505 ± 0.487
0.0AlaXaa: 0.0 ± 0.0
Cys
0.268CysAla: 0.268 ± 0.169
0.0CysCys: 0.0 ± 0.0
0.537CysAsp: 0.537 ± 0.205
0.268CysGlu: 0.268 ± 0.142
0.447CysPhe: 0.447 ± 0.219
0.179CysGly: 0.179 ± 0.124
0.358CysHis: 0.358 ± 0.184
0.089CysIle: 0.089 ± 0.104
0.358CysLys: 0.358 ± 0.168
0.895CysLeu: 0.895 ± 0.333
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.358CysPro: 0.358 ± 0.182
0.268CysGln: 0.268 ± 0.14
0.179CysArg: 0.179 ± 0.128
0.268CysSer: 0.268 ± 0.153
0.089CysThr: 0.089 ± 0.081
0.447CysVal: 0.447 ± 0.199
0.089CysTrp: 0.089 ± 0.081
0.179CysTyr: 0.179 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
4.742AspAla: 4.742 ± 0.78
0.358AspCys: 0.358 ± 0.18
4.026AspAsp: 4.026 ± 0.702
4.205AspGlu: 4.205 ± 0.514
3.579AspPhe: 3.579 ± 0.605
5.547AspGly: 5.547 ± 0.775
0.805AspHis: 0.805 ± 0.282
3.579AspIle: 3.579 ± 0.724
6.531AspLys: 6.531 ± 0.644
4.921AspLeu: 4.921 ± 0.634
1.342AspMet: 1.342 ± 0.341
3.758AspAsn: 3.758 ± 0.604
1.61AspPro: 1.61 ± 0.383
1.61AspGln: 1.61 ± 0.282
3.131AspArg: 3.131 ± 0.537
3.31AspSer: 3.31 ± 0.619
2.952AspThr: 2.952 ± 0.562
3.579AspVal: 3.579 ± 0.505
0.984AspTrp: 0.984 ± 0.273
2.952AspTyr: 2.952 ± 0.608
0.0AspXaa: 0.0 ± 0.0
Glu
6.173GluAla: 6.173 ± 0.763
0.537GluCys: 0.537 ± 0.311
3.847GluAsp: 3.847 ± 0.471
6.889GluGlu: 6.889 ± 1.022
2.505GluPhe: 2.505 ± 0.383
3.31GluGly: 3.31 ± 0.682
1.163GluHis: 1.163 ± 0.275
6.442GluIle: 6.442 ± 0.767
7.068GluLys: 7.068 ± 0.974
9.036GluLeu: 9.036 ± 0.864
2.863GluMet: 2.863 ± 0.499
3.131GluAsn: 3.131 ± 0.509
1.342GluPro: 1.342 ± 0.335
3.489GluGln: 3.489 ± 0.569
4.026GluArg: 4.026 ± 0.552
4.652GluSer: 4.652 ± 0.789
3.042GluThr: 3.042 ± 0.483
5.458GluVal: 5.458 ± 0.948
0.805GluTrp: 0.805 ± 0.252
2.595GluTyr: 2.595 ± 0.564
0.0GluXaa: 0.0 ± 0.0
Phe
1.7PheAla: 1.7 ± 0.281
0.358PheCys: 0.358 ± 0.174
4.205PheAsp: 4.205 ± 0.433
3.042PheGlu: 3.042 ± 0.642
0.805PhePhe: 0.805 ± 0.258
2.863PheGly: 2.863 ± 0.521
0.358PheHis: 0.358 ± 0.169
2.505PheIle: 2.505 ± 0.512
3.758PheLys: 3.758 ± 0.676
2.505PheLeu: 2.505 ± 0.366
1.074PheMet: 1.074 ± 0.336
1.7PheAsn: 1.7 ± 0.314
0.984PhePro: 0.984 ± 0.37
0.895PheGln: 0.895 ± 0.264
1.432PheArg: 1.432 ± 0.279
2.863PheSer: 2.863 ± 0.503
2.416PheThr: 2.416 ± 0.468
2.237PheVal: 2.237 ± 0.57
0.179PheTrp: 0.179 ± 0.114
2.058PheTyr: 2.058 ± 0.476
0.0PheXaa: 0.0 ± 0.0
Gly
4.921GlyAla: 4.921 ± 1.346
0.447GlyCys: 0.447 ± 0.196
3.758GlyAsp: 3.758 ± 0.746
4.742GlyGlu: 4.742 ± 0.542
4.116GlyPhe: 4.116 ± 1.009
3.31GlyGly: 3.31 ± 0.649
1.342GlyHis: 1.342 ± 0.372
6.531GlyIle: 6.531 ± 1.108
3.758GlyLys: 3.758 ± 0.514
5.637GlyLeu: 5.637 ± 0.804
1.7GlyMet: 1.7 ± 0.383
4.295GlyAsn: 4.295 ± 1.259
0.626GlyPro: 0.626 ± 0.204
4.026GlyGln: 4.026 ± 0.634
3.4GlyArg: 3.4 ± 0.642
3.489GlySer: 3.489 ± 1.01
3.131GlyThr: 3.131 ± 0.86
3.042GlyVal: 3.042 ± 0.626
0.179GlyTrp: 0.179 ± 0.114
2.326GlyTyr: 2.326 ± 0.464
0.0GlyXaa: 0.0 ± 0.0
His
0.805HisAla: 0.805 ± 0.251
0.0HisCys: 0.0 ± 0.0
0.895HisAsp: 0.895 ± 0.31
0.626HisGlu: 0.626 ± 0.223
0.716HisPhe: 0.716 ± 0.269
0.716HisGly: 0.716 ± 0.318
0.179HisHis: 0.179 ± 0.143
0.984HisIle: 0.984 ± 0.337
1.789HisLys: 1.789 ± 0.349
1.253HisLeu: 1.253 ± 0.322
0.0HisMet: 0.0 ± 0.0
0.626HisAsn: 0.626 ± 0.257
0.895HisPro: 0.895 ± 0.298
0.447HisGln: 0.447 ± 0.186
0.358HisArg: 0.358 ± 0.159
0.626HisSer: 0.626 ± 0.204
0.537HisThr: 0.537 ± 0.228
0.716HisVal: 0.716 ± 0.277
0.179HisTrp: 0.179 ± 0.123
0.447HisTyr: 0.447 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
5.1IleAla: 5.1 ± 0.592
0.537IleCys: 0.537 ± 0.177
5.547IleAsp: 5.547 ± 0.741
6.979IleGlu: 6.979 ± 0.601
1.521IlePhe: 1.521 ± 0.345
4.026IleGly: 4.026 ± 0.554
0.805IleHis: 0.805 ± 0.328
4.116IleIle: 4.116 ± 0.809
6.531IleLys: 6.531 ± 0.764
4.026IleLeu: 4.026 ± 0.794
1.253IleMet: 1.253 ± 0.335
5.994IleAsn: 5.994 ± 0.871
2.326IlePro: 2.326 ± 0.358
3.847IleGln: 3.847 ± 0.556
2.952IleArg: 2.952 ± 0.457
3.937IleSer: 3.937 ± 0.762
4.295IleThr: 4.295 ± 0.596
3.31IleVal: 3.31 ± 0.457
0.537IleTrp: 0.537 ± 0.217
1.968IleTyr: 1.968 ± 0.378
0.0IleXaa: 0.0 ± 0.0
Lys
7.694LysAla: 7.694 ± 0.94
0.447LysCys: 0.447 ± 0.183
4.652LysAsp: 4.652 ± 0.614
7.605LysGlu: 7.605 ± 0.859
2.684LysPhe: 2.684 ± 0.387
2.863LysGly: 2.863 ± 0.517
0.895LysHis: 0.895 ± 0.24
7.158LysIle: 7.158 ± 0.904
7.873LysLys: 7.873 ± 1.241
7.873LysLeu: 7.873 ± 0.658
1.789LysMet: 1.789 ± 0.43
5.279LysAsn: 5.279 ± 0.524
2.595LysPro: 2.595 ± 0.412
3.937LysGln: 3.937 ± 0.6
3.31LysArg: 3.31 ± 0.543
5.547LysSer: 5.547 ± 0.655
4.652LysThr: 4.652 ± 0.762
4.563LysVal: 4.563 ± 0.653
0.805LysTrp: 0.805 ± 0.222
2.237LysTyr: 2.237 ± 0.531
0.0LysXaa: 0.0 ± 0.0
Leu
7.336LeuAla: 7.336 ± 0.964
0.537LeuCys: 0.537 ± 0.199
5.905LeuAsp: 5.905 ± 0.784
6.71LeuGlu: 6.71 ± 0.752
3.042LeuPhe: 3.042 ± 0.648
5.994LeuGly: 5.994 ± 0.731
1.253LeuHis: 1.253 ± 0.363
4.652LeuIle: 4.652 ± 0.62
6.71LeuLys: 6.71 ± 0.837
5.458LeuLeu: 5.458 ± 0.893
1.879LeuMet: 1.879 ± 0.449
5.905LeuAsn: 5.905 ± 0.732
3.042LeuPro: 3.042 ± 0.537
2.237LeuGln: 2.237 ± 0.534
3.131LeuArg: 3.131 ± 0.423
5.816LeuSer: 5.816 ± 0.613
4.921LeuThr: 4.921 ± 0.758
4.921LeuVal: 4.921 ± 0.573
0.447LeuTrp: 0.447 ± 0.175
2.237LeuTyr: 2.237 ± 0.464
0.0LeuXaa: 0.0 ± 0.0
Met
2.684MetAla: 2.684 ± 0.532
0.089MetCys: 0.089 ± 0.091
1.61MetAsp: 1.61 ± 0.287
1.253MetGlu: 1.253 ± 0.371
0.537MetPhe: 0.537 ± 0.235
0.447MetGly: 0.447 ± 0.2
0.358MetHis: 0.358 ± 0.195
1.163MetIle: 1.163 ± 0.365
2.237MetLys: 2.237 ± 0.434
1.521MetLeu: 1.521 ± 0.393
0.447MetMet: 0.447 ± 0.205
1.163MetAsn: 1.163 ± 0.316
0.805MetPro: 0.805 ± 0.244
0.537MetGln: 0.537 ± 0.27
1.432MetArg: 1.432 ± 0.363
1.521MetSer: 1.521 ± 0.382
2.595MetThr: 2.595 ± 0.458
1.074MetVal: 1.074 ± 0.292
0.358MetTrp: 0.358 ± 0.149
0.805MetTyr: 0.805 ± 0.293
0.0MetXaa: 0.0 ± 0.0
Asn
5.368AsnAla: 5.368 ± 1.135
0.447AsnCys: 0.447 ± 0.199
2.863AsnAsp: 2.863 ± 0.544
3.847AsnGlu: 3.847 ± 0.535
1.968AsnPhe: 1.968 ± 0.446
4.563AsnGly: 4.563 ± 0.915
0.626AsnHis: 0.626 ± 0.271
3.489AsnIle: 3.489 ± 0.446
4.831AsnLys: 4.831 ± 0.752
4.921AsnLeu: 4.921 ± 0.672
0.984AsnMet: 0.984 ± 0.288
3.131AsnAsn: 3.131 ± 0.503
2.505AsnPro: 2.505 ± 0.555
3.042AsnGln: 3.042 ± 0.506
1.789AsnArg: 1.789 ± 0.473
3.131AsnSer: 3.131 ± 0.576
3.4AsnThr: 3.4 ± 0.564
2.774AsnVal: 2.774 ± 0.546
0.537AsnTrp: 0.537 ± 0.207
1.789AsnTyr: 1.789 ± 0.357
0.0AsnXaa: 0.0 ± 0.0
Pro
1.342ProAla: 1.342 ± 0.407
0.089ProCys: 0.089 ± 0.081
1.7ProAsp: 1.7 ± 0.333
2.147ProGlu: 2.147 ± 0.413
1.521ProPhe: 1.521 ± 0.38
1.163ProGly: 1.163 ± 0.291
0.358ProHis: 0.358 ± 0.223
2.147ProIle: 2.147 ± 0.372
2.416ProLys: 2.416 ± 0.336
2.684ProLeu: 2.684 ± 0.571
0.268ProMet: 0.268 ± 0.145
1.074ProAsn: 1.074 ± 0.304
0.626ProPro: 0.626 ± 0.255
0.716ProGln: 0.716 ± 0.241
1.253ProArg: 1.253 ± 0.393
1.521ProSer: 1.521 ± 0.298
1.789ProThr: 1.789 ± 0.4
3.131ProVal: 3.131 ± 0.627
0.179ProTrp: 0.179 ± 0.116
1.253ProTyr: 1.253 ± 0.308
0.0ProXaa: 0.0 ± 0.0
Gln
4.384GlnAla: 4.384 ± 0.884
0.089GlnCys: 0.089 ± 0.081
1.968GlnAsp: 1.968 ± 0.336
2.595GlnGlu: 2.595 ± 0.524
1.342GlnPhe: 1.342 ± 0.357
2.774GlnGly: 2.774 ± 1.023
0.447GlnHis: 0.447 ± 0.28
2.952GlnIle: 2.952 ± 0.464
3.221GlnLys: 3.221 ± 0.727
3.221GlnLeu: 3.221 ± 0.392
1.432GlnMet: 1.432 ± 0.407
2.058GlnAsn: 2.058 ± 0.357
0.447GlnPro: 0.447 ± 0.21
2.416GlnGln: 2.416 ± 0.368
1.61GlnArg: 1.61 ± 0.417
3.758GlnSer: 3.758 ± 0.511
2.326GlnThr: 2.326 ± 0.435
2.505GlnVal: 2.505 ± 0.477
0.537GlnTrp: 0.537 ± 0.242
2.237GlnTyr: 2.237 ± 0.401
0.0GlnXaa: 0.0 ± 0.0
Arg
2.774ArgAla: 2.774 ± 0.607
0.0ArgCys: 0.0 ± 0.0
2.684ArgAsp: 2.684 ± 0.485
4.116ArgGlu: 4.116 ± 0.524
1.342ArgPhe: 1.342 ± 0.355
2.505ArgGly: 2.505 ± 0.605
0.358ArgHis: 0.358 ± 0.179
3.221ArgIle: 3.221 ± 0.723
3.131ArgLys: 3.131 ± 0.414
3.758ArgLeu: 3.758 ± 0.566
1.074ArgMet: 1.074 ± 0.333
2.952ArgAsn: 2.952 ± 0.61
0.895ArgPro: 0.895 ± 0.339
2.774ArgGln: 2.774 ± 0.458
2.058ArgArg: 2.058 ± 0.368
2.863ArgSer: 2.863 ± 0.51
1.7ArgThr: 1.7 ± 0.598
3.31ArgVal: 3.31 ± 0.987
0.537ArgTrp: 0.537 ± 0.22
1.968ArgTyr: 1.968 ± 0.449
0.0ArgXaa: 0.0 ± 0.0
Ser
5.01SerAla: 5.01 ± 1.552
0.447SerCys: 0.447 ± 0.183
4.295SerAsp: 4.295 ± 0.665
4.473SerGlu: 4.473 ± 0.547
2.952SerPhe: 2.952 ± 0.57
6.889SerGly: 6.889 ± 1.532
0.895SerHis: 0.895 ± 0.242
5.189SerIle: 5.189 ± 0.76
4.742SerLys: 4.742 ± 0.514
4.384SerLeu: 4.384 ± 0.606
1.074SerMet: 1.074 ± 0.278
3.221SerAsn: 3.221 ± 0.558
1.163SerPro: 1.163 ± 0.377
3.579SerGln: 3.579 ± 0.551
2.237SerArg: 2.237 ± 0.461
4.563SerSer: 4.563 ± 0.601
4.026SerThr: 4.026 ± 0.705
5.01SerVal: 5.01 ± 0.658
1.432SerTrp: 1.432 ± 0.353
1.968SerTyr: 1.968 ± 0.519
0.0SerXaa: 0.0 ± 0.0
Thr
3.847ThrAla: 3.847 ± 0.741
0.268ThrCys: 0.268 ± 0.198
2.952ThrAsp: 2.952 ± 0.539
4.563ThrGlu: 4.563 ± 0.828
2.684ThrPhe: 2.684 ± 0.596
4.473ThrGly: 4.473 ± 0.523
0.805ThrHis: 0.805 ± 0.241
4.384ThrIle: 4.384 ± 0.589
4.116ThrLys: 4.116 ± 0.574
5.01ThrLeu: 5.01 ± 0.723
0.984ThrMet: 0.984 ± 0.253
3.042ThrAsn: 3.042 ± 0.454
2.058ThrPro: 2.058 ± 0.462
1.968ThrGln: 1.968 ± 0.463
1.879ThrArg: 1.879 ± 0.516
3.489ThrSer: 3.489 ± 0.609
4.116ThrThr: 4.116 ± 0.753
4.116ThrVal: 4.116 ± 0.695
0.358ThrTrp: 0.358 ± 0.148
2.237ThrTyr: 2.237 ± 0.551
0.0ThrXaa: 0.0 ± 0.0
Val
4.026ValAla: 4.026 ± 0.566
0.358ValCys: 0.358 ± 0.175
4.384ValAsp: 4.384 ± 0.63
4.384ValGlu: 4.384 ± 0.518
2.326ValPhe: 2.326 ± 0.495
3.758ValGly: 3.758 ± 0.459
0.447ValHis: 0.447 ± 0.194
3.668ValIle: 3.668 ± 0.53
4.205ValLys: 4.205 ± 0.553
3.579ValLeu: 3.579 ± 0.739
1.61ValMet: 1.61 ± 0.364
2.952ValAsn: 2.952 ± 0.543
1.7ValPro: 1.7 ± 0.382
2.237ValGln: 2.237 ± 0.378
2.326ValArg: 2.326 ± 0.566
6.8ValSer: 6.8 ± 1.209
5.368ValThr: 5.368 ± 0.636
3.31ValVal: 3.31 ± 0.702
0.805ValTrp: 0.805 ± 0.346
3.221ValTyr: 3.221 ± 0.807
0.0ValXaa: 0.0 ± 0.0
Trp
0.984TrpAla: 0.984 ± 0.243
0.0TrpCys: 0.0 ± 0.0
0.895TrpAsp: 0.895 ± 0.369
0.626TrpGlu: 0.626 ± 0.221
0.537TrpPhe: 0.537 ± 0.255
0.358TrpGly: 0.358 ± 0.14
0.179TrpHis: 0.179 ± 0.138
0.268TrpIle: 0.268 ± 0.169
0.984TrpLys: 0.984 ± 0.338
0.895TrpLeu: 0.895 ± 0.264
0.089TrpMet: 0.089 ± 0.091
0.447TrpAsn: 0.447 ± 0.189
0.089TrpPro: 0.089 ± 0.085
0.268TrpGln: 0.268 ± 0.202
0.537TrpArg: 0.537 ± 0.214
0.716TrpSer: 0.716 ± 0.219
0.447TrpThr: 0.447 ± 0.195
1.163TrpVal: 1.163 ± 0.351
0.179TrpTrp: 0.179 ± 0.115
0.358TrpTyr: 0.358 ± 0.12
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.416TyrAla: 2.416 ± 0.435
0.179TyrCys: 0.179 ± 0.168
2.147TyrAsp: 2.147 ± 0.408
2.505TyrGlu: 2.505 ± 0.378
0.984TyrPhe: 0.984 ± 0.341
2.952TyrGly: 2.952 ± 0.744
0.268TyrHis: 0.268 ± 0.149
2.237TyrIle: 2.237 ± 0.515
3.042TyrLys: 3.042 ± 0.676
4.116TyrLeu: 4.116 ± 0.871
0.537TyrMet: 0.537 ± 0.187
2.058TyrAsn: 2.058 ± 0.456
1.163TyrPro: 1.163 ± 0.258
1.61TyrGln: 1.61 ± 0.388
2.505TyrArg: 2.505 ± 0.452
2.595TyrSer: 2.595 ± 0.482
1.968TyrThr: 1.968 ± 0.327
1.968TyrVal: 1.968 ± 0.479
0.089TyrTrp: 0.089 ± 0.09
2.595TyrTyr: 2.595 ± 0.682
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (11178 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski