Amino acid dipepetide frequency for Streptococcus phage SW14

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.643AlaAla: 6.643 ± 2.613
0.185AlaCys: 0.185 ± 0.133
4.705AlaAsp: 4.705 ± 0.852
3.875AlaGlu: 3.875 ± 0.7
2.583AlaPhe: 2.583 ± 1.087
5.812AlaGly: 5.812 ± 1.244
0.83AlaHis: 0.83 ± 0.276
6.366AlaIle: 6.366 ± 1.487
5.167AlaLys: 5.167 ± 0.786
6.458AlaLeu: 6.458 ± 1.084
2.676AlaMet: 2.676 ± 1.156
4.428AlaAsn: 4.428 ± 0.809
2.676AlaPro: 2.676 ± 0.533
3.321AlaGln: 3.321 ± 1.105
3.229AlaArg: 3.229 ± 0.66
6.181AlaSer: 6.181 ± 1.433
4.244AlaThr: 4.244 ± 0.958
4.428AlaVal: 4.428 ± 1.32
0.554AlaTrp: 0.554 ± 0.18
2.583AlaTyr: 2.583 ± 0.522
0.0AlaXaa: 0.0 ± 0.0
Cys
0.185CysAla: 0.185 ± 0.128
0.0CysCys: 0.0 ± 0.0
0.554CysAsp: 0.554 ± 0.269
0.461CysGlu: 0.461 ± 0.208
0.092CysPhe: 0.092 ± 0.112
0.461CysGly: 0.461 ± 0.255
0.185CysHis: 0.185 ± 0.145
0.185CysIle: 0.185 ± 0.118
0.554CysLys: 0.554 ± 0.246
0.185CysLeu: 0.185 ± 0.212
0.092CysMet: 0.092 ± 0.097
0.185CysAsn: 0.185 ± 0.117
0.277CysPro: 0.277 ± 0.142
0.092CysGln: 0.092 ± 0.092
0.185CysArg: 0.185 ± 0.122
0.646CysSer: 0.646 ± 0.296
0.0CysThr: 0.0 ± 0.0
0.277CysVal: 0.277 ± 0.146
0.092CysTrp: 0.092 ± 0.088
0.277CysTyr: 0.277 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
3.045AspAla: 3.045 ± 0.493
0.554AspCys: 0.554 ± 0.291
3.967AspAsp: 3.967 ± 0.548
3.69AspGlu: 3.69 ± 0.624
3.229AspPhe: 3.229 ± 0.653
6.643AspGly: 6.643 ± 1.367
0.554AspHis: 0.554 ± 0.276
2.952AspIle: 2.952 ± 0.543
3.967AspLys: 3.967 ± 0.76
4.521AspLeu: 4.521 ± 0.803
1.476AspMet: 1.476 ± 0.422
3.967AspAsn: 3.967 ± 0.762
0.83AspPro: 0.83 ± 0.301
1.476AspGln: 1.476 ± 0.381
2.768AspArg: 2.768 ± 0.587
4.244AspSer: 4.244 ± 0.747
3.875AspThr: 3.875 ± 0.74
4.428AspVal: 4.428 ± 0.781
1.107AspTrp: 1.107 ± 0.365
3.321AspTyr: 3.321 ± 0.674
0.0AspXaa: 0.0 ± 0.0
Glu
4.428GluAla: 4.428 ± 0.771
0.185GluCys: 0.185 ± 0.131
1.937GluAsp: 1.937 ± 0.462
4.521GluGlu: 4.521 ± 0.979
2.86GluPhe: 2.86 ± 0.569
3.598GluGly: 3.598 ± 0.553
1.107GluHis: 1.107 ± 0.298
5.628GluIle: 5.628 ± 0.959
4.059GluLys: 4.059 ± 0.789
7.196GluLeu: 7.196 ± 1.154
2.03GluMet: 2.03 ± 0.489
4.336GluAsn: 4.336 ± 0.625
2.03GluPro: 2.03 ± 0.645
2.768GluGln: 2.768 ± 0.59
4.152GluArg: 4.152 ± 0.765
2.122GluSer: 2.122 ± 0.613
3.045GluThr: 3.045 ± 0.667
5.351GluVal: 5.351 ± 0.949
1.199GluTrp: 1.199 ± 0.339
3.598GluTyr: 3.598 ± 0.925
0.0GluXaa: 0.0 ± 0.0
Phe
2.491PheAla: 2.491 ± 0.433
0.277PheCys: 0.277 ± 0.173
2.952PheAsp: 2.952 ± 0.627
3.414PheGlu: 3.414 ± 0.613
1.292PhePhe: 1.292 ± 0.323
3.69PheGly: 3.69 ± 0.733
0.369PheHis: 0.369 ± 0.177
2.214PheIle: 2.214 ± 0.419
4.89PheLys: 4.89 ± 0.492
2.122PheLeu: 2.122 ± 0.633
0.646PheMet: 0.646 ± 0.282
2.952PheAsn: 2.952 ± 0.492
0.461PhePro: 0.461 ± 0.226
1.107PheGln: 1.107 ± 0.289
1.015PheArg: 1.015 ± 0.313
3.414PheSer: 3.414 ± 0.875
2.768PheThr: 2.768 ± 0.636
2.214PheVal: 2.214 ± 0.449
0.554PheTrp: 0.554 ± 0.258
1.292PheTyr: 1.292 ± 0.393
0.0PheXaa: 0.0 ± 0.0
Gly
4.982GlyAla: 4.982 ± 1.004
0.461GlyCys: 0.461 ± 0.224
3.875GlyAsp: 3.875 ± 0.543
3.414GlyGlu: 3.414 ± 0.627
3.137GlyPhe: 3.137 ± 0.599
3.321GlyGly: 3.321 ± 0.591
0.461GlyHis: 0.461 ± 0.232
7.658GlyIle: 7.658 ± 1.692
6.55GlyLys: 6.55 ± 0.884
6.55GlyLeu: 6.55 ± 0.938
2.03GlyMet: 2.03 ± 0.741
3.321GlyAsn: 3.321 ± 0.505
1.292GlyPro: 1.292 ± 0.536
3.045GlyGln: 3.045 ± 0.487
3.137GlyArg: 3.137 ± 0.635
4.613GlySer: 4.613 ± 0.83
5.259GlyThr: 5.259 ± 0.981
3.967GlyVal: 3.967 ± 0.651
1.015GlyTrp: 1.015 ± 0.35
3.137GlyTyr: 3.137 ± 0.537
0.0GlyXaa: 0.0 ± 0.0
His
0.646HisAla: 0.646 ± 0.261
0.0HisCys: 0.0 ± 0.0
1.199HisAsp: 1.199 ± 0.273
0.646HisGlu: 0.646 ± 0.22
0.83HisPhe: 0.83 ± 0.232
0.738HisGly: 0.738 ± 0.279
0.554HisHis: 0.554 ± 0.226
1.015HisIle: 1.015 ± 0.326
0.738HisLys: 0.738 ± 0.222
0.738HisLeu: 0.738 ± 0.247
0.461HisMet: 0.461 ± 0.206
0.554HisAsn: 0.554 ± 0.272
0.461HisPro: 0.461 ± 0.208
0.369HisGln: 0.369 ± 0.193
0.646HisArg: 0.646 ± 0.219
0.83HisSer: 0.83 ± 0.282
0.83HisThr: 0.83 ± 0.269
1.199HisVal: 1.199 ± 0.407
0.185HisTrp: 0.185 ± 0.156
0.369HisTyr: 0.369 ± 0.178
0.0HisXaa: 0.0 ± 0.0
Ile
5.443IleAla: 5.443 ± 1.073
0.554IleCys: 0.554 ± 0.267
4.797IleAsp: 4.797 ± 0.691
4.613IleGlu: 4.613 ± 0.588
1.568IlePhe: 1.568 ± 0.393
5.074IleGly: 5.074 ± 1.158
0.923IleHis: 0.923 ± 0.281
3.967IleIle: 3.967 ± 0.826
5.536IleLys: 5.536 ± 0.485
3.414IleLeu: 3.414 ± 0.511
1.845IleMet: 1.845 ± 0.383
3.598IleAsn: 3.598 ± 0.687
2.86IlePro: 2.86 ± 0.61
2.952IleGln: 2.952 ± 0.441
2.86IleArg: 2.86 ± 0.692
6.458IleSer: 6.458 ± 1.335
4.613IleThr: 4.613 ± 0.883
4.152IleVal: 4.152 ± 0.742
0.461IleTrp: 0.461 ± 0.193
2.86IleTyr: 2.86 ± 0.631
0.0IleXaa: 0.0 ± 0.0
Lys
6.919LysAla: 6.919 ± 0.951
0.185LysCys: 0.185 ± 0.142
3.875LysAsp: 3.875 ± 0.597
7.381LysGlu: 7.381 ± 1.291
2.214LysPhe: 2.214 ± 0.58
5.72LysGly: 5.72 ± 0.684
1.384LysHis: 1.384 ± 0.421
3.967LysIle: 3.967 ± 0.827
5.443LysLys: 5.443 ± 1.32
5.628LysLeu: 5.628 ± 0.897
1.661LysMet: 1.661 ± 0.439
3.783LysAsn: 3.783 ± 0.731
3.69LysPro: 3.69 ± 0.557
2.214LysGln: 2.214 ± 0.473
4.613LysArg: 4.613 ± 0.896
4.428LysSer: 4.428 ± 0.575
5.72LysThr: 5.72 ± 0.778
3.967LysVal: 3.967 ± 0.661
1.015LysTrp: 1.015 ± 0.304
3.321LysTyr: 3.321 ± 0.713
0.0LysXaa: 0.0 ± 0.0
Leu
7.012LeuAla: 7.012 ± 1.02
0.277LeuCys: 0.277 ± 0.166
4.336LeuAsp: 4.336 ± 0.686
5.812LeuGlu: 5.812 ± 0.958
2.306LeuPhe: 2.306 ± 0.445
5.074LeuGly: 5.074 ± 1.061
0.646LeuHis: 0.646 ± 0.273
4.152LeuIle: 4.152 ± 0.464
6.089LeuLys: 6.089 ± 0.684
5.074LeuLeu: 5.074 ± 0.867
1.845LeuMet: 1.845 ± 0.404
5.997LeuAsn: 5.997 ± 0.674
2.952LeuPro: 2.952 ± 0.514
2.491LeuGln: 2.491 ± 0.449
2.86LeuArg: 2.86 ± 0.729
5.997LeuSer: 5.997 ± 0.789
5.905LeuThr: 5.905 ± 0.88
4.705LeuVal: 4.705 ± 0.526
0.554LeuTrp: 0.554 ± 0.312
2.676LeuTyr: 2.676 ± 0.563
0.0LeuXaa: 0.0 ± 0.0
Met
2.86MetAla: 2.86 ± 0.914
0.0MetCys: 0.0 ± 0.0
0.738MetAsp: 0.738 ± 0.226
1.476MetGlu: 1.476 ± 0.447
1.292MetPhe: 1.292 ± 0.336
1.292MetGly: 1.292 ± 0.388
0.185MetHis: 0.185 ± 0.133
1.199MetIle: 1.199 ± 0.335
2.122MetLys: 2.122 ± 0.54
1.568MetLeu: 1.568 ± 0.404
0.83MetMet: 0.83 ± 0.527
1.292MetAsn: 1.292 ± 0.327
0.83MetPro: 0.83 ± 0.274
1.384MetGln: 1.384 ± 0.529
0.923MetArg: 0.923 ± 0.287
1.753MetSer: 1.753 ± 0.552
1.568MetThr: 1.568 ± 0.35
2.306MetVal: 2.306 ± 0.597
0.0MetTrp: 0.0 ± 0.0
0.738MetTyr: 0.738 ± 0.27
0.0MetXaa: 0.0 ± 0.0
Asn
4.336AsnAla: 4.336 ± 0.554
0.277AsnCys: 0.277 ± 0.172
4.059AsnAsp: 4.059 ± 0.722
4.244AsnGlu: 4.244 ± 0.867
2.122AsnPhe: 2.122 ± 0.48
5.443AsnGly: 5.443 ± 1.069
1.568AsnHis: 1.568 ± 0.501
3.321AsnIle: 3.321 ± 0.578
3.69AsnLys: 3.69 ± 0.696
4.059AsnLeu: 4.059 ± 0.52
1.015AsnMet: 1.015 ± 0.337
2.676AsnAsn: 2.676 ± 0.513
3.045AsnPro: 3.045 ± 0.706
1.753AsnGln: 1.753 ± 0.394
2.214AsnArg: 2.214 ± 0.565
2.676AsnSer: 2.676 ± 0.618
3.137AsnThr: 3.137 ± 0.677
3.045AsnVal: 3.045 ± 0.477
1.753AsnTrp: 1.753 ± 0.453
1.845AsnTyr: 1.845 ± 0.415
0.0AsnXaa: 0.0 ± 0.0
Pro
1.476ProAla: 1.476 ± 0.346
0.092ProCys: 0.092 ± 0.088
2.122ProAsp: 2.122 ± 0.517
1.568ProGlu: 1.568 ± 0.482
0.923ProPhe: 0.923 ± 0.242
1.476ProGly: 1.476 ± 0.433
0.277ProHis: 0.277 ± 0.135
1.753ProIle: 1.753 ± 0.405
3.783ProLys: 3.783 ± 0.588
2.583ProLeu: 2.583 ± 0.471
0.092ProMet: 0.092 ± 0.087
1.845ProAsn: 1.845 ± 0.484
1.384ProPro: 1.384 ± 0.383
1.568ProGln: 1.568 ± 0.362
1.568ProArg: 1.568 ± 0.546
2.491ProSer: 2.491 ± 0.465
2.03ProThr: 2.03 ± 0.592
2.03ProVal: 2.03 ± 0.463
0.277ProTrp: 0.277 ± 0.133
1.107ProTyr: 1.107 ± 0.368
0.0ProXaa: 0.0 ± 0.0
Gln
3.69GlnAla: 3.69 ± 1.174
0.277GlnCys: 0.277 ± 0.155
1.937GlnAsp: 1.937 ± 0.39
2.583GlnGlu: 2.583 ± 0.693
2.214GlnPhe: 2.214 ± 0.44
2.768GlnGly: 2.768 ± 0.801
0.554GlnHis: 0.554 ± 0.212
2.03GlnIle: 2.03 ± 0.458
2.491GlnLys: 2.491 ± 0.48
3.598GlnLeu: 3.598 ± 0.512
1.015GlnMet: 1.015 ± 0.326
1.661GlnAsn: 1.661 ± 0.45
0.646GlnPro: 0.646 ± 0.23
1.384GlnGln: 1.384 ± 0.349
1.199GlnArg: 1.199 ± 0.323
3.137GlnSer: 3.137 ± 0.605
2.583GlnThr: 2.583 ± 0.445
2.676GlnVal: 2.676 ± 0.475
0.369GlnTrp: 0.369 ± 0.177
1.292GlnTyr: 1.292 ± 0.414
0.0GlnXaa: 0.0 ± 0.0
Arg
3.783ArgAla: 3.783 ± 0.507
0.461ArgCys: 0.461 ± 0.235
2.491ArgAsp: 2.491 ± 0.54
2.86ArgGlu: 2.86 ± 0.604
1.568ArgPhe: 1.568 ± 0.436
3.045ArgGly: 3.045 ± 0.378
0.554ArgHis: 0.554 ± 0.257
3.229ArgIle: 3.229 ± 0.701
3.321ArgLys: 3.321 ± 0.764
3.783ArgLeu: 3.783 ± 0.564
1.384ArgMet: 1.384 ± 0.39
1.845ArgAsn: 1.845 ± 0.36
1.199ArgPro: 1.199 ± 0.365
1.568ArgGln: 1.568 ± 0.351
1.753ArgArg: 1.753 ± 0.49
1.937ArgSer: 1.937 ± 0.38
2.122ArgThr: 2.122 ± 0.615
2.952ArgVal: 2.952 ± 0.717
0.923ArgTrp: 0.923 ± 0.308
2.306ArgTyr: 2.306 ± 0.482
0.0ArgXaa: 0.0 ± 0.0
Ser
5.72SerAla: 5.72 ± 2.527
0.369SerCys: 0.369 ± 0.164
4.521SerAsp: 4.521 ± 0.847
3.967SerGlu: 3.967 ± 0.783
3.137SerPhe: 3.137 ± 0.679
4.797SerGly: 4.797 ± 0.569
0.83SerHis: 0.83 ± 0.271
5.997SerIle: 5.997 ± 0.997
4.521SerLys: 4.521 ± 0.848
4.705SerLeu: 4.705 ± 0.773
1.753SerMet: 1.753 ± 0.324
3.69SerAsn: 3.69 ± 0.672
1.292SerPro: 1.292 ± 0.359
4.152SerGln: 4.152 ± 1.093
1.753SerArg: 1.753 ± 0.424
4.982SerSer: 4.982 ± 1.107
4.705SerThr: 4.705 ± 0.969
6.181SerVal: 6.181 ± 0.832
0.738SerTrp: 0.738 ± 0.275
1.753SerTyr: 1.753 ± 0.399
0.0SerXaa: 0.0 ± 0.0
Thr
4.89ThrAla: 4.89 ± 1.59
0.277ThrCys: 0.277 ± 0.176
2.952ThrAsp: 2.952 ± 0.627
3.783ThrGlu: 3.783 ± 0.597
4.059ThrPhe: 4.059 ± 0.505
4.89ThrGly: 4.89 ± 0.849
1.107ThrHis: 1.107 ± 0.321
5.351ThrIle: 5.351 ± 0.829
5.351ThrLys: 5.351 ± 0.848
5.628ThrLeu: 5.628 ± 0.819
1.568ThrMet: 1.568 ± 0.888
2.768ThrAsn: 2.768 ± 0.388
1.568ThrPro: 1.568 ± 0.417
2.122ThrGln: 2.122 ± 0.415
2.03ThrArg: 2.03 ± 0.386
3.69ThrSer: 3.69 ± 0.857
3.967ThrThr: 3.967 ± 0.682
5.443ThrVal: 5.443 ± 0.736
0.369ThrTrp: 0.369 ± 0.327
2.306ThrTyr: 2.306 ± 0.685
0.0ThrXaa: 0.0 ± 0.0
Val
4.613ValAla: 4.613 ± 0.931
0.185ValCys: 0.185 ± 0.14
5.812ValAsp: 5.812 ± 0.882
4.613ValGlu: 4.613 ± 0.917
2.491ValPhe: 2.491 ± 0.419
4.152ValGly: 4.152 ± 0.568
0.461ValHis: 0.461 ± 0.185
4.705ValIle: 4.705 ± 0.774
5.351ValLys: 5.351 ± 0.655
4.982ValLeu: 4.982 ± 0.693
1.015ValMet: 1.015 ± 0.238
4.705ValAsn: 4.705 ± 0.872
1.845ValPro: 1.845 ± 0.449
2.122ValGln: 2.122 ± 0.611
2.491ValArg: 2.491 ± 0.437
5.997ValSer: 5.997 ± 0.734
4.705ValThr: 4.705 ± 0.842
4.797ValVal: 4.797 ± 0.676
0.738ValTrp: 0.738 ± 0.242
1.753ValTyr: 1.753 ± 0.396
0.0ValXaa: 0.0 ± 0.0
Trp
0.461TrpAla: 0.461 ± 0.181
0.092TrpCys: 0.092 ± 0.099
0.646TrpAsp: 0.646 ± 0.387
0.923TrpGlu: 0.923 ± 0.321
0.83TrpPhe: 0.83 ± 0.293
0.646TrpGly: 0.646 ± 0.246
0.092TrpHis: 0.092 ± 0.099
0.369TrpIle: 0.369 ± 0.201
0.923TrpLys: 0.923 ± 0.243
0.923TrpLeu: 0.923 ± 0.269
0.185TrpMet: 0.185 ± 0.123
1.015TrpAsn: 1.015 ± 0.424
0.0TrpPro: 0.0 ± 0.0
0.554TrpGln: 0.554 ± 0.195
0.554TrpArg: 0.554 ± 0.236
1.661TrpSer: 1.661 ± 0.687
0.83TrpThr: 0.83 ± 0.369
1.199TrpVal: 1.199 ± 0.28
0.369TrpTrp: 0.369 ± 0.214
0.277TrpTyr: 0.277 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.321TyrAla: 3.321 ± 0.515
0.277TyrCys: 0.277 ± 0.143
3.137TyrAsp: 3.137 ± 0.83
2.306TyrGlu: 2.306 ± 0.498
1.384TyrPhe: 1.384 ± 0.432
2.583TyrGly: 2.583 ± 0.524
0.369TyrHis: 0.369 ± 0.201
2.491TyrIle: 2.491 ± 0.507
2.583TyrLys: 2.583 ± 0.564
3.137TyrLeu: 3.137 ± 0.72
0.646TyrMet: 0.646 ± 0.206
1.661TyrAsn: 1.661 ± 0.424
1.107TyrPro: 1.107 ± 0.353
1.661TyrGln: 1.661 ± 0.416
3.137TyrArg: 3.137 ± 0.776
2.306TyrSer: 2.306 ± 0.537
2.214TyrThr: 2.214 ± 0.604
2.214TyrVal: 2.214 ± 0.429
0.277TyrTrp: 0.277 ± 0.141
1.661TyrTyr: 1.661 ± 0.577
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10840 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski