Amino acid dipeptide frequency for Streptococcus phage P9853

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
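As an illustration of what such a calculation involves, here is a minimal Python sketch. It assumes sequences are given as one-letter-code strings and that each frequency is normalised by the total number of dipeptides counted; the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the exact normalisation used by Proteome-pI may differ.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return the per mille frequency of each ordered adjacent residue pair.

    Assumption: frequencies are normalised by the total number of
    dipeptides counted over all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real phage proteins): MK, KK, KA and MK, KA.
freqs = dipeptide_frequencies(["MKKA", "MKA"])
print(freqs)
```

Because the windows overlap, a protein of length n contributes n - 1 dipeptides, and the order of the two residues matters (AlaCys and CysAla are counted separately).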

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
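The uniform expectation and the per mille convention can be checked with a few lines of Python. This is only a sketch: the names `classify` and `permille_to_fraction` are hypothetical, and comparing each value directly against the 2.5‰ baseline is an assumption about how the red/blue marking is decided (the page may apply a different rule, for example one that takes the error into account).

```python
N_STANDARD = 20  # standard amino acids; Xaa is left out of the baseline

# 1 / 400 expressed in per mille.
expected_permille = 1000.0 / N_STANDARD ** 2


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"   # would be marked red
    if observed_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


print(expected_permille)                        # uniform baseline in per mille
print(classify(6.651))                          # AlaAla value from the table
print(classify(0.094))                          # CysCys value from the table
print(permille_to_fraction(expected_permille))  # baseline as a fraction
```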

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± its estimated error, in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.651 ± 2.561 | 0.375 ± 0.271 | 4.965 ± 1.108 | 4.403 ± 0.696 | 2.623 ± 1.295 | 5.621 ± 1.561 | 0.749 ± 0.239 | 6.651 ± 1.921 | 4.403 ± 0.712 | 7.119 ± 1.634 | 2.529 ± 1.089 | 4.215 ± 0.813 | 2.155 ± 0.425 | 2.904 ± 1.397 | 3.091 ± 0.625 | 6.276 ± 1.782 | 5.246 ± 1.159 | 4.215 ± 1.189 | 0.656 ± 0.231 | 2.623 ± 0.532 | 0.0 ± 0.0 |
| Cys | 0.187 ± 0.14 | 0.094 ± 0.098 | 0.468 ± 0.25 | 0.468 ± 0.246 | 0.0 ± 0.0 | 0.468 ± 0.256 | 0.187 ± 0.145 | 0.187 ± 0.116 | 0.468 ± 0.24 | 0.281 ± 0.181 | 0.094 ± 0.082 | 0.281 ± 0.159 | 0.187 ± 0.137 | 0.094 ± 0.089 | 0.562 ± 0.315 | 0.562 ± 0.322 | 0.094 ± 0.11 | 0.375 ± 0.174 | 0.094 ± 0.089 | 0.281 ± 0.159 | 0.0 ± 0.0 |
| Asp | 2.998 ± 0.402 | 0.281 ± 0.188 | 4.215 ± 0.58 | 3.841 ± 0.706 | 3.279 ± 0.65 | 6.183 ± 1.419 | 0.281 ± 0.15 | 3.372 ± 0.776 | 4.965 ± 1.117 | 4.215 ± 0.708 | 1.311 ± 0.411 | 3.934 ± 0.714 | 0.937 ± 0.378 | 1.593 ± 0.383 | 2.248 ± 0.386 | 4.403 ± 0.648 | 4.028 ± 0.707 | 4.122 ± 0.835 | 0.843 ± 0.362 | 3.653 ± 0.8 | 0.0 ± 0.0 |
| Glu | 4.309 ± 0.7 | 0.281 ± 0.163 | 2.248 ± 0.442 | 3.091 ± 0.736 | 2.717 ± 0.61 | 3.841 ± 0.574 | 1.218 ± 0.38 | 4.778 ± 0.683 | 4.871 ± 1.111 | 6.464 ± 1.258 | 2.436 ± 0.607 | 3.653 ± 0.581 | 1.78 ± 0.62 | 2.342 ± 0.444 | 3.841 ± 0.842 | 2.248 ± 0.672 | 3.185 ± 0.694 | 5.34 ± 1.013 | 1.03 ± 0.329 | 2.81 ± 0.814 | 0.0 ± 0.0 |
| Phe | 2.436 ± 0.615 | 0.375 ± 0.241 | 2.81 ± 0.572 | 3.747 ± 0.665 | 1.405 ± 0.428 | 3.841 ± 0.7 | 0.468 ± 0.168 | 2.529 ± 0.417 | 4.215 ± 0.514 | 2.248 ± 0.755 | 0.468 ± 0.237 | 2.998 ± 0.562 | 0.562 ± 0.28 | 1.405 ± 0.342 | 1.311 ± 0.388 | 3.279 ± 0.738 | 3.279 ± 0.645 | 2.155 ± 0.628 | 0.656 ± 0.261 | 1.124 ± 0.367 | 0.0 ± 0.0 |
| Gly | 5.34 ± 1.255 | 0.375 ± 0.284 | 3.747 ± 0.47 | 3.091 ± 0.449 | 3.466 ± 0.756 | 3.185 ± 0.596 | 0.656 ± 0.248 | 6.838 ± 2.206 | 7.026 ± 1.082 | 6.183 ± 0.882 | 1.967 ± 0.84 | 3.747 ± 0.512 | 0.937 ± 0.424 | 2.904 ± 0.559 | 3.091 ± 0.781 | 5.34 ± 0.899 | 5.152 ± 1.014 | 4.309 ± 0.603 | 0.843 ± 0.288 | 2.81 ± 0.628 | 0.0 ± 0.0 |
| His | 0.937 ± 0.295 | 0.0 ± 0.0 | 1.03 ± 0.281 | 0.375 ± 0.169 | 0.468 ± 0.232 | 0.749 ± 0.289 | 0.468 ± 0.19 | 1.124 ± 0.34 | 0.843 ± 0.279 | 1.03 ± 0.311 | 0.281 ± 0.174 | 0.281 ± 0.167 | 0.281 ± 0.196 | 0.281 ± 0.174 | 0.562 ± 0.205 | 0.937 ± 0.329 | 0.656 ± 0.223 | 1.124 ± 0.383 | 0.187 ± 0.121 | 0.468 ± 0.191 | 0.0 ± 0.0 |
| Ile | 5.902 ± 1.31 | 0.468 ± 0.251 | 5.152 ± 0.652 | 3.653 ± 0.733 | 1.686 ± 0.393 | 5.246 ± 1.36 | 1.124 ± 0.335 | 3.185 ± 0.844 | 4.684 ± 0.588 | 3.841 ± 0.532 | 2.061 ± 0.462 | 3.747 ± 0.786 | 3.185 ± 0.711 | 2.623 ± 0.522 | 3.185 ± 0.734 | 6.557 ± 1.894 | 4.122 ± 0.594 | 4.403 ± 0.862 | 0.468 ± 0.225 | 3.091 ± 0.693 | 0.0 ± 0.0 |
| Lys | 6.838 ± 0.993 | 0.375 ± 0.192 | 3.841 ± 0.707 | 6.838 ± 1.072 | 2.155 ± 0.484 | 5.152 ± 0.679 | 0.937 ± 0.298 | 4.684 ± 0.71 | 5.902 ± 1.6 | 5.902 ± 0.897 | 1.78 ± 0.511 | 4.122 ± 0.761 | 3.185 ± 0.575 | 2.342 ± 0.603 | 4.403 ± 0.835 | 4.59 ± 0.589 | 5.621 ± 0.89 | 3.841 ± 0.721 | 1.124 ± 0.335 | 3.841 ± 1.1 | 0.0 ± 0.0 |
| Leu | 5.995 ± 0.887 | 0.0 ± 0.0 | 4.684 ± 0.792 | 5.433 ± 1.009 | 2.529 ± 0.442 | 5.714 ± 1.215 | 0.562 ± 0.24 | 4.122 ± 0.584 | 5.621 ± 0.99 | 4.965 ± 0.75 | 1.593 ± 0.364 | 5.34 ± 0.688 | 2.717 ± 0.618 | 2.717 ± 0.523 | 3.279 ± 0.723 | 6.089 ± 0.602 | 5.902 ± 0.979 | 5.059 ± 0.672 | 0.468 ± 0.317 | 2.81 ± 0.546 | 0.0 ± 0.0 |
| Met | 2.81 ± 0.974 | 0.094 ± 0.089 | 1.03 ± 0.262 | 1.124 ± 0.369 | 1.124 ± 0.286 | 1.03 ± 0.393 | 0.281 ± 0.197 | 1.405 ± 0.399 | 1.967 ± 0.448 | 1.405 ± 0.329 | 1.218 ± 0.584 | 1.218 ± 0.329 | 0.937 ± 0.332 | 1.593 ± 0.468 | 1.218 ± 0.367 | 1.967 ± 0.605 | 1.218 ± 0.295 | 2.342 ± 0.453 | 0.0 ± 0.0 | 1.03 ± 0.38 | 0.0 ± 0.0 |
| Asn | 4.403 ± 0.599 | 0.375 ± 0.191 | 3.372 ± 0.701 | 4.215 ± 0.92 | 2.248 ± 0.583 | 5.808 ± 0.801 | 1.218 ± 0.479 | 3.185 ± 0.539 | 4.028 ± 0.704 | 3.466 ± 0.601 | 1.124 ± 0.341 | 3.279 ± 0.694 | 2.717 ± 0.627 | 1.499 ± 0.334 | 1.967 ± 0.576 | 3.185 ± 0.649 | 3.747 ± 0.834 | 2.81 ± 0.495 | 1.405 ± 0.343 | 2.155 ± 0.562 | 0.0 ± 0.0 |
| Pro | 1.311 ± 0.34 | 0.187 ± 0.196 | 1.967 ± 0.603 | 1.78 ± 0.562 | 1.03 ± 0.256 | 1.593 ± 0.463 | 0.187 ± 0.112 | 1.593 ± 0.442 | 3.279 ± 0.591 | 2.155 ± 0.558 | 0.187 ± 0.136 | 2.155 ± 0.512 | 1.03 ± 0.44 | 1.218 ± 0.379 | 1.686 ± 0.448 | 2.717 ± 0.54 | 1.967 ± 0.523 | 2.061 ± 0.547 | 0.375 ± 0.165 | 1.124 ± 0.348 | 0.0 ± 0.0 |
| Gln | 3.841 ± 1.217 | 0.187 ± 0.118 | 1.874 ± 0.486 | 2.529 ± 0.636 | 2.061 ± 0.4 | 2.623 ± 0.796 | 0.187 ± 0.153 | 2.342 ± 0.599 | 2.342 ± 0.437 | 3.934 ± 0.462 | 1.124 ± 0.374 | 1.499 ± 0.311 | 0.749 ± 0.272 | 1.218 ± 0.346 | 1.124 ± 0.365 | 2.623 ± 0.705 | 2.998 ± 0.431 | 2.529 ± 0.424 | 0.468 ± 0.204 | 1.311 ± 0.395 | 0.0 ± 0.0 |
| Arg | 3.279 ± 0.571 | 0.749 ± 0.293 | 2.436 ± 0.445 | 2.904 ± 0.74 | 1.686 ± 0.468 | 2.904 ± 0.361 | 0.468 ± 0.227 | 3.185 ± 0.688 | 3.372 ± 0.744 | 3.466 ± 0.488 | 1.499 ± 0.328 | 1.686 ± 0.426 | 0.937 ± 0.252 | 1.311 ± 0.421 | 1.78 ± 0.564 | 2.342 ± 0.402 | 2.248 ± 0.593 | 2.529 ± 0.629 | 0.937 ± 0.337 | 2.81 ± 0.548 | 0.0 ± 0.0 |
| Ser | 6.745 ± 3.454 | 0.375 ± 0.191 | 4.59 ± 0.79 | 3.372 ± 0.778 | 3.841 ± 0.645 | 5.059 ± 0.629 | 0.937 ± 0.317 | 5.246 ± 0.977 | 5.527 ± 0.88 | 4.965 ± 0.913 | 1.593 ± 0.335 | 3.653 ± 0.576 | 1.78 ± 0.473 | 3.56 ± 1.007 | 1.874 ± 0.422 | 4.309 ± 1.013 | 4.871 ± 0.716 | 6.183 ± 0.915 | 0.937 ± 0.365 | 1.78 ± 0.458 | 0.0 ± 0.0 |
| Thr | 5.527 ± 2.084 | 0.187 ± 0.134 | 3.372 ± 0.572 | 3.185 ± 0.755 | 4.496 ± 0.589 | 4.496 ± 0.849 | 0.937 ± 0.258 | 6.464 ± 1.187 | 5.527 ± 0.804 | 5.152 ± 0.768 | 1.218 ± 0.767 | 3.091 ± 0.586 | 2.248 ± 0.427 | 2.81 ± 0.541 | 1.593 ± 0.428 | 4.122 ± 0.951 | 4.028 ± 0.757 | 5.714 ± 0.819 | 0.375 ± 0.221 | 2.248 ± 0.626 | 0.0 ± 0.0 |
| Val | 4.403 ± 0.985 | 0.281 ± 0.17 | 5.246 ± 1.01 | 5.059 ± 0.857 | 2.717 ± 0.63 | 4.215 ± 0.638 | 0.468 ± 0.19 | 4.59 ± 0.651 | 5.246 ± 0.616 | 4.59 ± 0.501 | 1.218 ± 0.371 | 4.309 ± 0.819 | 2.061 ± 0.371 | 2.436 ± 0.717 | 2.248 ± 0.481 | 6.37 ± 0.837 | 4.496 ± 0.847 | 4.778 ± 0.758 | 0.749 ± 0.224 | 1.78 ± 0.502 | 0.0 ± 0.0 |
| Trp | 0.656 ± 0.226 | 0.094 ± 0.097 | 0.749 ± 0.362 | 1.218 ± 0.298 | 0.468 ± 0.223 | 0.843 ± 0.303 | 0.094 ± 0.097 | 0.562 ± 0.2 | 0.843 ± 0.205 | 0.843 ± 0.263 | 0.375 ± 0.172 | 0.749 ± 0.35 | 0.094 ± 0.11 | 0.656 ± 0.252 | 0.468 ± 0.228 | 1.124 ± 0.452 | 0.937 ± 0.286 | 0.843 ± 0.226 | 0.281 ± 0.191 | 0.281 ± 0.149 | 0.0 ± 0.0 |
| Tyr | 2.904 ± 0.432 | 0.281 ± 0.153 | 2.904 ± 0.763 | 2.155 ± 0.546 | 1.311 ± 0.374 | 2.436 ± 0.57 | 0.656 ± 0.229 | 2.436 ± 0.628 | 2.342 ± 0.572 | 3.372 ± 0.642 | 0.843 ± 0.287 | 2.248 ± 0.513 | 1.124 ± 0.375 | 2.061 ± 0.487 | 2.998 ± 0.819 | 2.155 ± 0.526 | 2.998 ± 0.929 | 2.529 ± 0.542 | 0.281 ± 0.161 | 1.686 ± 0.517 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 45 proteins (10676 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
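A protein-level bootstrap of this kind might look roughly like the Python sketch below. It is an assumption-laden illustration, not the database's actual procedure: it resamples whole proteins with replacement, recomputes the frequency of one dipeptide in each replicate, and reports the sample standard deviation. The names `pair_permille` and `bootstrap_error`, the one-letter toy sequences, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes.

    Assumption: each round draws len(proteins) whole proteins with
    replacement, so residues within a protein stay together.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code (not real phage proteins).
toy = ["MKKAL", "MAAKK", "MKLAA", "MLLKA"]
err = bootstrap_error(toy, "KK")
print(err)
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is presumably why proteins with unusual composition can produce large errors for some dipeptides.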

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski