Amino acid dipepetide frequency for Klebsiella phage vB_KleS-HSE3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.172AlaAla: 13.172 ± 1.061
0.783AlaCys: 0.783 ± 0.199
6.717AlaAsp: 6.717 ± 0.712
6.391AlaGlu: 6.391 ± 0.707
3.065AlaPhe: 3.065 ± 0.372
8.738AlaGly: 8.738 ± 0.615
1.5AlaHis: 1.5 ± 0.314
6.26AlaIle: 6.26 ± 0.742
4.173AlaLys: 4.173 ± 0.709
5.999AlaLeu: 5.999 ± 0.611
2.608AlaMet: 2.608 ± 0.506
6.26AlaAsn: 6.26 ± 0.824
3.13AlaPro: 3.13 ± 0.477
4.5AlaGln: 4.5 ± 0.801
5.152AlaArg: 5.152 ± 0.635
6.325AlaSer: 6.325 ± 0.722
5.347AlaThr: 5.347 ± 0.561
7.173AlaVal: 7.173 ± 0.588
1.435AlaTrp: 1.435 ± 0.274
2.739AlaTyr: 2.739 ± 0.334
0.0AlaXaa: 0.0 ± 0.0
Cys
0.587CysAla: 0.587 ± 0.23
0.13CysCys: 0.13 ± 0.086
0.652CysAsp: 0.652 ± 0.187
0.391CysGlu: 0.391 ± 0.161
0.391CysPhe: 0.391 ± 0.16
1.304CysGly: 1.304 ± 0.305
0.326CysHis: 0.326 ± 0.147
0.522CysIle: 0.522 ± 0.183
0.652CysLys: 0.652 ± 0.252
0.783CysLeu: 0.783 ± 0.225
0.456CysMet: 0.456 ± 0.192
0.717CysAsn: 0.717 ± 0.185
0.522CysPro: 0.522 ± 0.163
0.587CysGln: 0.587 ± 0.194
1.239CysArg: 1.239 ± 0.304
0.848CysSer: 0.848 ± 0.211
0.261CysThr: 0.261 ± 0.123
0.717CysVal: 0.717 ± 0.236
0.326CysTrp: 0.326 ± 0.146
0.456CysTyr: 0.456 ± 0.201
0.0CysXaa: 0.0 ± 0.0
Asp
7.304AspAla: 7.304 ± 0.673
0.326AspCys: 0.326 ± 0.145
4.108AspAsp: 4.108 ± 0.556
3.456AspGlu: 3.456 ± 0.653
2.674AspPhe: 2.674 ± 0.414
5.347AspGly: 5.347 ± 0.789
0.978AspHis: 0.978 ± 0.203
3.326AspIle: 3.326 ± 0.371
2.217AspLys: 2.217 ± 0.395
5.673AspLeu: 5.673 ± 0.495
1.63AspMet: 1.63 ± 0.335
2.674AspAsn: 2.674 ± 0.462
2.087AspPro: 2.087 ± 0.446
2.022AspGln: 2.022 ± 0.416
3.782AspArg: 3.782 ± 0.528
3.978AspSer: 3.978 ± 0.582
2.543AspThr: 2.543 ± 0.37
4.173AspVal: 4.173 ± 0.58
0.978AspTrp: 0.978 ± 0.22
2.217AspTyr: 2.217 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
6.13GluAla: 6.13 ± 0.721
0.587GluCys: 0.587 ± 0.186
3.782GluAsp: 3.782 ± 0.689
3.326GluGlu: 3.326 ± 0.749
1.695GluPhe: 1.695 ± 0.396
3.652GluGly: 3.652 ± 0.597
0.848GluHis: 0.848 ± 0.273
4.434GluIle: 4.434 ± 0.399
2.869GluLys: 2.869 ± 0.668
4.956GluLeu: 4.956 ± 0.688
1.5GluMet: 1.5 ± 0.365
1.956GluAsn: 1.956 ± 0.446
2.087GluPro: 2.087 ± 0.526
2.739GluGln: 2.739 ± 0.506
3.521GluArg: 3.521 ± 0.469
2.478GluSer: 2.478 ± 0.424
2.739GluThr: 2.739 ± 0.43
2.739GluVal: 2.739 ± 0.372
0.456GluTrp: 0.456 ± 0.163
1.761GluTyr: 1.761 ± 0.267
0.0GluXaa: 0.0 ± 0.0
Phe
3.0PheAla: 3.0 ± 0.462
0.456PheCys: 0.456 ± 0.162
2.869PheAsp: 2.869 ± 0.358
1.956PheGlu: 1.956 ± 0.353
1.174PhePhe: 1.174 ± 0.27
3.13PheGly: 3.13 ± 0.455
0.587PheHis: 0.587 ± 0.17
2.152PheIle: 2.152 ± 0.359
2.087PheLys: 2.087 ± 0.39
2.413PheLeu: 2.413 ± 0.332
0.717PheMet: 0.717 ± 0.228
2.674PheAsn: 2.674 ± 0.292
1.174PhePro: 1.174 ± 0.309
1.174PheGln: 1.174 ± 0.208
2.413PheArg: 2.413 ± 0.365
2.217PheSer: 2.217 ± 0.379
2.152PheThr: 2.152 ± 0.376
2.348PheVal: 2.348 ± 0.383
0.783PheTrp: 0.783 ± 0.267
0.717PheTyr: 0.717 ± 0.233
0.0PheXaa: 0.0 ± 0.0
Gly
6.065GlyAla: 6.065 ± 0.679
1.304GlyCys: 1.304 ± 0.29
4.434GlyAsp: 4.434 ± 0.456
4.239GlyGlu: 4.239 ± 0.62
2.739GlyPhe: 2.739 ± 0.439
7.304GlyGly: 7.304 ± 0.878
1.239GlyHis: 1.239 ± 0.316
4.369GlyIle: 4.369 ± 0.549
5.021GlyLys: 5.021 ± 0.781
5.934GlyLeu: 5.934 ± 0.666
2.282GlyMet: 2.282 ± 0.368
3.847GlyAsn: 3.847 ± 0.589
1.435GlyPro: 1.435 ± 0.326
4.956GlyGln: 4.956 ± 0.477
5.412GlyArg: 5.412 ± 0.585
4.369GlySer: 4.369 ± 0.688
5.217GlyThr: 5.217 ± 0.907
5.869GlyVal: 5.869 ± 0.595
1.304GlyTrp: 1.304 ± 0.368
2.478GlyTyr: 2.478 ± 0.385
0.0GlyXaa: 0.0 ± 0.0
His
1.826HisAla: 1.826 ± 0.337
0.391HisCys: 0.391 ± 0.19
0.783HisAsp: 0.783 ± 0.262
0.913HisGlu: 0.913 ± 0.279
0.522HisPhe: 0.522 ± 0.175
0.783HisGly: 0.783 ± 0.219
0.587HisHis: 0.587 ± 0.202
0.848HisIle: 0.848 ± 0.26
0.456HisLys: 0.456 ± 0.183
1.5HisLeu: 1.5 ± 0.332
0.587HisMet: 0.587 ± 0.203
0.587HisAsn: 0.587 ± 0.253
0.848HisPro: 0.848 ± 0.239
0.326HisGln: 0.326 ± 0.129
1.63HisArg: 1.63 ± 0.412
0.848HisSer: 0.848 ± 0.21
0.848HisThr: 0.848 ± 0.233
1.304HisVal: 1.304 ± 0.298
0.326HisTrp: 0.326 ± 0.141
0.717HisTyr: 0.717 ± 0.184
0.0HisXaa: 0.0 ± 0.0
Ile
5.217IleAla: 5.217 ± 0.611
0.652IleCys: 0.652 ± 0.233
4.239IleAsp: 4.239 ± 0.549
3.065IleGlu: 3.065 ± 0.474
2.348IlePhe: 2.348 ± 0.367
4.173IleGly: 4.173 ± 0.548
1.043IleHis: 1.043 ± 0.372
3.0IleIle: 3.0 ± 0.47
3.195IleLys: 3.195 ± 0.339
4.108IleLeu: 4.108 ± 0.494
1.239IleMet: 1.239 ± 0.331
3.587IleAsn: 3.587 ± 0.382
2.934IlePro: 2.934 ± 0.522
2.217IleGln: 2.217 ± 0.384
2.804IleArg: 2.804 ± 0.346
3.717IleSer: 3.717 ± 0.45
5.804IleThr: 5.804 ± 1.072
3.456IleVal: 3.456 ± 0.536
0.587IleTrp: 0.587 ± 0.152
1.695IleTyr: 1.695 ± 0.337
0.0IleXaa: 0.0 ± 0.0
Lys
4.695LysAla: 4.695 ± 0.443
0.456LysCys: 0.456 ± 0.179
2.674LysAsp: 2.674 ± 0.495
2.543LysGlu: 2.543 ± 0.534
1.695LysPhe: 1.695 ± 0.294
3.587LysGly: 3.587 ± 0.45
1.239LysHis: 1.239 ± 0.397
3.717LysIle: 3.717 ± 0.574
2.739LysLys: 2.739 ± 0.505
3.326LysLeu: 3.326 ± 0.582
1.435LysMet: 1.435 ± 0.329
1.435LysAsn: 1.435 ± 0.347
2.022LysPro: 2.022 ± 0.4
2.152LysGln: 2.152 ± 0.475
3.195LysArg: 3.195 ± 0.491
3.587LysSer: 3.587 ± 0.475
3.0LysThr: 3.0 ± 0.433
2.804LysVal: 2.804 ± 0.483
0.783LysTrp: 0.783 ± 0.201
0.913LysTyr: 0.913 ± 0.271
0.0LysXaa: 0.0 ± 0.0
Leu
7.76LeuAla: 7.76 ± 0.85
0.717LeuCys: 0.717 ± 0.282
4.043LeuAsp: 4.043 ± 0.701
3.326LeuGlu: 3.326 ± 0.443
2.217LeuPhe: 2.217 ± 0.475
4.304LeuGly: 4.304 ± 0.615
1.043LeuHis: 1.043 ± 0.309
4.239LeuIle: 4.239 ± 0.566
3.782LeuLys: 3.782 ± 0.435
6.13LeuLeu: 6.13 ± 0.809
1.239LeuMet: 1.239 ± 0.35
4.434LeuAsn: 4.434 ± 0.511
3.13LeuPro: 3.13 ± 0.478
3.195LeuGln: 3.195 ± 0.409
5.608LeuArg: 5.608 ± 0.669
4.76LeuSer: 4.76 ± 0.506
6.195LeuThr: 6.195 ± 0.941
4.826LeuVal: 4.826 ± 0.62
1.043LeuTrp: 1.043 ± 0.266
1.826LeuTyr: 1.826 ± 0.295
0.0LeuXaa: 0.0 ± 0.0
Met
2.804MetAla: 2.804 ± 0.484
0.196MetCys: 0.196 ± 0.113
1.369MetAsp: 1.369 ± 0.266
0.978MetGlu: 0.978 ± 0.252
0.587MetPhe: 0.587 ± 0.193
1.369MetGly: 1.369 ± 0.404
0.13MetHis: 0.13 ± 0.081
1.826MetIle: 1.826 ± 0.371
2.022MetLys: 2.022 ± 0.363
1.5MetLeu: 1.5 ± 0.289
0.456MetMet: 0.456 ± 0.18
1.435MetAsn: 1.435 ± 0.332
1.043MetPro: 1.043 ± 0.266
1.174MetGln: 1.174 ± 0.233
1.565MetArg: 1.565 ± 0.295
1.956MetSer: 1.956 ± 0.409
1.695MetThr: 1.695 ± 0.354
1.826MetVal: 1.826 ± 0.32
0.196MetTrp: 0.196 ± 0.119
0.456MetTyr: 0.456 ± 0.156
0.0MetXaa: 0.0 ± 0.0
Asn
5.608AsnAla: 5.608 ± 1.01
0.848AsnCys: 0.848 ± 0.249
2.282AsnAsp: 2.282 ± 0.339
2.608AsnGlu: 2.608 ± 0.438
1.695AsnPhe: 1.695 ± 0.283
5.282AsnGly: 5.282 ± 0.736
0.717AsnHis: 0.717 ± 0.239
2.674AsnIle: 2.674 ± 0.384
2.739AsnLys: 2.739 ± 0.329
2.608AsnLeu: 2.608 ± 0.402
0.783AsnMet: 0.783 ± 0.177
2.608AsnAsn: 2.608 ± 0.416
2.608AsnPro: 2.608 ± 0.424
2.217AsnGln: 2.217 ± 0.356
2.869AsnArg: 2.869 ± 0.431
3.0AsnSer: 3.0 ± 0.689
3.326AsnThr: 3.326 ± 0.455
3.391AsnVal: 3.391 ± 0.407
0.587AsnTrp: 0.587 ± 0.187
1.304AsnTyr: 1.304 ± 0.267
0.0AsnXaa: 0.0 ± 0.0
Pro
4.239ProAla: 4.239 ± 0.481
0.456ProCys: 0.456 ± 0.181
2.543ProAsp: 2.543 ± 0.492
2.217ProGlu: 2.217 ± 0.392
2.152ProPhe: 2.152 ± 0.41
3.326ProGly: 3.326 ± 0.498
0.783ProHis: 0.783 ± 0.33
1.565ProIle: 1.565 ± 0.3
1.891ProLys: 1.891 ± 0.302
2.804ProLeu: 2.804 ± 0.519
0.522ProMet: 0.522 ± 0.189
1.239ProAsn: 1.239 ± 0.316
1.891ProPro: 1.891 ± 0.403
1.5ProGln: 1.5 ± 0.306
1.761ProArg: 1.761 ± 0.386
2.087ProSer: 2.087 ± 0.318
2.869ProThr: 2.869 ± 0.514
3.13ProVal: 3.13 ± 0.587
0.913ProTrp: 0.913 ± 0.296
1.239ProTyr: 1.239 ± 0.342
0.0ProXaa: 0.0 ± 0.0
Gln
5.282GlnAla: 5.282 ± 0.746
0.456GlnCys: 0.456 ± 0.168
2.152GlnAsp: 2.152 ± 0.405
1.826GlnGlu: 1.826 ± 0.394
1.369GlnPhe: 1.369 ± 0.366
3.847GlnGly: 3.847 ± 0.607
0.913GlnHis: 0.913 ± 0.239
3.0GlnIle: 3.0 ± 0.507
1.369GlnLys: 1.369 ± 0.284
4.63GlnLeu: 4.63 ± 0.519
1.5GlnMet: 1.5 ± 0.372
2.413GlnAsn: 2.413 ± 0.339
1.956GlnPro: 1.956 ± 0.456
3.587GlnGln: 3.587 ± 0.583
2.543GlnArg: 2.543 ± 0.407
2.348GlnSer: 2.348 ± 0.481
1.826GlnThr: 1.826 ± 0.317
3.391GlnVal: 3.391 ± 0.402
0.848GlnTrp: 0.848 ± 0.216
1.304GlnTyr: 1.304 ± 0.26
0.0GlnXaa: 0.0 ± 0.0
Arg
5.608ArgAla: 5.608 ± 0.926
1.304ArgCys: 1.304 ± 0.351
3.521ArgAsp: 3.521 ± 0.501
3.913ArgGlu: 3.913 ± 0.686
2.608ArgPhe: 2.608 ± 0.422
4.173ArgGly: 4.173 ± 0.574
1.826ArgHis: 1.826 ± 0.367
4.239ArgIle: 4.239 ± 0.458
2.869ArgLys: 2.869 ± 0.573
5.086ArgLeu: 5.086 ± 0.562
1.891ArgMet: 1.891 ± 0.313
2.413ArgAsn: 2.413 ± 0.418
2.348ArgPro: 2.348 ± 0.403
2.739ArgGln: 2.739 ± 0.371
4.891ArgArg: 4.891 ± 0.638
3.065ArgSer: 3.065 ± 0.51
2.739ArgThr: 2.739 ± 0.384
4.565ArgVal: 4.565 ± 0.722
0.913ArgTrp: 0.913 ± 0.21
2.087ArgTyr: 2.087 ± 0.389
0.0ArgXaa: 0.0 ± 0.0
Ser
6.782SerAla: 6.782 ± 0.608
0.587SerCys: 0.587 ± 0.185
3.326SerAsp: 3.326 ± 0.666
2.934SerGlu: 2.934 ± 0.444
3.0SerPhe: 3.0 ± 0.412
5.152SerGly: 5.152 ± 0.699
0.717SerHis: 0.717 ± 0.191
4.108SerIle: 4.108 ± 0.59
2.739SerLys: 2.739 ± 0.385
4.239SerLeu: 4.239 ± 0.471
1.891SerMet: 1.891 ± 0.315
2.739SerAsn: 2.739 ± 0.685
2.804SerPro: 2.804 ± 0.354
3.456SerGln: 3.456 ± 0.591
3.521SerArg: 3.521 ± 0.468
4.369SerSer: 4.369 ± 0.657
4.304SerThr: 4.304 ± 0.566
3.521SerVal: 3.521 ± 0.39
1.109SerTrp: 1.109 ± 0.234
1.369SerTyr: 1.369 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
6.586ThrAla: 6.586 ± 1.106
0.717ThrCys: 0.717 ± 0.215
4.565ThrAsp: 4.565 ± 0.611
2.543ThrGlu: 2.543 ± 0.28
2.478ThrPhe: 2.478 ± 0.376
5.934ThrGly: 5.934 ± 0.689
0.783ThrHis: 0.783 ± 0.22
3.326ThrIle: 3.326 ± 0.381
2.152ThrLys: 2.152 ± 0.338
4.826ThrLeu: 4.826 ± 0.527
1.174ThrMet: 1.174 ± 0.322
2.674ThrAsn: 2.674 ± 0.808
3.456ThrPro: 3.456 ± 0.516
3.326ThrGln: 3.326 ± 0.651
2.543ThrArg: 2.543 ± 0.404
4.695ThrSer: 4.695 ± 0.901
3.913ThrThr: 3.913 ± 0.723
3.913ThrVal: 3.913 ± 0.542
0.848ThrTrp: 0.848 ± 0.228
2.217ThrTyr: 2.217 ± 0.454
0.0ThrXaa: 0.0 ± 0.0
Val
4.956ValAla: 4.956 ± 0.537
0.717ValCys: 0.717 ± 0.217
5.021ValAsp: 5.021 ± 0.7
5.347ValGlu: 5.347 ± 0.628
2.152ValPhe: 2.152 ± 0.362
5.086ValGly: 5.086 ± 0.637
0.717ValHis: 0.717 ± 0.22
3.195ValIle: 3.195 ± 0.385
3.0ValLys: 3.0 ± 0.556
4.304ValLeu: 4.304 ± 0.584
1.63ValMet: 1.63 ± 0.245
3.913ValAsn: 3.913 ± 0.406
1.695ValPro: 1.695 ± 0.253
2.478ValGln: 2.478 ± 0.333
4.565ValArg: 4.565 ± 0.441
5.282ValSer: 5.282 ± 0.577
4.695ValThr: 4.695 ± 0.773
4.239ValVal: 4.239 ± 0.605
0.587ValTrp: 0.587 ± 0.179
2.282ValTyr: 2.282 ± 0.397
0.0ValXaa: 0.0 ± 0.0
Trp
1.369TrpAla: 1.369 ± 0.334
0.326TrpCys: 0.326 ± 0.133
0.456TrpAsp: 0.456 ± 0.185
0.783TrpGlu: 0.783 ± 0.234
0.522TrpPhe: 0.522 ± 0.148
0.848TrpGly: 0.848 ± 0.238
0.391TrpHis: 0.391 ± 0.172
1.239TrpIle: 1.239 ± 0.278
0.456TrpLys: 0.456 ± 0.153
0.978TrpLeu: 0.978 ± 0.234
0.196TrpMet: 0.196 ± 0.099
0.717TrpAsn: 0.717 ± 0.177
0.456TrpPro: 0.456 ± 0.192
0.783TrpGln: 0.783 ± 0.231
1.695TrpArg: 1.695 ± 0.295
0.848TrpSer: 0.848 ± 0.24
1.369TrpThr: 1.369 ± 0.363
0.783TrpVal: 0.783 ± 0.215
0.261TrpTrp: 0.261 ± 0.142
0.261TrpTyr: 0.261 ± 0.106
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.478TyrAla: 2.478 ± 0.474
0.456TyrCys: 0.456 ± 0.161
1.956TyrAsp: 1.956 ± 0.328
1.5TyrGlu: 1.5 ± 0.332
1.109TyrPhe: 1.109 ± 0.274
2.478TyrGly: 2.478 ± 0.386
0.261TyrHis: 0.261 ± 0.135
1.043TyrIle: 1.043 ± 0.227
1.565TyrLys: 1.565 ± 0.36
2.087TyrLeu: 2.087 ± 0.384
0.783TyrMet: 0.783 ± 0.222
1.435TyrAsn: 1.435 ± 0.371
1.5TyrPro: 1.5 ± 0.345
1.435TyrGln: 1.435 ± 0.289
2.087TyrArg: 2.087 ± 0.362
1.891TyrSer: 1.891 ± 0.348
1.826TyrThr: 1.826 ± 0.32
1.695TyrVal: 1.695 ± 0.362
0.456TyrTrp: 0.456 ± 0.161
0.587TyrTyr: 0.587 ± 0.208
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (15336 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski