Amino acid dipeptide frequency for Streptococcus phage phi-SsuFJSM5_rum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
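As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name, the toy sequences, the one-letter residue codes, and the choice to normalise by the total number of dipeptide positions are all assumptions made for this example; the exact normalisation used for the table on this page is not stated here.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are counted with overlap and never span
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent residue pair in order.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real phage proteins), one-letter amino acid codes.
toy_proteins = ["MKLLA", "MKA"]
freqs = dipeptide_per_mille(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Note that the order of the two residues matters: AlaCys and CysAla are counted as different dipeptides, which is why the table is not symmetric.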

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
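The arithmetic behind the expected value and the unit conversion can be sketched as follows. With 20 standard amino acids there are 20 × 20 = 400 ordered pairs, so a uniform distribution gives 1/400 = 0.25% = 2.5‰ per dipeptide. The names below are hypothetical, and using the uniform 2.5‰ value as the threshold for "over" and "under" is an assumption for illustration; the page does not state exactly how its red and blue marking is computed.

```python
N_STANDARD = 20  # standard amino acids, Xaa not counted

# Expected frequency if all 400 ordered pairs were equally likely.
expected_percent = 100 / N_STANDARD**2      # 0.25 (%)
expected_per_mille = 1000 / N_STANDARD**2   # 2.5 (per mille)


def per_mille_to_fraction(value):
    """Convert a per-mille table value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille):
    """Label a table value relative to the uniform expectation (assumed threshold)."""
    if value_per_mille > expected_per_mille:
        return "over"
    if value_per_mille < expected_per_mille:
        return "under"
    return "as expected"


# Two values taken from the table on this page.
glu_leu = 8.519
cys_met = 0.066
print(classify(glu_leu), per_mille_to_fraction(glu_leu))
print(classify(cys_met), per_mille_to_fraction(cys_met))
```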

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.698 ± 1.111
AlaCys: 0.594 ± 0.168
AlaAsp: 3.17 ± 0.352
AlaGlu: 4.557 ± 0.606
AlaPhe: 2.972 ± 0.358
AlaGly: 4.028 ± 0.55
AlaHis: 0.66 ± 0.207
AlaIle: 5.547 ± 0.764
AlaLys: 5.613 ± 0.485
AlaLeu: 4.953 ± 0.687
AlaMet: 1.519 ± 0.345
AlaAsn: 3.17 ± 0.394
AlaPro: 1.585 ± 0.373
AlaGln: 2.377 ± 0.586
AlaArg: 2.708 ± 0.476
AlaSer: 4.424 ± 0.694
AlaThr: 4.292 ± 0.696
AlaVal: 3.368 ± 0.468
AlaTrp: 0.726 ± 0.192
AlaTyr: 3.104 ± 0.401
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.594 ± 0.253
CysCys: 0.396 ± 0.133
CysAsp: 0.33 ± 0.131
CysGlu: 0.792 ± 0.224
CysPhe: 0.33 ± 0.138
CysGly: 0.858 ± 0.213
CysHis: 0.33 ± 0.131
CysIle: 0.594 ± 0.199
CysLys: 0.66 ± 0.297
CysLeu: 0.792 ± 0.321
CysMet: 0.066 ± 0.061
CysAsn: 0.528 ± 0.17
CysPro: 0.396 ± 0.19
CysGln: 0.792 ± 0.212
CysArg: 0.66 ± 0.247
CysSer: 0.528 ± 0.205
CysThr: 0.198 ± 0.106
CysVal: 0.528 ± 0.239
CysTrp: 0.0 ± 0.0
CysTyr: 0.792 ± 0.247
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.575 ± 0.366
AspCys: 0.66 ± 0.245
AspAsp: 3.698 ± 0.684
AspGlu: 5.613 ± 0.793
AspPhe: 3.17 ± 0.445
AspGly: 4.557 ± 0.632
AspHis: 0.726 ± 0.219
AspIle: 4.623 ± 0.452
AspLys: 4.491 ± 0.435
AspLeu: 5.019 ± 0.538
AspMet: 1.849 ± 0.462
AspAsn: 2.575 ± 0.419
AspPro: 1.651 ± 0.478
AspGln: 1.255 ± 0.269
AspArg: 2.509 ± 0.492
AspSer: 3.896 ± 0.615
AspThr: 2.774 ± 0.406
AspVal: 3.104 ± 0.447
AspTrp: 1.189 ± 0.216
AspTyr: 3.764 ± 0.658
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.358 ± 0.433
GluCys: 0.66 ± 0.209
GluAsp: 4.557 ± 0.675
GluGlu: 5.943 ± 0.754
GluPhe: 2.509 ± 0.553
GluGly: 4.292 ± 0.528
GluHis: 1.321 ± 0.284
GluIle: 4.953 ± 0.415
GluLys: 6.802 ± 0.801
GluLeu: 8.519 ± 0.719
GluMet: 2.575 ± 0.545
GluAsn: 4.557 ± 0.602
GluPro: 1.255 ± 0.315
GluGln: 4.028 ± 0.508
GluArg: 2.84 ± 0.3
GluSer: 3.896 ± 0.459
GluThr: 4.623 ± 0.544
GluVal: 4.491 ± 0.574
GluTrp: 0.858 ± 0.286
GluTyr: 2.377 ± 0.532
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.113 ± 0.381
PheCys: 0.66 ± 0.255
PheAsp: 2.774 ± 0.515
PheGlu: 3.434 ± 0.531
PhePhe: 1.387 ± 0.371
PheGly: 2.377 ± 0.301
PheHis: 0.792 ± 0.216
PheIle: 1.915 ± 0.367
PheLys: 2.906 ± 0.558
PheLeu: 2.972 ± 0.505
PheMet: 0.991 ± 0.232
PheAsn: 1.981 ± 0.293
PhePro: 0.991 ± 0.279
PheGln: 1.519 ± 0.333
PheArg: 1.981 ± 0.368
PheSer: 2.377 ± 0.408
PheThr: 2.509 ± 0.479
PheVal: 2.113 ± 0.377
PheTrp: 0.858 ± 0.21
PheTyr: 1.717 ± 0.329
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.104 ± 0.601
GlyCys: 0.528 ± 0.164
GlyAsp: 3.896 ± 0.562
GlyGlu: 4.094 ± 0.519
GlyPhe: 2.377 ± 0.322
GlyGly: 3.83 ± 0.644
GlyHis: 1.651 ± 0.42
GlyIle: 5.349 ± 0.761
GlyLys: 4.689 ± 0.527
GlyLeu: 5.745 ± 0.84
GlyMet: 1.717 ± 0.334
GlyAsn: 3.302 ± 0.501
GlyPro: 0.792 ± 0.215
GlyGln: 2.84 ± 0.445
GlyArg: 3.83 ± 0.397
GlySer: 3.698 ± 0.432
GlyThr: 3.5 ± 0.542
GlyVal: 3.962 ± 0.706
GlyTrp: 0.726 ± 0.172
GlyTyr: 3.104 ± 0.467
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.792 ± 0.211
HisCys: 0.264 ± 0.148
HisAsp: 1.123 ± 0.294
HisGlu: 0.858 ± 0.279
HisPhe: 1.057 ± 0.272
HisGly: 1.651 ± 0.315
HisHis: 0.726 ± 0.214
HisIle: 1.255 ± 0.23
HisLys: 1.057 ± 0.288
HisLeu: 1.915 ± 0.282
HisMet: 0.396 ± 0.17
HisAsn: 0.991 ± 0.264
HisPro: 1.321 ± 0.318
HisGln: 0.858 ± 0.305
HisArg: 1.123 ± 0.332
HisSer: 0.858 ± 0.248
HisThr: 1.057 ± 0.278
HisVal: 1.123 ± 0.316
HisTrp: 0.264 ± 0.108
HisTyr: 0.726 ± 0.218
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.689 ± 0.46
IleCys: 0.66 ± 0.18
IleAsp: 5.217 ± 0.551
IleGlu: 4.226 ± 0.576
IlePhe: 1.981 ± 0.433
IleGly: 4.755 ± 0.749
IleHis: 0.991 ± 0.218
IleIle: 3.104 ± 0.355
IleLys: 4.424 ± 0.628
IleLeu: 6.009 ± 0.505
IleMet: 1.057 ± 0.242
IleAsn: 3.434 ± 0.436
IlePro: 2.575 ± 0.321
IleGln: 2.377 ± 0.36
IleArg: 2.774 ± 0.432
IleSer: 5.151 ± 0.867
IleThr: 4.491 ± 0.787
IleVal: 4.821 ± 0.589
IleTrp: 0.925 ± 0.343
IleTyr: 2.906 ± 0.486
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.613 ± 0.625
LysCys: 0.462 ± 0.154
LysAsp: 3.764 ± 0.437
LysGlu: 5.811 ± 0.568
LysPhe: 2.641 ± 0.488
LysGly: 4.689 ± 0.562
LysHis: 1.651 ± 0.374
LysIle: 4.887 ± 0.447
LysLys: 5.217 ± 0.602
LysLeu: 6.604 ± 0.656
LysMet: 2.047 ± 0.438
LysAsn: 3.368 ± 0.479
LysPro: 2.509 ± 0.398
LysGln: 3.236 ± 0.511
LysArg: 3.83 ± 0.549
LysSer: 4.623 ± 0.611
LysThr: 3.962 ± 0.401
LysVal: 5.019 ± 0.675
LysTrp: 1.123 ± 0.268
LysTyr: 2.972 ± 0.488
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.009 ± 0.713
LeuCys: 0.66 ± 0.243
LeuAsp: 5.217 ± 0.468
LeuGlu: 7.396 ± 0.781
LeuPhe: 2.906 ± 0.482
LeuGly: 5.019 ± 0.566
LeuHis: 1.519 ± 0.271
LeuIle: 5.349 ± 0.483
LeuLys: 7.264 ± 0.624
LeuLeu: 7.594 ± 0.739
LeuMet: 2.047 ± 0.319
LeuAsn: 4.424 ± 0.52
LeuPro: 3.236 ± 0.453
LeuGln: 3.566 ± 0.53
LeuArg: 3.962 ± 0.439
LeuSer: 7.594 ± 0.574
LeuThr: 6.736 ± 0.828
LeuVal: 5.547 ± 0.692
LeuTrp: 0.66 ± 0.192
LeuTyr: 3.83 ± 0.563
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.915 ± 0.251
MetCys: 0.132 ± 0.099
MetAsp: 2.311 ± 0.459
MetGlu: 1.849 ± 0.383
MetPhe: 1.057 ± 0.237
MetGly: 1.255 ± 0.335
MetHis: 0.066 ± 0.072
MetIle: 1.321 ± 0.278
MetLys: 1.585 ± 0.344
MetLeu: 1.519 ± 0.38
MetMet: 1.255 ± 0.353
MetAsn: 0.792 ± 0.178
MetPro: 0.66 ± 0.236
MetGln: 0.792 ± 0.245
MetArg: 1.453 ± 0.286
MetSer: 2.047 ± 0.458
MetThr: 1.981 ± 0.476
MetVal: 1.585 ± 0.373
MetTrp: 0.264 ± 0.123
MetTyr: 0.66 ± 0.214
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.962 ± 0.551
AsnCys: 0.462 ± 0.19
AsnAsp: 2.641 ± 0.434
AsnGlu: 3.764 ± 0.546
AsnPhe: 1.915 ± 0.351
AsnGly: 4.821 ± 0.524
AsnHis: 1.651 ± 0.329
AsnIle: 2.641 ± 0.363
AsnLys: 3.434 ± 0.591
AsnLeu: 4.953 ± 0.926
AsnMet: 1.123 ± 0.241
AsnAsn: 2.179 ± 0.433
AsnPro: 1.915 ± 0.357
AsnGln: 2.443 ± 0.375
AsnArg: 2.245 ± 0.48
AsnSer: 2.906 ± 0.368
AsnThr: 1.783 ± 0.438
AsnVal: 1.981 ± 0.394
AsnTrp: 0.925 ± 0.255
AsnTyr: 1.585 ± 0.285
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.057 ± 0.268
ProCys: 0.396 ± 0.137
ProAsp: 1.783 ± 0.314
ProGlu: 2.708 ± 0.444
ProPhe: 1.123 ± 0.293
ProGly: 1.057 ± 0.389
ProHis: 0.726 ± 0.224
ProIle: 1.783 ± 0.34
ProLys: 2.575 ± 0.474
ProLeu: 2.972 ± 0.471
ProMet: 0.528 ± 0.17
ProAsn: 1.519 ± 0.327
ProPro: 1.057 ± 0.311
ProGln: 1.057 ± 0.284
ProArg: 1.387 ± 0.287
ProSer: 2.641 ± 0.48
ProThr: 2.113 ± 0.337
ProVal: 2.179 ± 0.39
ProTrp: 0.396 ± 0.158
ProTyr: 1.321 ± 0.301
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.632 ± 0.542
GlnCys: 0.594 ± 0.23
GlnAsp: 1.981 ± 0.375
GlnGlu: 2.84 ± 0.39
GlnPhe: 1.849 ± 0.323
GlnGly: 1.783 ± 0.355
GlnHis: 0.462 ± 0.138
GlnIle: 2.575 ± 0.338
GlnLys: 2.708 ± 0.547
GlnLeu: 4.226 ± 0.511
GlnMet: 1.387 ± 0.285
GlnAsn: 2.509 ± 0.461
GlnPro: 1.387 ± 0.322
GlnGln: 1.783 ± 0.286
GlnArg: 1.717 ± 0.439
GlnSer: 2.509 ± 0.43
GlnThr: 2.906 ± 0.73
GlnVal: 3.698 ± 0.538
GlnTrp: 0.594 ± 0.214
GlnTyr: 0.925 ± 0.253
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.113 ± 0.361
ArgCys: 0.594 ± 0.201
ArgAsp: 2.84 ± 0.515
ArgGlu: 2.906 ± 0.277
ArgPhe: 1.651 ± 0.349
ArgGly: 2.84 ± 0.485
ArgHis: 0.726 ± 0.217
ArgIle: 3.698 ± 0.681
ArgLys: 4.094 ± 0.586
ArgLeu: 4.623 ± 0.458
ArgMet: 1.057 ± 0.318
ArgAsn: 2.509 ± 0.245
ArgPro: 1.651 ± 0.308
ArgGln: 2.641 ± 0.36
ArgArg: 2.377 ± 0.496
ArgSer: 2.641 ± 0.375
ArgThr: 2.774 ± 0.676
ArgVal: 3.104 ± 0.578
ArgTrp: 1.057 ± 0.228
ArgTyr: 1.387 ± 0.324
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.226 ± 0.628
SerCys: 0.726 ± 0.234
SerAsp: 3.896 ± 0.621
SerGlu: 4.424 ± 0.593
SerPhe: 2.443 ± 0.472
SerGly: 4.491 ± 0.61
SerHis: 1.453 ± 0.271
SerIle: 5.085 ± 0.715
SerLys: 4.953 ± 0.736
SerLeu: 5.613 ± 0.515
SerMet: 1.189 ± 0.281
SerAsn: 2.84 ± 0.514
SerPro: 1.981 ± 0.35
SerGln: 2.906 ± 0.735
SerArg: 3.566 ± 0.585
SerSer: 5.283 ± 0.799
SerThr: 4.094 ± 0.572
SerVal: 4.292 ± 0.476
SerTrp: 1.255 ± 0.24
SerTyr: 2.377 ± 0.449
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.755 ± 0.6
ThrCys: 0.198 ± 0.111
ThrAsp: 3.038 ± 0.533
ThrGlu: 4.557 ± 0.53
ThrPhe: 2.377 ± 0.463
ThrGly: 3.962 ± 0.609
ThrHis: 1.123 ± 0.241
ThrIle: 4.292 ± 0.728
ThrLys: 4.424 ± 0.449
ThrLeu: 5.613 ± 0.493
ThrMet: 1.123 ± 0.246
ThrAsn: 2.443 ± 0.429
ThrPro: 1.849 ± 0.366
ThrGln: 2.377 ± 0.666
ThrArg: 2.377 ± 0.501
ThrSer: 4.557 ± 0.981
ThrThr: 4.887 ± 0.673
ThrVal: 5.217 ± 0.82
ThrTrp: 0.991 ± 0.27
ThrTyr: 2.245 ± 0.442
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.292 ± 0.597
ValCys: 0.528 ± 0.243
ValAsp: 4.094 ± 0.62
ValGlu: 4.689 ± 0.686
ValPhe: 2.311 ± 0.439
ValGly: 3.368 ± 0.614
ValHis: 1.189 ± 0.241
ValIle: 4.226 ± 0.521
ValLys: 3.962 ± 0.576
ValLeu: 5.811 ± 0.578
ValMet: 1.519 ± 0.279
ValAsn: 3.038 ± 0.432
ValPro: 2.245 ± 0.37
ValGln: 2.509 ± 0.458
ValArg: 3.236 ± 0.656
ValSer: 4.358 ± 0.681
ValThr: 4.557 ± 0.728
ValVal: 3.104 ± 0.398
ValTrp: 1.189 ± 0.261
ValTyr: 1.981 ± 0.374
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.123 ± 0.3
TrpCys: 0.198 ± 0.123
TrpAsp: 0.396 ± 0.147
TrpGlu: 1.387 ± 0.265
TrpPhe: 0.66 ± 0.237
TrpGly: 0.726 ± 0.162
TrpHis: 0.33 ± 0.119
TrpIle: 0.991 ± 0.304
TrpLys: 0.726 ± 0.208
TrpLeu: 1.123 ± 0.286
TrpMet: 0.528 ± 0.182
TrpAsn: 1.387 ± 0.324
TrpPro: 0.066 ± 0.063
TrpGln: 0.858 ± 0.261
TrpArg: 0.594 ± 0.209
TrpSer: 0.792 ± 0.263
TrpThr: 1.255 ± 0.305
TrpVal: 0.991 ± 0.219
TrpTrp: 0.264 ± 0.138
TrpTyr: 0.462 ± 0.203
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.443 ± 0.396
TyrCys: 0.792 ± 0.216
TyrAsp: 2.972 ± 0.492
TyrGlu: 3.566 ± 0.542
TyrPhe: 1.651 ± 0.41
TyrGly: 2.377 ± 0.459
TyrHis: 1.321 ± 0.374
TyrIle: 2.377 ± 0.374
TyrLys: 2.443 ± 0.509
TyrLeu: 3.962 ± 0.514
TyrMet: 0.462 ± 0.195
TyrAsn: 1.783 ± 0.376
TyrPro: 1.255 ± 0.241
TyrGln: 1.849 ± 0.309
TyrArg: 2.113 ± 0.324
TyrSer: 2.311 ± 0.445
TyrThr: 1.915 ± 0.366
TyrVal: 2.047 ± 0.399
TyrTrp: 0.594 ± 0.274
TyrTyr: 1.651 ± 0.455
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (15144 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
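A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. This is only a minimal illustration; the function names, the fixed seed, the toy sequences, and the use of the standard deviation as the ± value are assumptions, not a description of the exact procedure behind the table.

```python
import random
from statistics import mean, stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over all adjacent residue pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Resample proteins with replacement `rounds` times; return (mean, stdev)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return mean(estimates), stdev(estimates)


# Toy input (not real phage proteins), one-letter amino acid codes.
toy_proteins = ["MKLLA", "MKA", "MLLKK", "MAAKL"]
avg, err = bootstrap_error(toy_proteins, "LL")
print(f"LL: {avg:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps adjacent pairs intact, so each bootstrap replicate is still a valid set of sequences.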

The dipeptide statistics above (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski