Amino acid dipeptide frequency for Staphylococcus virus phiNM1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
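As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It assumes one-letter sequences pooled over all proteins; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the exact procedure used to build this page may differ.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return overlapping dipeptide frequencies (per mille), pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each sequence; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real phiNM1 proteins), one-letter amino acid codes.
toy_proteins = ["MKKLA", "MKA"]
freqs = dipeptide_per_mille(toy_proteins)
```

Here `freqs` maps each observed pair (for example `"MK"`) to its share of all dipeptides in the toy set, so the values sum to 1000‰.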

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red and underrepresented ones are marked in blue in the table.
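The uniform expectation described above can be worked through in a few lines. This is only a sketch: it assumes the 400-dipeptide baseline and a plain greater-than or less-than comparison, whereas the page may use a different threshold or take the error estimate into account. The helper name `colour` is hypothetical; the three sample values are taken from the table below.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_per_mille = 1000.0 / STANDARD_AMINO_ACIDS ** 2  # 2.5


def colour(observed_per_mille, expected=expected_per_mille):
    """Assumed marking rule: red above the expectation, blue below it."""
    if observed_per_mille > expected:
        return "red"
    if observed_per_mille < expected:
        return "blue"
    return "none"


# Observed values (per mille) copied from the table for three dipeptides.
sample = {"LysLys": 8.95, "AlaAla": 1.214, "CysCys": 0.0}
marks = {pair: colour(value) for pair, value in sample.items()}
```

Under these assumptions LysLys would be marked red, while AlaAla and CysCys would be marked blue.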

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.214AlaAla: 1.214 ± 0.467
0.228AlaCys: 0.228 ± 0.119
2.503AlaAsp: 2.503 ± 0.432
4.172AlaGlu: 4.172 ± 0.643
3.186AlaPhe: 3.186 ± 0.574
3.565AlaGly: 3.565 ± 0.68
0.986AlaHis: 0.986 ± 0.272
4.703AlaIle: 4.703 ± 0.988
5.006AlaLys: 5.006 ± 0.674
4.399AlaLeu: 4.399 ± 0.743
1.517AlaMet: 1.517 ± 0.404
3.792AlaAsn: 3.792 ± 0.537
1.593AlaPro: 1.593 ± 0.359
2.124AlaGln: 2.124 ± 0.424
2.731AlaArg: 2.731 ± 0.51
3.641AlaSer: 3.641 ± 0.643
3.792AlaThr: 3.792 ± 0.59
3.717AlaVal: 3.717 ± 0.594
0.834AlaTrp: 0.834 ± 0.3
2.958AlaTyr: 2.958 ± 0.387
0.0AlaXaa: 0.0 ± 0.0
Cys
0.076CysAla: 0.076 ± 0.068
0.0CysCys: 0.0 ± 0.0
0.228CysAsp: 0.228 ± 0.159
0.455CysGlu: 0.455 ± 0.227
0.303CysPhe: 0.303 ± 0.163
0.152CysGly: 0.152 ± 0.118
0.076CysHis: 0.076 ± 0.071
0.607CysIle: 0.607 ± 0.237
0.455CysLys: 0.455 ± 0.19
0.303CysLeu: 0.303 ± 0.181
0.076CysMet: 0.076 ± 0.071
0.303CysAsn: 0.303 ± 0.17
0.152CysPro: 0.152 ± 0.107
0.076CysGln: 0.076 ± 0.061
0.303CysArg: 0.303 ± 0.183
0.379CysSer: 0.379 ± 0.194
0.228CysThr: 0.228 ± 0.111
0.152CysVal: 0.152 ± 0.116
0.076CysTrp: 0.076 ± 0.061
0.303CysTyr: 0.303 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
3.641AspAla: 3.641 ± 0.553
0.379AspCys: 0.379 ± 0.189
4.172AspAsp: 4.172 ± 0.862
5.84AspGlu: 5.84 ± 1.036
3.792AspPhe: 3.792 ± 0.584
3.717AspGly: 3.717 ± 0.507
0.379AspHis: 0.379 ± 0.218
4.779AspIle: 4.779 ± 0.692
5.461AspLys: 5.461 ± 0.679
4.475AspLeu: 4.475 ± 0.742
2.579AspMet: 2.579 ± 0.424
4.096AspAsn: 4.096 ± 0.593
1.062AspPro: 1.062 ± 0.283
0.986AspGln: 0.986 ± 0.309
2.2AspArg: 2.2 ± 0.427
3.717AspSer: 3.717 ± 0.588
3.868AspThr: 3.868 ± 0.63
4.248AspVal: 4.248 ± 0.608
0.683AspTrp: 0.683 ± 0.176
2.124AspTyr: 2.124 ± 0.413
0.0AspXaa: 0.0 ± 0.0
Glu
5.385GluAla: 5.385 ± 0.779
0.455GluCys: 0.455 ± 0.18
3.034GluAsp: 3.034 ± 0.591
5.84GluGlu: 5.84 ± 0.885
3.717GluPhe: 3.717 ± 0.598
3.413GluGly: 3.413 ± 0.597
0.986GluHis: 0.986 ± 0.283
6.447GluIle: 6.447 ± 1.015
5.916GluLys: 5.916 ± 0.958
6.826GluLeu: 6.826 ± 0.817
2.731GluMet: 2.731 ± 0.549
4.779GluAsn: 4.779 ± 0.677
2.351GluPro: 2.351 ± 0.379
3.717GluGln: 3.717 ± 0.563
3.717GluArg: 3.717 ± 0.545
4.248GluSer: 4.248 ± 0.601
3.413GluThr: 3.413 ± 0.503
5.006GluVal: 5.006 ± 0.656
0.607GluTrp: 0.607 ± 0.203
5.158GluTyr: 5.158 ± 0.762
0.0GluXaa: 0.0 ± 0.0
Phe
2.427PheAla: 2.427 ± 0.405
0.303PheCys: 0.303 ± 0.145
3.034PheAsp: 3.034 ± 0.481
4.551PheGlu: 4.551 ± 0.632
1.517PhePhe: 1.517 ± 0.345
2.882PheGly: 2.882 ± 0.456
0.758PheHis: 0.758 ± 0.247
2.731PheIle: 2.731 ± 0.516
4.02PheLys: 4.02 ± 0.587
2.958PheLeu: 2.958 ± 0.451
1.365PheMet: 1.365 ± 0.334
3.792PheAsn: 3.792 ± 0.424
0.91PhePro: 0.91 ± 0.321
1.365PheGln: 1.365 ± 0.369
1.441PheArg: 1.441 ± 0.31
2.806PheSer: 2.806 ± 0.509
2.882PheThr: 2.882 ± 0.429
2.731PheVal: 2.731 ± 0.629
0.152PheTrp: 0.152 ± 0.097
1.745PheTyr: 1.745 ± 0.283
0.0PheXaa: 0.0 ± 0.0
Gly
2.806GlyAla: 2.806 ± 0.487
0.303GlyCys: 0.303 ± 0.146
3.262GlyAsp: 3.262 ± 0.506
4.096GlyGlu: 4.096 ± 0.668
2.579GlyPhe: 2.579 ± 0.473
3.262GlyGly: 3.262 ± 0.612
1.517GlyHis: 1.517 ± 0.437
4.172GlyIle: 4.172 ± 0.564
5.537GlyLys: 5.537 ± 0.617
5.461GlyLeu: 5.461 ± 0.793
1.593GlyMet: 1.593 ± 0.34
3.337GlyAsn: 3.337 ± 0.609
0.683GlyPro: 0.683 ± 0.276
2.048GlyGln: 2.048 ± 0.474
2.048GlyArg: 2.048 ± 0.408
2.731GlySer: 2.731 ± 0.422
3.337GlyThr: 3.337 ± 0.492
4.703GlyVal: 4.703 ± 0.588
0.834GlyTrp: 0.834 ± 0.306
2.731GlyTyr: 2.731 ± 0.49
0.0GlyXaa: 0.0 ± 0.0
His
1.289HisAla: 1.289 ± 0.318
0.076HisCys: 0.076 ± 0.077
1.062HisAsp: 1.062 ± 0.258
0.91HisGlu: 0.91 ± 0.287
0.91HisPhe: 0.91 ± 0.31
1.214HisGly: 1.214 ± 0.315
0.531HisHis: 0.531 ± 0.226
0.986HisIle: 0.986 ± 0.28
0.758HisLys: 0.758 ± 0.238
1.517HisLeu: 1.517 ± 0.32
0.303HisMet: 0.303 ± 0.175
0.91HisAsn: 0.91 ± 0.274
0.607HisPro: 0.607 ± 0.169
0.455HisGln: 0.455 ± 0.175
0.303HisArg: 0.303 ± 0.157
1.365HisSer: 1.365 ± 0.284
0.531HisThr: 0.531 ± 0.214
1.441HisVal: 1.441 ± 0.392
0.076HisTrp: 0.076 ± 0.082
0.986HisTyr: 0.986 ± 0.315
0.0HisXaa: 0.0 ± 0.0
Ile
5.385IleAla: 5.385 ± 0.777
0.076IleCys: 0.076 ± 0.082
5.992IleAsp: 5.992 ± 0.588
6.523IleGlu: 6.523 ± 0.742
2.427IlePhe: 2.427 ± 0.488
3.868IleGly: 3.868 ± 0.548
1.289IleHis: 1.289 ± 0.288
4.475IleIle: 4.475 ± 0.706
7.509IleLys: 7.509 ± 0.668
4.627IleLeu: 4.627 ± 0.506
2.124IleMet: 2.124 ± 0.357
5.006IleAsn: 5.006 ± 0.635
2.427IlePro: 2.427 ± 0.406
2.579IleGln: 2.579 ± 0.423
3.337IleArg: 3.337 ± 0.5
4.172IleSer: 4.172 ± 0.758
5.234IleThr: 5.234 ± 0.863
4.172IleVal: 4.172 ± 0.527
1.365IleTrp: 1.365 ± 0.633
3.413IleTyr: 3.413 ± 0.611
0.0IleXaa: 0.0 ± 0.0
Lys
5.689LysAla: 5.689 ± 0.647
0.379LysCys: 0.379 ± 0.206
6.068LysAsp: 6.068 ± 0.822
7.661LysGlu: 7.661 ± 1.024
3.717LysPhe: 3.717 ± 0.607
5.309LysGly: 5.309 ± 0.638
1.669LysHis: 1.669 ± 0.391
6.523LysIle: 6.523 ± 0.816
8.95LysLys: 8.95 ± 0.852
6.751LysLeu: 6.751 ± 0.713
1.669LysMet: 1.669 ± 0.363
5.765LysAsn: 5.765 ± 0.792
2.882LysPro: 2.882 ± 0.554
4.323LysGln: 4.323 ± 0.627
3.944LysArg: 3.944 ± 0.647
5.082LysSer: 5.082 ± 0.567
5.082LysThr: 5.082 ± 0.594
5.84LysVal: 5.84 ± 0.642
0.986LysTrp: 0.986 ± 0.287
4.551LysTyr: 4.551 ± 0.794
0.0LysXaa: 0.0 ± 0.0
Leu
3.337LeuAla: 3.337 ± 0.575
0.379LeuCys: 0.379 ± 0.213
4.172LeuAsp: 4.172 ± 0.61
5.385LeuGlu: 5.385 ± 0.571
2.958LeuPhe: 2.958 ± 0.529
3.868LeuGly: 3.868 ± 0.495
0.834LeuHis: 0.834 ± 0.305
4.703LeuIle: 4.703 ± 0.471
7.737LeuLys: 7.737 ± 0.637
5.234LeuLeu: 5.234 ± 0.616
1.517LeuMet: 1.517 ± 0.302
5.537LeuAsn: 5.537 ± 0.521
2.275LeuPro: 2.275 ± 0.355
3.489LeuGln: 3.489 ± 0.459
3.641LeuArg: 3.641 ± 0.581
4.854LeuSer: 4.854 ± 0.615
4.551LeuThr: 4.551 ± 0.714
4.323LeuVal: 4.323 ± 0.689
0.683LeuTrp: 0.683 ± 0.266
3.034LeuTyr: 3.034 ± 0.554
0.0LeuXaa: 0.0 ± 0.0
Met
1.289MetAla: 1.289 ± 0.3
0.228MetCys: 0.228 ± 0.133
1.289MetAsp: 1.289 ± 0.302
1.745MetGlu: 1.745 ± 0.393
1.062MetPhe: 1.062 ± 0.28
0.91MetGly: 0.91 ± 0.249
0.303MetHis: 0.303 ± 0.189
1.441MetIle: 1.441 ± 0.273
2.124MetLys: 2.124 ± 0.381
2.503MetLeu: 2.503 ± 0.375
0.683MetMet: 0.683 ± 0.205
1.82MetAsn: 1.82 ± 0.381
1.289MetPro: 1.289 ± 0.355
1.593MetGln: 1.593 ± 0.4
1.138MetArg: 1.138 ± 0.301
1.365MetSer: 1.365 ± 0.347
2.731MetThr: 2.731 ± 0.511
0.91MetVal: 0.91 ± 0.218
0.455MetTrp: 0.455 ± 0.181
1.441MetTyr: 1.441 ± 0.297
0.0MetXaa: 0.0 ± 0.0
Asn
4.475AsnAla: 4.475 ± 0.917
0.152AsnCys: 0.152 ± 0.112
5.082AsnAsp: 5.082 ± 0.591
4.399AsnGlu: 4.399 ± 0.52
2.882AsnPhe: 2.882 ± 0.602
6.068AsnGly: 6.068 ± 0.69
1.062AsnHis: 1.062 ± 0.309
4.248AsnIle: 4.248 ± 0.496
6.675AsnLys: 6.675 ± 0.639
4.323AsnLeu: 4.323 ± 0.665
1.972AsnMet: 1.972 ± 0.332
4.779AsnAsn: 4.779 ± 0.683
2.958AsnPro: 2.958 ± 0.589
2.427AsnGln: 2.427 ± 0.415
2.806AsnArg: 2.806 ± 0.414
2.882AsnSer: 2.882 ± 0.469
4.096AsnThr: 4.096 ± 0.469
4.248AsnVal: 4.248 ± 0.742
0.683AsnTrp: 0.683 ± 0.197
3.262AsnTyr: 3.262 ± 0.558
0.0AsnXaa: 0.0 ± 0.0
Pro
1.138ProAla: 1.138 ± 0.272
0.379ProCys: 0.379 ± 0.157
1.365ProAsp: 1.365 ± 0.272
1.896ProGlu: 1.896 ± 0.331
1.289ProPhe: 1.289 ± 0.323
1.593ProGly: 1.593 ± 0.472
0.228ProHis: 0.228 ± 0.124
2.351ProIle: 2.351 ± 0.424
2.882ProLys: 2.882 ± 0.544
1.365ProLeu: 1.365 ± 0.302
0.986ProMet: 0.986 ± 0.255
2.503ProAsn: 2.503 ± 0.471
0.607ProPro: 0.607 ± 0.231
1.593ProGln: 1.593 ± 0.36
1.138ProArg: 1.138 ± 0.296
1.593ProSer: 1.593 ± 0.391
2.351ProThr: 2.351 ± 0.406
1.82ProVal: 1.82 ± 0.434
0.152ProTrp: 0.152 ± 0.107
1.441ProTyr: 1.441 ± 0.367
0.0ProXaa: 0.0 ± 0.0
Gln
2.882GlnAla: 2.882 ± 0.468
0.379GlnCys: 0.379 ± 0.159
2.275GlnAsp: 2.275 ± 0.474
2.806GlnGlu: 2.806 ± 0.686
1.896GlnPhe: 1.896 ± 0.456
2.2GlnGly: 2.2 ± 0.34
0.607GlnHis: 0.607 ± 0.193
2.731GlnIle: 2.731 ± 0.348
2.882GlnLys: 2.882 ± 0.451
2.958GlnLeu: 2.958 ± 0.587
1.062GlnMet: 1.062 ± 0.33
1.972GlnAsn: 1.972 ± 0.326
1.289GlnPro: 1.289 ± 0.275
1.593GlnGln: 1.593 ± 0.508
1.669GlnArg: 1.669 ± 0.423
2.275GlnSer: 2.275 ± 0.392
1.972GlnThr: 1.972 ± 0.389
1.972GlnVal: 1.972 ± 0.432
0.303GlnTrp: 0.303 ± 0.176
1.365GlnTyr: 1.365 ± 0.393
0.0GlnXaa: 0.0 ± 0.0
Arg
1.82ArgAla: 1.82 ± 0.379
0.303ArgCys: 0.303 ± 0.163
2.655ArgAsp: 2.655 ± 0.563
3.186ArgGlu: 3.186 ± 0.434
1.745ArgPhe: 1.745 ± 0.398
2.124ArgGly: 2.124 ± 0.44
1.138ArgHis: 1.138 ± 0.287
3.565ArgIle: 3.565 ± 0.544
4.399ArgLys: 4.399 ± 0.657
4.02ArgLeu: 4.02 ± 0.523
0.683ArgMet: 0.683 ± 0.26
3.11ArgAsn: 3.11 ± 0.516
0.683ArgPro: 0.683 ± 0.208
1.365ArgGln: 1.365 ± 0.393
1.593ArgArg: 1.593 ± 0.394
1.82ArgSer: 1.82 ± 0.305
1.669ArgThr: 1.669 ± 0.402
2.2ArgVal: 2.2 ± 0.466
0.303ArgTrp: 0.303 ± 0.129
2.351ArgTyr: 2.351 ± 0.461
0.0ArgXaa: 0.0 ± 0.0
Ser
3.717SerAla: 3.717 ± 0.658
0.152SerCys: 0.152 ± 0.136
4.096SerAsp: 4.096 ± 0.64
3.489SerGlu: 3.489 ± 0.593
2.503SerPhe: 2.503 ± 0.396
3.565SerGly: 3.565 ± 0.702
1.214SerHis: 1.214 ± 0.356
6.22SerIle: 6.22 ± 0.559
5.385SerLys: 5.385 ± 0.6
3.262SerLeu: 3.262 ± 0.437
1.669SerMet: 1.669 ± 0.336
3.868SerAsn: 3.868 ± 0.481
1.517SerPro: 1.517 ± 0.434
1.745SerGln: 1.745 ± 0.431
2.351SerArg: 2.351 ± 0.311
3.641SerSer: 3.641 ± 0.434
3.717SerThr: 3.717 ± 0.431
3.11SerVal: 3.11 ± 0.513
0.228SerTrp: 0.228 ± 0.137
2.275SerTyr: 2.275 ± 0.513
0.0SerXaa: 0.0 ± 0.0
Thr
3.641ThrAla: 3.641 ± 0.591
0.076ThrCys: 0.076 ± 0.08
3.413ThrAsp: 3.413 ± 0.648
4.172ThrGlu: 4.172 ± 0.558
2.579ThrPhe: 2.579 ± 0.562
3.641ThrGly: 3.641 ± 0.525
0.683ThrHis: 0.683 ± 0.189
5.84ThrIle: 5.84 ± 1.206
5.84ThrLys: 5.84 ± 0.584
4.096ThrLeu: 4.096 ± 0.436
0.986ThrMet: 0.986 ± 0.316
4.475ThrAsn: 4.475 ± 0.782
1.745ThrPro: 1.745 ± 0.367
2.351ThrGln: 2.351 ± 0.532
2.275ThrArg: 2.275 ± 0.328
3.944ThrSer: 3.944 ± 0.675
3.717ThrThr: 3.717 ± 0.784
4.323ThrVal: 4.323 ± 0.651
0.531ThrTrp: 0.531 ± 0.219
2.2ThrTyr: 2.2 ± 0.43
0.0ThrXaa: 0.0 ± 0.0
Val
3.641ValAla: 3.641 ± 0.987
0.228ValCys: 0.228 ± 0.126
5.082ValAsp: 5.082 ± 0.713
6.296ValGlu: 6.296 ± 0.776
2.427ValPhe: 2.427 ± 0.487
2.124ValGly: 2.124 ± 0.503
0.986ValHis: 0.986 ± 0.265
5.309ValIle: 5.309 ± 0.558
5.689ValLys: 5.689 ± 0.594
4.02ValLeu: 4.02 ± 0.702
1.745ValMet: 1.745 ± 0.414
4.779ValAsn: 4.779 ± 0.603
2.2ValPro: 2.2 ± 0.418
1.289ValGln: 1.289 ± 0.286
1.896ValArg: 1.896 ± 0.447
3.792ValSer: 3.792 ± 0.635
3.489ValThr: 3.489 ± 0.482
4.399ValVal: 4.399 ± 0.562
0.986ValTrp: 0.986 ± 0.397
2.275ValTyr: 2.275 ± 0.483
0.0ValXaa: 0.0 ± 0.0
Trp
0.986TrpAla: 0.986 ± 0.294
0.076TrpCys: 0.076 ± 0.061
0.379TrpAsp: 0.379 ± 0.174
0.607TrpGlu: 0.607 ± 0.208
0.379TrpPhe: 0.379 ± 0.181
0.683TrpGly: 0.683 ± 0.299
0.076TrpHis: 0.076 ± 0.071
0.758TrpIle: 0.758 ± 0.234
0.683TrpLys: 0.683 ± 0.224
0.455TrpLeu: 0.455 ± 0.151
0.152TrpMet: 0.152 ± 0.113
1.745TrpAsn: 1.745 ± 1.016
0.152TrpPro: 0.152 ± 0.104
0.531TrpGln: 0.531 ± 0.229
0.228TrpArg: 0.228 ± 0.134
0.91TrpSer: 0.91 ± 0.33
0.91TrpThr: 0.91 ± 0.304
0.531TrpVal: 0.531 ± 0.179
0.0TrpTrp: 0.0 ± 0.0
0.607TrpTyr: 0.607 ± 0.21
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.82TyrAla: 1.82 ± 0.375
0.152TyrCys: 0.152 ± 0.113
3.034TyrAsp: 3.034 ± 0.609
3.717TyrGlu: 3.717 ± 0.664
2.503TyrPhe: 2.503 ± 0.432
2.882TyrGly: 2.882 ± 0.578
0.91TyrHis: 0.91 ± 0.316
3.489TyrIle: 3.489 ± 0.509
4.93TyrLys: 4.93 ± 0.729
2.958TyrLeu: 2.958 ± 0.596
0.834TyrMet: 0.834 ± 0.239
3.11TyrAsn: 3.11 ± 0.416
1.289TyrPro: 1.289 ± 0.371
1.441TyrGln: 1.441 ± 0.341
1.972TyrArg: 1.972 ± 0.461
2.503TyrSer: 2.503 ± 0.483
2.958TyrThr: 2.958 ± 0.48
2.806TyrVal: 2.806 ± 0.439
0.91TyrTrp: 0.91 ± 0.249
1.441TyrTyr: 1.441 ± 0.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (13185 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
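A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. Using the sample standard deviation as the error, the fixed seed, and the names `pair_per_mille` and `bootstrap_error` are all assumptions, not details confirmed by the page.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Toy input (not real phiNM1 proteins), one-letter amino acid codes.
toy_proteins = ["MKKLA", "MKA", "AKKKG", "MLLKA"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.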

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski