Amino acid dipepetide frequency for Staphylococcus phage phiSP119-3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.292AlaAla: 2.292 ± 0.457
0.296AlaCys: 0.296 ± 0.137
3.031AlaAsp: 3.031 ± 0.463
4.658AlaGlu: 4.658 ± 0.562
2.588AlaPhe: 2.588 ± 0.429
3.031AlaGly: 3.031 ± 0.481
0.739AlaHis: 0.739 ± 0.219
5.988AlaIle: 5.988 ± 1.027
5.767AlaLys: 5.767 ± 0.792
5.545AlaLeu: 5.545 ± 0.695
1.479AlaMet: 1.479 ± 0.345
4.436AlaAsn: 4.436 ± 0.587
0.739AlaPro: 0.739 ± 0.2
2.07AlaGln: 2.07 ± 0.442
2.366AlaArg: 2.366 ± 0.348
2.809AlaSer: 2.809 ± 0.458
4.214AlaThr: 4.214 ± 0.566
3.475AlaVal: 3.475 ± 0.588
0.739AlaTrp: 0.739 ± 0.241
2.366AlaTyr: 2.366 ± 0.383
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.148CysCys: 0.148 ± 0.137
0.148CysAsp: 0.148 ± 0.101
0.296CysGlu: 0.296 ± 0.141
0.148CysPhe: 0.148 ± 0.092
0.148CysGly: 0.148 ± 0.096
0.37CysHis: 0.37 ± 0.183
0.148CysIle: 0.148 ± 0.11
0.444CysLys: 0.444 ± 0.154
0.37CysLeu: 0.37 ± 0.138
0.222CysMet: 0.222 ± 0.13
0.37CysAsn: 0.37 ± 0.153
0.222CysPro: 0.222 ± 0.149
0.37CysGln: 0.37 ± 0.168
0.591CysArg: 0.591 ± 0.213
0.296CysSer: 0.296 ± 0.129
0.222CysThr: 0.222 ± 0.133
0.37CysVal: 0.37 ± 0.175
0.222CysTrp: 0.222 ± 0.113
0.148CysTyr: 0.148 ± 0.096
0.0CysXaa: 0.0 ± 0.0
Asp
3.697AspAla: 3.697 ± 0.483
0.296AspCys: 0.296 ± 0.165
5.249AspAsp: 5.249 ± 0.671
6.062AspGlu: 6.062 ± 0.764
3.549AspPhe: 3.549 ± 0.591
4.436AspGly: 4.436 ± 0.624
0.961AspHis: 0.961 ± 0.267
4.51AspIle: 4.51 ± 0.624
5.841AspLys: 5.841 ± 0.786
6.432AspLeu: 6.432 ± 0.762
1.996AspMet: 1.996 ± 0.349
3.697AspAsn: 3.697 ± 0.438
1.035AspPro: 1.035 ± 0.243
1.035AspGln: 1.035 ± 0.271
2.07AspArg: 2.07 ± 0.345
3.992AspSer: 3.992 ± 0.645
3.327AspThr: 3.327 ± 0.458
4.436AspVal: 4.436 ± 0.449
0.518AspTrp: 0.518 ± 0.15
3.031AspTyr: 3.031 ± 0.517
0.0AspXaa: 0.0 ± 0.0
Glu
4.953GluAla: 4.953 ± 0.62
0.813GluCys: 0.813 ± 0.269
3.844GluAsp: 3.844 ± 0.645
6.284GluGlu: 6.284 ± 0.899
3.697GluPhe: 3.697 ± 0.417
2.366GluGly: 2.366 ± 0.38
1.7GluHis: 1.7 ± 0.364
4.658GluIle: 4.658 ± 0.614
5.397GluLys: 5.397 ± 0.668
7.689GluLeu: 7.689 ± 0.675
2.957GluMet: 2.957 ± 0.485
5.767GluAsn: 5.767 ± 0.806
1.774GluPro: 1.774 ± 0.394
3.253GluGln: 3.253 ± 0.47
4.214GluArg: 4.214 ± 0.592
3.253GluSer: 3.253 ± 0.447
3.105GluThr: 3.105 ± 0.44
5.619GluVal: 5.619 ± 0.775
0.887GluTrp: 0.887 ± 0.251
3.179GluTyr: 3.179 ± 0.467
0.0GluXaa: 0.0 ± 0.0
Phe
2.366PheAla: 2.366 ± 0.425
0.37PheCys: 0.37 ± 0.178
2.883PheAsp: 2.883 ± 0.476
3.549PheGlu: 3.549 ± 0.592
2.07PhePhe: 2.07 ± 0.333
2.883PheGly: 2.883 ± 0.493
0.148PheHis: 0.148 ± 0.112
3.327PheIle: 3.327 ± 0.443
5.101PheLys: 5.101 ± 0.556
2.514PheLeu: 2.514 ± 0.557
0.739PheMet: 0.739 ± 0.205
3.475PheAsn: 3.475 ± 0.475
0.444PhePro: 0.444 ± 0.18
0.887PheGln: 0.887 ± 0.237
1.848PheArg: 1.848 ± 0.338
2.514PheSer: 2.514 ± 0.529
2.514PheThr: 2.514 ± 0.475
3.549PheVal: 3.549 ± 0.561
0.37PheTrp: 0.37 ± 0.154
1.405PheTyr: 1.405 ± 0.277
0.0PheXaa: 0.0 ± 0.0
Gly
3.253GlyAla: 3.253 ± 0.663
0.222GlyCys: 0.222 ± 0.138
2.662GlyAsp: 2.662 ± 0.492
3.401GlyGlu: 3.401 ± 0.57
2.957GlyPhe: 2.957 ± 0.492
2.44GlyGly: 2.44 ± 0.482
1.035GlyHis: 1.035 ± 0.267
3.918GlyIle: 3.918 ± 0.534
5.619GlyLys: 5.619 ± 0.931
4.362GlyLeu: 4.362 ± 0.685
1.109GlyMet: 1.109 ± 0.248
2.514GlyAsn: 2.514 ± 0.524
0.37GlyPro: 0.37 ± 0.217
1.774GlyGln: 1.774 ± 0.324
2.957GlyArg: 2.957 ± 0.392
3.031GlySer: 3.031 ± 0.459
3.327GlyThr: 3.327 ± 0.382
4.288GlyVal: 4.288 ± 0.638
0.37GlyTrp: 0.37 ± 0.154
3.031GlyTyr: 3.031 ± 0.509
0.0GlyXaa: 0.0 ± 0.0
His
0.665HisAla: 0.665 ± 0.2
0.444HisCys: 0.444 ± 0.174
1.774HisAsp: 1.774 ± 0.448
1.331HisGlu: 1.331 ± 0.361
0.444HisPhe: 0.444 ± 0.208
0.887HisGly: 0.887 ± 0.266
0.518HisHis: 0.518 ± 0.173
1.331HisIle: 1.331 ± 0.435
1.553HisLys: 1.553 ± 0.372
1.774HisLeu: 1.774 ± 0.351
0.148HisMet: 0.148 ± 0.092
1.035HisAsn: 1.035 ± 0.246
0.591HisPro: 0.591 ± 0.193
1.183HisGln: 1.183 ± 0.302
0.444HisArg: 0.444 ± 0.175
1.257HisSer: 1.257 ± 0.251
1.109HisThr: 1.109 ± 0.298
0.961HisVal: 0.961 ± 0.271
0.296HisTrp: 0.296 ± 0.133
1.183HisTyr: 1.183 ± 0.329
0.0HisXaa: 0.0 ± 0.0
Ile
4.214IleAla: 4.214 ± 0.661
0.296IleCys: 0.296 ± 0.125
6.21IleAsp: 6.21 ± 0.553
5.027IleGlu: 5.027 ± 0.763
3.031IlePhe: 3.031 ± 0.645
3.992IleGly: 3.992 ± 0.74
1.035IleHis: 1.035 ± 0.286
4.806IleIle: 4.806 ± 0.654
7.024IleLys: 7.024 ± 0.568
4.806IleLeu: 4.806 ± 0.553
1.183IleMet: 1.183 ± 0.343
5.619IleAsn: 5.619 ± 0.693
3.031IlePro: 3.031 ± 0.423
2.883IleGln: 2.883 ± 0.491
2.809IleArg: 2.809 ± 0.416
4.214IleSer: 4.214 ± 0.72
4.066IleThr: 4.066 ± 0.561
4.436IleVal: 4.436 ± 0.504
0.813IleTrp: 0.813 ± 0.251
3.992IleTyr: 3.992 ± 0.563
0.0IleXaa: 0.0 ± 0.0
Lys
5.915LysAla: 5.915 ± 0.755
0.148LysCys: 0.148 ± 0.092
6.728LysAsp: 6.728 ± 0.764
6.432LysGlu: 6.432 ± 0.826
3.401LysPhe: 3.401 ± 0.506
4.879LysGly: 4.879 ± 0.636
1.331LysHis: 1.331 ± 0.283
7.911LysIle: 7.911 ± 0.72
7.615LysLys: 7.615 ± 0.881
6.876LysLeu: 6.876 ± 0.734
2.588LysMet: 2.588 ± 0.393
5.471LysAsn: 5.471 ± 0.627
2.809LysPro: 2.809 ± 0.742
5.249LysGln: 5.249 ± 0.666
4.214LysArg: 4.214 ± 0.586
5.988LysSer: 5.988 ± 0.662
6.062LysThr: 6.062 ± 0.756
5.619LysVal: 5.619 ± 0.641
1.331LysTrp: 1.331 ± 0.242
4.288LysTyr: 4.288 ± 0.622
0.0LysXaa: 0.0 ± 0.0
Leu
4.362LeuAla: 4.362 ± 0.441
0.296LeuCys: 0.296 ± 0.134
6.136LeuAsp: 6.136 ± 0.681
6.062LeuGlu: 6.062 ± 0.769
3.179LeuPhe: 3.179 ± 0.436
4.214LeuGly: 4.214 ± 0.607
1.774LeuHis: 1.774 ± 0.39
5.841LeuIle: 5.841 ± 0.689
8.946LeuLys: 8.946 ± 0.865
6.506LeuLeu: 6.506 ± 0.658
1.922LeuMet: 1.922 ± 0.345
6.358LeuAsn: 6.358 ± 0.64
2.44LeuPro: 2.44 ± 0.371
2.735LeuGln: 2.735 ± 0.398
3.105LeuArg: 3.105 ± 0.486
5.397LeuSer: 5.397 ± 0.871
5.545LeuThr: 5.545 ± 0.586
4.066LeuVal: 4.066 ± 0.537
0.813LeuTrp: 0.813 ± 0.348
2.883LeuTyr: 2.883 ± 0.429
0.0LeuXaa: 0.0 ± 0.0
Met
1.553MetAla: 1.553 ± 0.3
0.0MetCys: 0.0 ± 0.0
1.257MetAsp: 1.257 ± 0.267
1.7MetGlu: 1.7 ± 0.352
0.887MetPhe: 0.887 ± 0.274
0.665MetGly: 0.665 ± 0.291
0.37MetHis: 0.37 ± 0.134
1.848MetIle: 1.848 ± 0.331
2.44MetLys: 2.44 ± 0.442
1.7MetLeu: 1.7 ± 0.375
0.518MetMet: 0.518 ± 0.212
1.626MetAsn: 1.626 ± 0.37
0.296MetPro: 0.296 ± 0.133
0.887MetGln: 0.887 ± 0.249
1.183MetArg: 1.183 ± 0.277
1.848MetSer: 1.848 ± 0.377
1.7MetThr: 1.7 ± 0.366
0.813MetVal: 0.813 ± 0.271
0.37MetTrp: 0.37 ± 0.161
1.183MetTyr: 1.183 ± 0.337
0.0MetXaa: 0.0 ± 0.0
Asn
4.658AsnAla: 4.658 ± 0.666
0.222AsnCys: 0.222 ± 0.131
4.362AsnAsp: 4.362 ± 0.755
5.545AsnGlu: 5.545 ± 0.738
2.514AsnPhe: 2.514 ± 0.368
4.732AsnGly: 4.732 ± 0.648
1.035AsnHis: 1.035 ± 0.29
3.253AsnIle: 3.253 ± 0.448
6.358AsnLys: 6.358 ± 0.701
4.732AsnLeu: 4.732 ± 0.64
1.7AsnMet: 1.7 ± 0.335
4.658AsnAsn: 4.658 ± 0.703
2.588AsnPro: 2.588 ± 0.428
1.996AsnGln: 1.996 ± 0.344
2.292AsnArg: 2.292 ± 0.37
3.844AsnSer: 3.844 ± 0.605
4.214AsnThr: 4.214 ± 0.443
5.471AsnVal: 5.471 ± 0.65
0.887AsnTrp: 0.887 ± 0.283
2.292AsnTyr: 2.292 ± 0.49
0.0AsnXaa: 0.0 ± 0.0
Pro
1.257ProAla: 1.257 ± 0.332
0.222ProCys: 0.222 ± 0.111
1.7ProAsp: 1.7 ± 0.399
1.848ProGlu: 1.848 ± 0.392
1.183ProPhe: 1.183 ± 0.341
1.331ProGly: 1.331 ± 0.337
0.444ProHis: 0.444 ± 0.155
2.144ProIle: 2.144 ± 0.51
2.809ProLys: 2.809 ± 0.56
1.331ProLeu: 1.331 ± 0.318
0.37ProMet: 0.37 ± 0.146
2.07ProAsn: 2.07 ± 0.34
0.518ProPro: 0.518 ± 0.203
1.035ProGln: 1.035 ± 0.329
0.813ProArg: 0.813 ± 0.185
1.553ProSer: 1.553 ± 0.312
1.774ProThr: 1.774 ± 0.387
1.183ProVal: 1.183 ± 0.37
0.074ProTrp: 0.074 ± 0.069
0.887ProTyr: 0.887 ± 0.319
0.0ProXaa: 0.0 ± 0.0
Gln
2.292GlnAla: 2.292 ± 0.427
0.37GlnCys: 0.37 ± 0.155
2.218GlnAsp: 2.218 ± 0.334
1.922GlnGlu: 1.922 ± 0.384
1.331GlnPhe: 1.331 ± 0.282
1.626GlnGly: 1.626 ± 0.285
0.665GlnHis: 0.665 ± 0.212
2.957GlnIle: 2.957 ± 0.473
4.214GlnLys: 4.214 ± 0.569
3.918GlnLeu: 3.918 ± 0.567
0.444GlnMet: 0.444 ± 0.168
2.735GlnAsn: 2.735 ± 0.434
0.444GlnPro: 0.444 ± 0.215
1.331GlnGln: 1.331 ± 0.335
1.257GlnArg: 1.257 ± 0.283
1.774GlnSer: 1.774 ± 0.325
2.44GlnThr: 2.44 ± 0.412
2.292GlnVal: 2.292 ± 0.411
0.222GlnTrp: 0.222 ± 0.1
1.626GlnTyr: 1.626 ± 0.311
0.0GlnXaa: 0.0 ± 0.0
Arg
2.292ArgAla: 2.292 ± 0.351
0.222ArgCys: 0.222 ± 0.136
2.588ArgAsp: 2.588 ± 0.396
2.44ArgGlu: 2.44 ± 0.404
2.218ArgPhe: 2.218 ± 0.395
2.144ArgGly: 2.144 ± 0.423
1.257ArgHis: 1.257 ± 0.377
3.697ArgIle: 3.697 ± 0.577
3.327ArgLys: 3.327 ± 0.515
4.806ArgLeu: 4.806 ± 0.527
1.257ArgMet: 1.257 ± 0.285
1.922ArgAsn: 1.922 ± 0.315
1.035ArgPro: 1.035 ± 0.273
1.553ArgGln: 1.553 ± 0.33
1.848ArgArg: 1.848 ± 0.375
1.996ArgSer: 1.996 ± 0.359
1.848ArgThr: 1.848 ± 0.366
2.366ArgVal: 2.366 ± 0.317
0.665ArgTrp: 0.665 ± 0.208
2.292ArgTyr: 2.292 ± 0.357
0.0ArgXaa: 0.0 ± 0.0
Ser
3.992SerAla: 3.992 ± 0.599
0.148SerCys: 0.148 ± 0.092
2.957SerAsp: 2.957 ± 0.434
5.101SerGlu: 5.101 ± 0.526
2.735SerPhe: 2.735 ± 0.541
3.918SerGly: 3.918 ± 0.471
1.331SerHis: 1.331 ± 0.354
4.732SerIle: 4.732 ± 0.613
5.323SerLys: 5.323 ± 0.611
5.619SerLeu: 5.619 ± 0.609
1.183SerMet: 1.183 ± 0.288
4.066SerAsn: 4.066 ± 0.613
1.479SerPro: 1.479 ± 0.323
1.405SerGln: 1.405 ± 0.299
2.514SerArg: 2.514 ± 0.41
3.031SerSer: 3.031 ± 0.655
3.992SerThr: 3.992 ± 0.451
3.401SerVal: 3.401 ± 0.475
0.222SerTrp: 0.222 ± 0.157
1.626SerTyr: 1.626 ± 0.318
0.0SerXaa: 0.0 ± 0.0
Thr
3.771ThrAla: 3.771 ± 0.441
0.0ThrCys: 0.0 ± 0.0
3.105ThrAsp: 3.105 ± 0.46
4.14ThrGlu: 4.14 ± 0.664
2.44ThrPhe: 2.44 ± 0.525
3.105ThrGly: 3.105 ± 0.388
2.07ThrHis: 2.07 ± 0.37
5.101ThrIle: 5.101 ± 0.622
5.545ThrLys: 5.545 ± 0.662
5.693ThrLeu: 5.693 ± 0.611
0.813ThrMet: 0.813 ± 0.192
3.475ThrAsn: 3.475 ± 0.594
1.774ThrPro: 1.774 ± 0.336
1.996ThrGln: 1.996 ± 0.329
1.922ThrArg: 1.922 ± 0.339
3.401ThrSer: 3.401 ± 0.537
4.732ThrThr: 4.732 ± 0.557
4.14ThrVal: 4.14 ± 0.793
0.591ThrTrp: 0.591 ± 0.218
2.883ThrTyr: 2.883 ± 0.404
0.0ThrXaa: 0.0 ± 0.0
Val
4.288ValAla: 4.288 ± 0.562
0.296ValCys: 0.296 ± 0.151
5.027ValAsp: 5.027 ± 0.528
5.841ValGlu: 5.841 ± 0.687
2.44ValPhe: 2.44 ± 0.449
2.957ValGly: 2.957 ± 0.504
0.961ValHis: 0.961 ± 0.238
3.918ValIle: 3.918 ± 0.416
5.915ValLys: 5.915 ± 0.676
4.066ValLeu: 4.066 ± 0.604
1.035ValMet: 1.035 ± 0.287
4.066ValAsn: 4.066 ± 0.52
2.07ValPro: 2.07 ± 0.441
2.218ValGln: 2.218 ± 0.356
2.366ValArg: 2.366 ± 0.278
5.027ValSer: 5.027 ± 0.595
3.623ValThr: 3.623 ± 0.626
4.066ValVal: 4.066 ± 0.585
0.591ValTrp: 0.591 ± 0.206
2.588ValTyr: 2.588 ± 0.491
0.0ValXaa: 0.0 ± 0.0
Trp
0.961TrpAla: 0.961 ± 0.271
0.296TrpCys: 0.296 ± 0.159
0.739TrpAsp: 0.739 ± 0.257
0.518TrpGlu: 0.518 ± 0.198
0.37TrpPhe: 0.37 ± 0.175
0.444TrpGly: 0.444 ± 0.184
0.148TrpHis: 0.148 ± 0.108
0.444TrpIle: 0.444 ± 0.162
0.813TrpLys: 0.813 ± 0.283
1.035TrpLeu: 1.035 ± 0.274
0.148TrpMet: 0.148 ± 0.1
1.109TrpAsn: 1.109 ± 0.561
0.074TrpPro: 0.074 ± 0.079
0.444TrpGln: 0.444 ± 0.202
0.591TrpArg: 0.591 ± 0.18
0.961TrpSer: 0.961 ± 0.249
0.665TrpThr: 0.665 ± 0.216
0.444TrpVal: 0.444 ± 0.183
0.0TrpTrp: 0.0 ± 0.0
0.518TrpTyr: 0.518 ± 0.21
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.848TyrAla: 1.848 ± 0.332
0.148TyrCys: 0.148 ± 0.101
3.549TyrAsp: 3.549 ± 0.517
3.253TyrGlu: 3.253 ± 0.422
1.774TyrPhe: 1.774 ± 0.45
2.44TyrGly: 2.44 ± 0.409
1.035TyrHis: 1.035 ± 0.356
2.957TyrIle: 2.957 ± 0.623
4.879TyrLys: 4.879 ± 0.707
2.809TyrLeu: 2.809 ± 0.418
0.961TyrMet: 0.961 ± 0.281
2.883TyrAsn: 2.883 ± 0.476
0.813TyrPro: 0.813 ± 0.219
1.774TyrGln: 1.774 ± 0.365
2.366TyrArg: 2.366 ± 0.523
2.588TyrSer: 2.588 ± 0.367
2.218TyrThr: 2.218 ± 0.357
2.366TyrVal: 2.366 ± 0.362
0.739TyrTrp: 0.739 ± 0.34
1.553TyrTyr: 1.553 ± 0.365
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (13527 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski