Amino acid dipepetide frequency for Staphylococcus phage phiSa2wa_st5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.417AlaAla: 2.417 ± 0.841
0.207AlaCys: 0.207 ± 0.126
2.417AlaAsp: 2.417 ± 0.539
4.35AlaGlu: 4.35 ± 0.512
1.243AlaPhe: 1.243 ± 0.253
3.384AlaGly: 3.384 ± 0.8
1.105AlaHis: 1.105 ± 0.269
4.143AlaIle: 4.143 ± 0.677
6.146AlaLys: 6.146 ± 1.177
5.11AlaLeu: 5.11 ± 0.743
0.898AlaMet: 0.898 ± 0.223
4.626AlaAsn: 4.626 ± 0.746
1.45AlaPro: 1.45 ± 0.331
1.519AlaGln: 1.519 ± 0.385
2.555AlaArg: 2.555 ± 0.385
4.074AlaSer: 4.074 ± 0.681
3.798AlaThr: 3.798 ± 0.548
2.831AlaVal: 2.831 ± 0.515
0.967AlaTrp: 0.967 ± 0.316
2.141AlaTyr: 2.141 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
0.138CysAla: 0.138 ± 0.11
0.069CysCys: 0.069 ± 0.085
0.207CysAsp: 0.207 ± 0.134
0.552CysGlu: 0.552 ± 0.28
0.483CysPhe: 0.483 ± 0.207
0.276CysGly: 0.276 ± 0.153
0.138CysHis: 0.138 ± 0.094
0.414CysIle: 0.414 ± 0.159
0.483CysLys: 0.483 ± 0.216
0.345CysLeu: 0.345 ± 0.181
0.138CysMet: 0.138 ± 0.098
0.414CysAsn: 0.414 ± 0.181
0.0CysPro: 0.0 ± 0.0
0.069CysGln: 0.069 ± 0.065
0.207CysArg: 0.207 ± 0.123
0.276CysSer: 0.276 ± 0.171
0.345CysThr: 0.345 ± 0.151
0.483CysVal: 0.483 ± 0.248
0.0CysTrp: 0.0 ± 0.0
0.276CysTyr: 0.276 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
3.245AspAla: 3.245 ± 0.596
0.345AspCys: 0.345 ± 0.194
3.936AspAsp: 3.936 ± 0.674
4.419AspGlu: 4.419 ± 0.738
3.176AspPhe: 3.176 ± 0.5
3.591AspGly: 3.591 ± 0.644
0.483AspHis: 0.483 ± 0.149
5.248AspIle: 5.248 ± 0.547
6.836AspLys: 6.836 ± 0.879
5.248AspLeu: 5.248 ± 0.538
1.933AspMet: 1.933 ± 0.388
3.591AspAsn: 3.591 ± 0.45
0.898AspPro: 0.898 ± 0.27
1.174AspGln: 1.174 ± 0.217
2.21AspArg: 2.21 ± 0.42
3.798AspSer: 3.798 ± 0.626
3.384AspThr: 3.384 ± 0.443
3.867AspVal: 3.867 ± 0.489
0.967AspTrp: 0.967 ± 0.228
3.176AspTyr: 3.176 ± 0.518
0.0AspXaa: 0.0 ± 0.0
Glu
4.074GluAla: 4.074 ± 0.483
0.621GluCys: 0.621 ± 0.232
4.074GluAsp: 4.074 ± 0.647
5.662GluGlu: 5.662 ± 1.017
3.038GluPhe: 3.038 ± 0.455
2.9GluGly: 2.9 ± 0.401
0.967GluHis: 0.967 ± 0.231
5.869GluIle: 5.869 ± 0.989
8.217GluLys: 8.217 ± 0.896
6.698GluLeu: 6.698 ± 0.866
3.107GluMet: 3.107 ± 0.421
5.248GluAsn: 5.248 ± 0.612
1.243GluPro: 1.243 ± 0.334
3.245GluGln: 3.245 ± 0.549
3.107GluArg: 3.107 ± 0.496
3.384GluSer: 3.384 ± 0.556
4.419GluThr: 4.419 ± 0.638
4.281GluVal: 4.281 ± 0.596
0.967GluTrp: 0.967 ± 0.22
3.038GluTyr: 3.038 ± 0.63
0.0GluXaa: 0.0 ± 0.0
Phe
1.795PheAla: 1.795 ± 0.353
0.207PheCys: 0.207 ± 0.122
2.969PheAsp: 2.969 ± 0.506
3.245PheGlu: 3.245 ± 0.549
1.105PhePhe: 1.105 ± 0.257
3.038PheGly: 3.038 ± 0.554
0.621PheHis: 0.621 ± 0.207
3.522PheIle: 3.522 ± 0.625
4.35PheLys: 4.35 ± 0.549
2.279PheLeu: 2.279 ± 0.406
1.036PheMet: 1.036 ± 0.233
3.591PheAsn: 3.591 ± 0.461
0.691PhePro: 0.691 ± 0.238
1.036PheGln: 1.036 ± 0.229
1.312PheArg: 1.312 ± 0.284
1.933PheSer: 1.933 ± 0.356
1.933PheThr: 1.933 ± 0.377
2.141PheVal: 2.141 ± 0.444
0.207PheTrp: 0.207 ± 0.103
1.588PheTyr: 1.588 ± 0.374
0.0PheXaa: 0.0 ± 0.0
Gly
3.66GlyAla: 3.66 ± 0.978
0.276GlyCys: 0.276 ± 0.14
3.867GlyAsp: 3.867 ± 0.44
3.245GlyGlu: 3.245 ± 0.424
2.624GlyPhe: 2.624 ± 0.365
4.834GlyGly: 4.834 ± 1.147
1.45GlyHis: 1.45 ± 0.398
3.591GlyIle: 3.591 ± 0.51
5.11GlyLys: 5.11 ± 0.537
4.834GlyLeu: 4.834 ± 0.839
1.312GlyMet: 1.312 ± 0.389
3.522GlyAsn: 3.522 ± 0.569
0.829GlyPro: 0.829 ± 0.223
1.864GlyGln: 1.864 ± 0.455
1.933GlyArg: 1.933 ± 0.423
4.281GlySer: 4.281 ± 0.529
3.176GlyThr: 3.176 ± 0.565
4.834GlyVal: 4.834 ± 0.666
1.174GlyTrp: 1.174 ± 0.272
3.453GlyTyr: 3.453 ± 0.58
0.0GlyXaa: 0.0 ± 0.0
His
1.174HisAla: 1.174 ± 0.274
0.069HisCys: 0.069 ± 0.078
0.621HisAsp: 0.621 ± 0.194
1.105HisGlu: 1.105 ± 0.29
0.691HisPhe: 0.691 ± 0.171
1.243HisGly: 1.243 ± 0.316
0.276HisHis: 0.276 ± 0.157
1.243HisIle: 1.243 ± 0.371
1.105HisLys: 1.105 ± 0.255
1.45HisLeu: 1.45 ± 0.295
0.276HisMet: 0.276 ± 0.129
1.381HisAsn: 1.381 ± 0.337
0.621HisPro: 0.621 ± 0.169
0.621HisGln: 0.621 ± 0.159
0.76HisArg: 0.76 ± 0.182
0.691HisSer: 0.691 ± 0.181
1.312HisThr: 1.312 ± 0.325
0.76HisVal: 0.76 ± 0.249
0.276HisTrp: 0.276 ± 0.131
0.967HisTyr: 0.967 ± 0.293
0.0HisXaa: 0.0 ± 0.0
Ile
4.281IleAla: 4.281 ± 0.591
0.414IleCys: 0.414 ± 0.224
4.972IleAsp: 4.972 ± 0.74
5.524IleGlu: 5.524 ± 0.626
2.9IlePhe: 2.9 ± 0.577
3.936IleGly: 3.936 ± 0.491
1.657IleHis: 1.657 ± 0.33
3.798IleIle: 3.798 ± 0.585
8.562IleLys: 8.562 ± 0.718
5.317IleLeu: 5.317 ± 0.788
1.864IleMet: 1.864 ± 0.442
5.662IleAsn: 5.662 ± 0.534
2.21IlePro: 2.21 ± 0.32
2.279IleGln: 2.279 ± 0.35
2.9IleArg: 2.9 ± 0.435
5.041IleSer: 5.041 ± 0.461
4.626IleThr: 4.626 ± 0.481
3.936IleVal: 3.936 ± 0.723
0.621IleTrp: 0.621 ± 0.197
2.624IleTyr: 2.624 ± 0.442
0.0IleXaa: 0.0 ± 0.0
Lys
7.527LysAla: 7.527 ± 1.035
0.276LysCys: 0.276 ± 0.15
5.455LysAsp: 5.455 ± 0.57
8.424LysGlu: 8.424 ± 0.901
2.555LysPhe: 2.555 ± 0.39
5.869LysGly: 5.869 ± 0.867
1.381LysHis: 1.381 ± 0.251
6.146LysIle: 6.146 ± 0.73
8.079LysLys: 8.079 ± 1.077
8.7LysLeu: 8.7 ± 0.79
3.522LysMet: 3.522 ± 0.58
6.422LysAsn: 6.422 ± 0.57
2.348LysPro: 2.348 ± 0.38
5.041LysGln: 5.041 ± 0.557
4.143LysArg: 4.143 ± 0.704
6.146LysSer: 6.146 ± 1.254
5.593LysThr: 5.593 ± 0.735
5.386LysVal: 5.386 ± 0.731
1.588LysTrp: 1.588 ± 0.401
5.386LysTyr: 5.386 ± 0.582
0.0LysXaa: 0.0 ± 0.0
Leu
3.798LeuAla: 3.798 ± 0.848
0.621LeuCys: 0.621 ± 0.257
4.903LeuAsp: 4.903 ± 0.904
6.146LeuGlu: 6.146 ± 0.729
3.245LeuPhe: 3.245 ± 0.481
4.35LeuGly: 4.35 ± 0.912
1.243LeuHis: 1.243 ± 0.358
5.938LeuIle: 5.938 ± 0.736
8.493LeuLys: 8.493 ± 1.046
6.284LeuLeu: 6.284 ± 0.73
1.795LeuMet: 1.795 ± 0.341
5.731LeuAsn: 5.731 ± 0.56
2.348LeuPro: 2.348 ± 0.442
2.762LeuGln: 2.762 ± 0.458
3.522LeuArg: 3.522 ± 0.526
5.179LeuSer: 5.179 ± 0.648
5.317LeuThr: 5.317 ± 0.826
3.66LeuVal: 3.66 ± 0.413
0.414LeuTrp: 0.414 ± 0.187
3.314LeuTyr: 3.314 ± 0.744
0.0LeuXaa: 0.0 ± 0.0
Met
1.174MetAla: 1.174 ± 0.275
0.276MetCys: 0.276 ± 0.162
1.45MetAsp: 1.45 ± 0.378
1.105MetGlu: 1.105 ± 0.292
1.036MetPhe: 1.036 ± 0.269
1.519MetGly: 1.519 ± 0.412
0.414MetHis: 0.414 ± 0.2
1.795MetIle: 1.795 ± 0.282
2.9MetLys: 2.9 ± 0.522
1.933MetLeu: 1.933 ± 0.419
0.483MetMet: 0.483 ± 0.161
2.417MetAsn: 2.417 ± 0.444
1.036MetPro: 1.036 ± 0.229
1.45MetGln: 1.45 ± 0.434
0.898MetArg: 0.898 ± 0.247
2.002MetSer: 2.002 ± 0.393
2.417MetThr: 2.417 ± 0.374
1.174MetVal: 1.174 ± 0.228
0.414MetTrp: 0.414 ± 0.165
1.105MetTyr: 1.105 ± 0.26
0.0MetXaa: 0.0 ± 0.0
Asn
3.522AsnAla: 3.522 ± 0.585
0.207AsnCys: 0.207 ± 0.13
4.903AsnAsp: 4.903 ± 0.513
4.972AsnGlu: 4.972 ± 0.669
2.141AsnPhe: 2.141 ± 0.393
4.695AsnGly: 4.695 ± 0.556
0.691AsnHis: 0.691 ± 0.199
4.834AsnIle: 4.834 ± 0.417
7.319AsnLys: 7.319 ± 0.805
4.903AsnLeu: 4.903 ± 0.463
1.243AsnMet: 1.243 ± 0.274
5.179AsnAsn: 5.179 ± 0.944
2.555AsnPro: 2.555 ± 0.399
3.314AsnGln: 3.314 ± 0.443
3.176AsnArg: 3.176 ± 0.533
4.281AsnSer: 4.281 ± 0.702
4.143AsnThr: 4.143 ± 0.533
3.936AsnVal: 3.936 ± 0.537
1.174AsnTrp: 1.174 ± 0.298
3.038AsnTyr: 3.038 ± 0.496
0.0AsnXaa: 0.0 ± 0.0
Pro
0.967ProAla: 0.967 ± 0.237
0.207ProCys: 0.207 ± 0.123
1.174ProAsp: 1.174 ± 0.277
2.072ProGlu: 2.072 ± 0.479
1.45ProPhe: 1.45 ± 0.368
1.174ProGly: 1.174 ± 0.343
0.138ProHis: 0.138 ± 0.106
2.072ProIle: 2.072 ± 0.418
2.21ProLys: 2.21 ± 0.414
2.072ProLeu: 2.072 ± 0.488
0.829ProMet: 0.829 ± 0.253
1.588ProAsn: 1.588 ± 0.302
0.621ProPro: 0.621 ± 0.208
1.243ProGln: 1.243 ± 0.26
1.036ProArg: 1.036 ± 0.265
2.072ProSer: 2.072 ± 0.293
1.657ProThr: 1.657 ± 0.38
1.036ProVal: 1.036 ± 0.342
0.207ProTrp: 0.207 ± 0.106
1.174ProTyr: 1.174 ± 0.323
0.0ProXaa: 0.0 ± 0.0
Gln
2.555GlnAla: 2.555 ± 0.428
0.207GlnCys: 0.207 ± 0.127
1.864GlnAsp: 1.864 ± 0.369
2.969GlnGlu: 2.969 ± 0.42
1.312GlnPhe: 1.312 ± 0.286
2.002GlnGly: 2.002 ± 0.407
0.898GlnHis: 0.898 ± 0.224
3.107GlnIle: 3.107 ± 0.522
3.245GlnLys: 3.245 ± 0.396
3.107GlnLeu: 3.107 ± 0.449
1.105GlnMet: 1.105 ± 0.302
2.348GlnAsn: 2.348 ± 0.467
1.105GlnPro: 1.105 ± 0.276
2.002GlnGln: 2.002 ± 0.642
2.279GlnArg: 2.279 ± 0.379
2.693GlnSer: 2.693 ± 0.419
1.519GlnThr: 1.519 ± 0.33
1.933GlnVal: 1.933 ± 0.326
0.207GlnTrp: 0.207 ± 0.118
1.726GlnTyr: 1.726 ± 0.355
0.0GlnXaa: 0.0 ± 0.0
Arg
1.933ArgAla: 1.933 ± 0.467
0.207ArgCys: 0.207 ± 0.148
2.624ArgAsp: 2.624 ± 0.347
3.038ArgGlu: 3.038 ± 0.547
1.519ArgPhe: 1.519 ± 0.317
2.21ArgGly: 2.21 ± 0.373
1.036ArgHis: 1.036 ± 0.271
3.798ArgIle: 3.798 ± 0.526
4.419ArgLys: 4.419 ± 0.527
2.831ArgLeu: 2.831 ± 0.436
1.381ArgMet: 1.381 ± 0.307
2.693ArgAsn: 2.693 ± 0.462
0.967ArgPro: 0.967 ± 0.305
1.312ArgGln: 1.312 ± 0.266
2.002ArgArg: 2.002 ± 0.345
1.795ArgSer: 1.795 ± 0.398
2.072ArgThr: 2.072 ± 0.395
1.864ArgVal: 1.864 ± 0.331
0.414ArgTrp: 0.414 ± 0.146
2.417ArgTyr: 2.417 ± 0.471
0.0ArgXaa: 0.0 ± 0.0
Ser
3.453SerAla: 3.453 ± 0.847
0.276SerCys: 0.276 ± 0.128
5.041SerAsp: 5.041 ± 0.473
4.557SerGlu: 4.557 ± 0.515
2.9SerPhe: 2.9 ± 0.518
4.419SerGly: 4.419 ± 0.863
0.898SerHis: 0.898 ± 0.235
4.212SerIle: 4.212 ± 0.645
6.56SerLys: 6.56 ± 1.038
4.005SerLeu: 4.005 ± 0.491
1.726SerMet: 1.726 ± 0.311
5.179SerAsn: 5.179 ± 0.562
1.312SerPro: 1.312 ± 0.355
2.693SerGln: 2.693 ± 0.452
2.141SerArg: 2.141 ± 0.389
4.143SerSer: 4.143 ± 0.613
2.831SerThr: 2.831 ± 0.496
3.522SerVal: 3.522 ± 0.586
0.691SerTrp: 0.691 ± 0.216
2.693SerTyr: 2.693 ± 0.467
0.0SerXaa: 0.0 ± 0.0
Thr
4.074ThrAla: 4.074 ± 0.616
0.138ThrCys: 0.138 ± 0.109
3.591ThrAsp: 3.591 ± 0.425
3.729ThrGlu: 3.729 ± 0.546
2.348ThrPhe: 2.348 ± 0.359
4.212ThrGly: 4.212 ± 0.587
1.312ThrHis: 1.312 ± 0.344
4.557ThrIle: 4.557 ± 0.609
5.524ThrLys: 5.524 ± 0.815
4.488ThrLeu: 4.488 ± 0.482
1.105ThrMet: 1.105 ± 0.272
3.245ThrAsn: 3.245 ± 0.478
2.348ThrPro: 2.348 ± 0.311
1.864ThrGln: 1.864 ± 0.305
1.933ThrArg: 1.933 ± 0.301
3.591ThrSer: 3.591 ± 0.564
3.107ThrThr: 3.107 ± 0.489
4.419ThrVal: 4.419 ± 0.517
0.552ThrTrp: 0.552 ± 0.225
2.486ThrTyr: 2.486 ± 0.489
0.0ThrXaa: 0.0 ± 0.0
Val
2.9ValAla: 2.9 ± 0.518
0.276ValCys: 0.276 ± 0.161
3.936ValAsp: 3.936 ± 0.702
4.972ValGlu: 4.972 ± 0.47
2.417ValPhe: 2.417 ± 0.506
3.107ValGly: 3.107 ± 0.553
0.829ValHis: 0.829 ± 0.203
4.557ValIle: 4.557 ± 0.606
5.662ValLys: 5.662 ± 0.642
4.35ValLeu: 4.35 ± 0.47
1.243ValMet: 1.243 ± 0.295
4.005ValAsn: 4.005 ± 0.452
1.381ValPro: 1.381 ± 0.315
2.002ValGln: 2.002 ± 0.305
1.933ValArg: 1.933 ± 0.375
3.867ValSer: 3.867 ± 0.5
3.729ValThr: 3.729 ± 0.672
2.831ValVal: 2.831 ± 0.348
0.483ValTrp: 0.483 ± 0.205
1.933ValTyr: 1.933 ± 0.384
0.0ValXaa: 0.0 ± 0.0
Trp
0.414TrpAla: 0.414 ± 0.167
0.0TrpCys: 0.0 ± 0.0
0.621TrpAsp: 0.621 ± 0.218
1.036TrpGlu: 1.036 ± 0.317
1.174TrpPhe: 1.174 ± 0.329
0.414TrpGly: 0.414 ± 0.187
0.0TrpHis: 0.0 ± 0.0
1.243TrpIle: 1.243 ± 0.308
0.76TrpLys: 0.76 ± 0.232
1.036TrpLeu: 1.036 ± 0.259
0.552TrpMet: 0.552 ± 0.247
0.76TrpAsn: 0.76 ± 0.19
0.069TrpPro: 0.069 ± 0.063
0.552TrpGln: 0.552 ± 0.149
0.414TrpArg: 0.414 ± 0.168
0.967TrpSer: 0.967 ± 0.297
0.621TrpThr: 0.621 ± 0.168
0.76TrpVal: 0.76 ± 0.236
0.138TrpTrp: 0.138 ± 0.112
0.621TrpTyr: 0.621 ± 0.231
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.279TyrAla: 2.279 ± 0.348
0.414TyrCys: 0.414 ± 0.17
2.9TyrAsp: 2.9 ± 0.534
3.038TyrGlu: 3.038 ± 0.648
1.243TyrPhe: 1.243 ± 0.322
2.693TyrGly: 2.693 ± 0.541
1.105TyrHis: 1.105 ± 0.313
3.038TyrIle: 3.038 ± 0.578
4.005TyrLys: 4.005 ± 0.558
4.074TyrLeu: 4.074 ± 0.689
1.381TyrMet: 1.381 ± 0.287
2.624TyrAsn: 2.624 ± 0.509
0.967TyrPro: 0.967 ± 0.248
2.141TyrGln: 2.141 ± 0.38
2.072TyrArg: 2.072 ± 0.446
3.038TyrSer: 3.038 ± 0.389
2.624TyrThr: 2.624 ± 0.416
2.831TyrVal: 2.831 ± 0.54
0.691TyrTrp: 0.691 ± 0.166
1.795TyrTyr: 1.795 ± 0.436
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (14483 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski