Amino acid dipeptide frequency for Staphylococcus phage StauST398-4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
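As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input alphabet, and the choice to normalise by the total number of dipeptides are assumptions made for this example; the database may normalise differently.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies (per mille) for a list of sequences.

    Assumption: frequencies are normalised by the total number of
    dipeptides counted; this is an illustrative choice only.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each sequence.
        # Pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKKLA", "MKA"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input contains six dipeptides in total, two of which are MK, so MK accounts for one third of all pairs.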

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present with a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
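The expected value under the uniform assumption, and a simple over/under comparison against it, can be sketched as follows. The helper names and the plain greater-than/less-than rule are assumptions for illustration; the page does not state the exact rule used for colouring (for instance, whether the error estimate is taken into account).

```python
def expected_uniform_permille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(freq_permille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    expected = expected_uniform_permille(alphabet_size)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"

print(expected_uniform_permille())   # 2.5 per mille, i.e. 0.25%
print(classify(7.222))               # GluLys value from the table: over
print(classify(0.08))                # CysCys value from the table: under
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰.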

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
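For readers who prefer fractions or percentages, the conversion is a single division. The helper names below are hypothetical and exist only for this example.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0

def permille_to_percent(value):
    """Convert a per mille value to a percentage."""
    return value / 10.0

print(permille_to_percent(2.5))  # 0.25
```

For example, the GluLys entry of 7.222‰ corresponds to a fraction of about 0.0072, or about 0.72%.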

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.006 ± 0.676
AlaCys: 0.321 ± 0.146
AlaAsp: 2.648 ± 0.377
AlaGlu: 4.413 ± 0.523
AlaPhe: 2.889 ± 0.65
AlaGly: 4.092 ± 0.686
AlaHis: 0.642 ± 0.235
AlaIle: 4.413 ± 0.662
AlaLys: 5.376 ± 0.622
AlaLeu: 5.136 ± 0.64
AlaMet: 1.525 ± 0.417
AlaAsn: 3.852 ± 0.599
AlaPro: 1.765 ± 0.286
AlaGln: 2.568 ± 0.605
AlaArg: 2.488 ± 0.419
AlaSer: 3.21 ± 0.542
AlaThr: 2.889 ± 0.585
AlaVal: 3.13 ± 0.589
AlaTrp: 0.642 ± 0.246
AlaTyr: 2.488 ± 0.416
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.321 ± 0.171
CysCys: 0.08 ± 0.076
CysAsp: 0.08 ± 0.086
CysGlu: 0.16 ± 0.117
CysPhe: 0.16 ± 0.105
CysGly: 0.401 ± 0.201
CysHis: 0.16 ± 0.125
CysIle: 0.562 ± 0.174
CysLys: 0.241 ± 0.164
CysLeu: 0.481 ± 0.259
CysMet: 0.0 ± 0.0
CysAsn: 0.241 ± 0.132
CysPro: 0.16 ± 0.117
CysGln: 0.16 ± 0.137
CysArg: 0.16 ± 0.155
CysSer: 0.401 ± 0.194
CysThr: 0.321 ± 0.139
CysVal: 0.481 ± 0.208
CysTrp: 0.08 ± 0.089
CysTyr: 0.481 ± 0.19
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.13 ± 0.422
AspCys: 0.08 ± 0.082
AspAsp: 3.852 ± 0.749
AspGlu: 5.697 ± 0.802
AspPhe: 3.21 ± 0.494
AspGly: 5.055 ± 0.71
AspHis: 0.883 ± 0.209
AspIle: 4.975 ± 0.548
AspLys: 5.136 ± 0.687
AspLeu: 4.895 ± 0.577
AspMet: 1.765 ± 0.29
AspAsn: 4.012 ± 0.741
AspPro: 1.605 ± 0.35
AspGln: 0.562 ± 0.174
AspArg: 2.327 ± 0.514
AspSer: 3.21 ± 0.527
AspThr: 3.21 ± 0.512
AspVal: 3.771 ± 0.587
AspTrp: 0.562 ± 0.158
AspTyr: 3.049 ± 0.577
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.494 ± 0.658
GluCys: 0.321 ± 0.167
GluAsp: 3.771 ± 0.592
GluGlu: 6.179 ± 1.017
GluPhe: 2.728 ± 0.453
GluGly: 2.889 ± 0.456
GluHis: 1.364 ± 0.314
GluIle: 5.938 ± 0.843
GluLys: 7.222 ± 0.894
GluLeu: 7.784 ± 0.909
GluMet: 2.327 ± 0.474
GluAsn: 6.099 ± 0.71
GluPro: 1.364 ± 0.255
GluGln: 3.611 ± 0.778
GluArg: 4.333 ± 0.666
GluSer: 4.413 ± 0.517
GluThr: 3.932 ± 0.718
GluVal: 4.574 ± 0.53
GluTrp: 0.642 ± 0.225
GluTyr: 3.45 ± 0.562
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.247 ± 0.552
PheCys: 0.241 ± 0.171
PheAsp: 2.568 ± 0.462
PheGlu: 3.531 ± 0.453
PhePhe: 1.204 ± 0.288
PheGly: 2.247 ± 0.421
PheHis: 0.481 ± 0.19
PheIle: 3.691 ± 0.638
PheLys: 4.494 ± 0.641
PheLeu: 2.809 ± 0.489
PheMet: 1.204 ± 0.326
PheAsn: 3.852 ± 0.539
PhePro: 0.963 ± 0.315
PheGln: 0.642 ± 0.222
PheArg: 1.284 ± 0.291
PheSer: 2.969 ± 0.686
PheThr: 3.13 ± 0.532
PheVal: 2.167 ± 0.399
PheTrp: 0.08 ± 0.07
PheTyr: 1.525 ± 0.405
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.969 ± 0.621
GlyCys: 0.481 ± 0.206
GlyAsp: 3.29 ± 0.567
GlyGlu: 3.45 ± 0.691
GlyPhe: 2.648 ± 0.643
GlyGly: 4.253 ± 1.085
GlyHis: 1.284 ± 0.341
GlyIle: 4.734 ± 0.915
GlyLys: 6.259 ± 0.767
GlyLeu: 5.697 ± 0.682
GlyMet: 1.284 ± 0.296
GlyAsn: 3.37 ± 0.682
GlyPro: 1.284 ± 0.356
GlyGln: 1.765 ± 0.446
GlyArg: 2.889 ± 0.607
GlySer: 2.167 ± 0.39
GlyThr: 2.568 ± 0.513
GlyVal: 3.531 ± 0.701
GlyTrp: 0.883 ± 0.283
GlyTyr: 2.728 ± 0.561
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.204 ± 0.325
HisCys: 0.16 ± 0.106
HisAsp: 1.123 ± 0.308
HisGlu: 1.043 ± 0.305
HisPhe: 1.444 ± 0.331
HisGly: 1.123 ± 0.341
HisHis: 0.321 ± 0.209
HisIle: 1.685 ± 0.412
HisLys: 1.204 ± 0.274
HisLeu: 1.846 ± 0.396
HisMet: 0.401 ± 0.196
HisAsn: 0.963 ± 0.291
HisPro: 0.642 ± 0.216
HisGln: 0.481 ± 0.162
HisArg: 0.401 ± 0.198
HisSer: 0.963 ± 0.203
HisThr: 0.802 ± 0.251
HisVal: 1.043 ± 0.315
HisTrp: 0.241 ± 0.197
HisTyr: 1.204 ± 0.31
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.617 ± 0.691
IleCys: 0.241 ± 0.127
IleAsp: 5.136 ± 0.672
IleGlu: 6.74 ± 0.655
IlePhe: 3.29 ± 0.612
IleGly: 3.37 ± 0.616
IleHis: 1.926 ± 0.515
IleIle: 4.654 ± 0.53
IleLys: 8.666 ± 0.653
IleLeu: 4.333 ± 0.626
IleMet: 1.525 ± 0.267
IleAsn: 6.821 ± 1.285
IlePro: 1.765 ± 0.286
IleGln: 3.37 ± 0.513
IleArg: 2.889 ± 0.498
IleSer: 4.734 ± 0.671
IleThr: 4.092 ± 0.546
IleVal: 5.537 ± 0.569
IleTrp: 0.802 ± 0.46
IleTyr: 2.648 ± 0.551
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.136 ± 0.652
LysCys: 0.321 ± 0.205
LysAsp: 6.179 ± 0.513
LysGlu: 8.987 ± 1.141
LysPhe: 3.45 ± 0.434
LysGly: 5.697 ± 0.828
LysHis: 1.605 ± 0.442
LysIle: 6.259 ± 0.874
LysLys: 8.747 ± 0.859
LysLeu: 7.623 ± 0.613
LysMet: 2.488 ± 0.412
LysAsn: 6.42 ± 0.848
LysPro: 2.648 ± 0.515
LysGln: 4.253 ± 0.621
LysArg: 5.055 ± 0.645
LysSer: 5.296 ± 0.722
LysThr: 5.858 ± 0.825
LysVal: 5.376 ± 0.58
LysTrp: 1.123 ± 0.399
LysTyr: 4.413 ± 0.674
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.173 ± 0.627
LeuCys: 0.562 ± 0.246
LeuAsp: 5.136 ± 0.562
LeuGlu: 6.5 ± 0.903
LeuPhe: 3.691 ± 0.519
LeuGly: 3.932 ± 0.654
LeuHis: 1.364 ± 0.285
LeuIle: 6.018 ± 0.798
LeuLys: 8.345 ± 0.851
LeuLeu: 6.901 ± 0.917
LeuMet: 2.327 ± 0.573
LeuAsn: 5.858 ± 0.729
LeuPro: 2.247 ± 0.413
LeuGln: 3.049 ± 0.464
LeuArg: 3.771 ± 0.633
LeuSer: 5.376 ± 0.729
LeuThr: 4.092 ± 0.547
LeuVal: 3.852 ± 0.608
LeuTrp: 0.562 ± 0.236
LeuTyr: 2.809 ± 0.516
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.284 ± 0.375
MetCys: 0.08 ± 0.081
MetAsp: 1.444 ± 0.308
MetGlu: 1.605 ± 0.286
MetPhe: 0.963 ± 0.242
MetGly: 1.284 ± 0.556
MetHis: 0.16 ± 0.109
MetIle: 2.006 ± 0.359
MetLys: 2.167 ± 0.444
MetLeu: 2.247 ± 0.318
MetMet: 0.963 ± 0.358
MetAsn: 1.525 ± 0.353
MetPro: 1.043 ± 0.306
MetGln: 1.284 ± 0.305
MetArg: 0.883 ± 0.29
MetSer: 1.605 ± 0.303
MetThr: 1.765 ± 0.524
MetVal: 1.204 ± 0.3
MetTrp: 0.241 ± 0.126
MetTyr: 0.963 ± 0.291
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.932 ± 0.55
AsnCys: 0.16 ± 0.12
AsnAsp: 5.296 ± 0.752
AsnGlu: 4.975 ± 0.868
AsnPhe: 1.926 ± 0.488
AsnGly: 5.376 ± 0.515
AsnHis: 0.963 ± 0.336
AsnIle: 4.975 ± 0.635
AsnLys: 6.901 ± 0.769
AsnLeu: 4.815 ± 0.563
AsnMet: 1.846 ± 0.388
AsnAsn: 4.574 ± 0.674
AsnPro: 2.568 ± 0.36
AsnGln: 3.45 ± 0.675
AsnArg: 2.648 ± 0.479
AsnSer: 4.253 ± 0.525
AsnThr: 3.611 ± 0.429
AsnVal: 4.494 ± 0.649
AsnTrp: 0.883 ± 0.45
AsnTyr: 3.37 ± 0.462
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.963 ± 0.302
ProCys: 0.08 ± 0.08
ProAsp: 1.364 ± 0.271
ProGlu: 2.648 ± 0.424
ProPhe: 1.605 ± 0.351
ProGly: 0.883 ± 0.25
ProHis: 0.642 ± 0.216
ProIle: 2.568 ± 0.393
ProLys: 2.969 ± 0.645
ProLeu: 1.444 ± 0.325
ProMet: 0.642 ± 0.194
ProAsn: 1.204 ± 0.27
ProPro: 0.963 ± 0.215
ProGln: 0.883 ± 0.402
ProArg: 1.284 ± 0.323
ProSer: 1.525 ± 0.31
ProThr: 1.525 ± 0.314
ProVal: 1.926 ± 0.286
ProTrp: 0.08 ± 0.081
ProTyr: 0.963 ± 0.246
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.13 ± 0.387
GlnCys: 0.401 ± 0.184
GlnAsp: 2.086 ± 0.504
GlnGlu: 3.13 ± 0.557
GlnPhe: 1.123 ± 0.256
GlnGly: 1.846 ± 0.385
GlnHis: 0.722 ± 0.242
GlnIle: 2.728 ± 0.415
GlnLys: 3.37 ± 0.523
GlnLeu: 3.13 ± 0.473
GlnMet: 0.963 ± 0.256
GlnAsn: 3.21 ± 0.516
GlnPro: 0.963 ± 0.281
GlnGln: 2.809 ± 0.796
GlnArg: 2.006 ± 0.407
GlnSer: 2.407 ± 0.47
GlnThr: 1.846 ± 0.387
GlnVal: 2.006 ± 0.363
GlnTrp: 0.321 ± 0.148
GlnTyr: 1.444 ± 0.314
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.407 ± 0.368
ArgCys: 0.16 ± 0.123
ArgAsp: 2.488 ± 0.506
ArgGlu: 3.531 ± 0.582
ArgPhe: 1.926 ± 0.377
ArgGly: 2.006 ± 0.52
ArgHis: 0.722 ± 0.24
ArgIle: 3.531 ± 0.561
ArgLys: 3.932 ± 0.641
ArgLeu: 3.932 ± 0.558
ArgMet: 0.802 ± 0.241
ArgAsn: 3.13 ± 0.51
ArgPro: 0.883 ± 0.398
ArgGln: 1.685 ± 0.414
ArgArg: 1.765 ± 0.375
ArgSer: 2.407 ± 0.52
ArgThr: 1.846 ± 0.417
ArgVal: 2.086 ± 0.411
ArgTrp: 0.642 ± 0.22
ArgTyr: 2.488 ± 0.56
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.253 ± 0.543
SerCys: 0.481 ± 0.229
SerAsp: 4.173 ± 0.913
SerGlu: 4.654 ± 0.654
SerPhe: 2.488 ± 0.436
SerGly: 3.13 ± 0.751
SerHis: 1.685 ± 0.371
SerIle: 5.216 ± 0.697
SerLys: 5.617 ± 0.592
SerLeu: 4.092 ± 0.671
SerMet: 1.364 ± 0.301
SerAsn: 4.494 ± 0.435
SerPro: 0.963 ± 0.264
SerGln: 2.407 ± 0.427
SerArg: 1.605 ± 0.449
SerSer: 3.29 ± 0.515
SerThr: 3.852 ± 0.538
SerVal: 2.889 ± 0.423
SerTrp: 0.16 ± 0.096
SerTyr: 1.765 ± 0.39
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.45 ± 0.473
ThrCys: 0.16 ± 0.104
ThrAsp: 3.45 ± 0.567
ThrGlu: 3.37 ± 0.516
ThrPhe: 2.167 ± 0.347
ThrGly: 3.21 ± 0.9
ThrHis: 1.525 ± 0.344
ThrIle: 4.173 ± 0.517
ThrLys: 4.975 ± 0.638
ThrLeu: 3.932 ± 0.471
ThrMet: 0.802 ± 0.2
ThrAsn: 3.771 ± 0.599
ThrPro: 2.327 ± 0.375
ThrGln: 1.765 ± 0.371
ThrArg: 2.407 ± 0.471
ThrSer: 3.531 ± 0.569
ThrThr: 3.21 ± 0.587
ThrVal: 3.531 ± 0.498
ThrTrp: 0.722 ± 0.238
ThrTyr: 2.568 ± 0.555
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.45 ± 0.593
ValCys: 0.241 ± 0.142
ValAsp: 3.37 ± 0.515
ValGlu: 2.889 ± 0.49
ValPhe: 1.926 ± 0.372
ValGly: 3.691 ± 0.566
ValHis: 0.963 ± 0.289
ValIle: 5.055 ± 0.554
ValLys: 6.259 ± 0.709
ValLeu: 5.216 ± 0.677
ValMet: 1.284 ± 0.367
ValAsn: 4.173 ± 0.526
ValPro: 1.123 ± 0.272
ValGln: 2.006 ± 0.495
ValArg: 2.006 ± 0.381
ValSer: 3.771 ± 0.504
ValThr: 3.771 ± 0.502
ValVal: 3.29 ± 0.596
ValTrp: 0.642 ± 0.245
ValTyr: 2.327 ± 0.311
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.241 ± 0.134
TrpCys: 0.0 ± 0.0
TrpAsp: 0.481 ± 0.214
TrpGlu: 0.802 ± 0.239
TrpPhe: 0.642 ± 0.204
TrpGly: 0.642 ± 0.275
TrpHis: 0.0 ± 0.0
TrpIle: 1.043 ± 0.238
TrpLys: 0.642 ± 0.188
TrpLeu: 0.883 ± 0.265
TrpMet: 0.401 ± 0.177
TrpAsn: 0.802 ± 0.315
TrpPro: 0.08 ± 0.068
TrpGln: 0.562 ± 0.187
TrpArg: 0.401 ± 0.158
TrpSer: 0.642 ± 0.22
TrpThr: 0.562 ± 0.175
TrpVal: 0.481 ± 0.15
TrpTrp: 0.08 ± 0.07
TrpTyr: 0.401 ± 0.129
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.086 ± 0.331
TyrCys: 0.562 ± 0.228
TyrAsp: 3.049 ± 0.644
TyrGlu: 3.13 ± 0.578
TyrPhe: 1.846 ± 0.474
TyrGly: 2.407 ± 0.542
TyrHis: 0.883 ± 0.264
TyrIle: 3.932 ± 0.63
TyrLys: 4.413 ± 0.61
TyrLeu: 3.37 ± 0.513
TyrMet: 0.722 ± 0.243
TyrAsn: 2.648 ± 0.353
TyrPro: 0.802 ± 0.227
TyrGln: 2.327 ± 0.429
TyrArg: 1.765 ± 0.439
TyrSer: 2.407 ± 0.528
TyrThr: 2.167 ± 0.351
TyrVal: 2.167 ± 0.48
TyrTrp: 0.401 ± 0.145
TyrTyr: 1.284 ± 0.478
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (12463 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
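A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the fixed random seed, and the use of the standard deviation as the reported error are assumptions for illustration; the page only states that bootstrapping with 100 resamples at the protein level was used.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins):
    """Dipeptide frequencies (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of one dipeptide's frequency under protein-level resampling.

    Assumption: the error is the sample standard deviation of the
    bootstrap estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLA", "MKA", "AKKM"]
print(bootstrap_error(toy, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.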

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski