Amino acid dipepetide frequency for Staphylococcus phage B166

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.965AlaAla: 0.965 ± 0.304
0.297AlaCys: 0.297 ± 0.158
2.45AlaAsp: 2.45 ± 0.485
3.861AlaGlu: 3.861 ± 0.532
3.638AlaPhe: 3.638 ± 0.68
3.341AlaGly: 3.341 ± 0.616
0.668AlaHis: 0.668 ± 0.261
4.455AlaIle: 4.455 ± 0.686
5.865AlaLys: 5.865 ± 0.59
4.677AlaLeu: 4.677 ± 0.697
1.708AlaMet: 1.708 ± 0.374
3.786AlaAsn: 3.786 ± 0.45
1.559AlaPro: 1.559 ± 0.309
2.079AlaGln: 2.079 ± 0.433
2.45AlaArg: 2.45 ± 0.439
4.009AlaSer: 4.009 ± 0.59
3.267AlaThr: 3.267 ± 0.564
3.044AlaVal: 3.044 ± 0.684
0.965AlaTrp: 0.965 ± 0.364
2.153AlaTyr: 2.153 ± 0.428
0.0AlaXaa: 0.0 ± 0.0
Cys
0.223CysAla: 0.223 ± 0.145
0.148CysCys: 0.148 ± 0.096
0.148CysAsp: 0.148 ± 0.108
0.074CysGlu: 0.074 ± 0.071
0.52CysPhe: 0.52 ± 0.239
0.223CysGly: 0.223 ± 0.117
0.0CysHis: 0.0 ± 0.0
0.148CysIle: 0.148 ± 0.101
0.371CysLys: 0.371 ± 0.157
0.074CysLeu: 0.074 ± 0.076
0.074CysMet: 0.074 ± 0.08
0.371CysAsn: 0.371 ± 0.187
0.148CysPro: 0.148 ± 0.088
0.297CysGln: 0.297 ± 0.123
0.371CysArg: 0.371 ± 0.153
0.445CysSer: 0.445 ± 0.197
0.223CysThr: 0.223 ± 0.141
0.074CysVal: 0.074 ± 0.079
0.074CysTrp: 0.074 ± 0.067
0.371CysTyr: 0.371 ± 0.166
0.0CysXaa: 0.0 ± 0.0
Asp
4.158AspAla: 4.158 ± 0.673
0.148AspCys: 0.148 ± 0.099
4.752AspAsp: 4.752 ± 0.902
4.677AspGlu: 4.677 ± 0.69
3.564AspPhe: 3.564 ± 0.652
4.306AspGly: 4.306 ± 0.602
0.594AspHis: 0.594 ± 0.194
4.752AspIle: 4.752 ± 0.463
6.608AspLys: 6.608 ± 0.915
4.826AspLeu: 4.826 ± 0.469
1.633AspMet: 1.633 ± 0.313
3.564AspAsn: 3.564 ± 0.617
1.708AspPro: 1.708 ± 0.278
0.965AspGln: 0.965 ± 0.236
2.524AspArg: 2.524 ± 0.415
4.752AspSer: 4.752 ± 0.588
3.638AspThr: 3.638 ± 0.452
4.009AspVal: 4.009 ± 0.597
0.742AspTrp: 0.742 ± 0.282
2.97AspTyr: 2.97 ± 0.427
0.0AspXaa: 0.0 ± 0.0
Glu
4.232GluAla: 4.232 ± 0.554
0.223GluCys: 0.223 ± 0.134
3.935GluAsp: 3.935 ± 0.686
5.494GluGlu: 5.494 ± 0.897
3.044GluPhe: 3.044 ± 0.474
3.193GluGly: 3.193 ± 0.429
1.782GluHis: 1.782 ± 0.369
5.643GluIle: 5.643 ± 0.838
6.237GluLys: 6.237 ± 0.867
7.424GluLeu: 7.424 ± 0.846
3.193GluMet: 3.193 ± 0.517
4.677GluAsn: 4.677 ± 0.622
1.782GluPro: 1.782 ± 0.352
3.786GluGln: 3.786 ± 0.682
3.118GluArg: 3.118 ± 0.491
4.306GluSer: 4.306 ± 0.603
2.896GluThr: 2.896 ± 0.406
5.568GluVal: 5.568 ± 0.668
1.039GluTrp: 1.039 ± 0.313
3.489GluTyr: 3.489 ± 0.481
0.0GluXaa: 0.0 ± 0.0
Phe
2.005PheAla: 2.005 ± 0.317
0.371PheCys: 0.371 ± 0.12
4.455PheAsp: 4.455 ± 0.449
3.786PheGlu: 3.786 ± 0.461
1.559PhePhe: 1.559 ± 0.382
2.599PheGly: 2.599 ± 0.658
0.742PheHis: 0.742 ± 0.242
3.193PheIle: 3.193 ± 0.432
5.123PheLys: 5.123 ± 0.472
2.896PheLeu: 2.896 ± 0.409
1.262PheMet: 1.262 ± 0.346
3.564PheAsn: 3.564 ± 0.511
1.039PhePro: 1.039 ± 0.264
0.742PheGln: 0.742 ± 0.23
1.336PheArg: 1.336 ± 0.292
2.45PheSer: 2.45 ± 0.476
2.747PheThr: 2.747 ± 0.443
2.227PheVal: 2.227 ± 0.392
0.371PheTrp: 0.371 ± 0.169
2.153PheTyr: 2.153 ± 0.398
0.0PheXaa: 0.0 ± 0.0
Gly
3.786GlyAla: 3.786 ± 0.573
0.297GlyCys: 0.297 ± 0.136
3.712GlyAsp: 3.712 ± 0.498
2.599GlyGlu: 2.599 ± 0.365
2.673GlyPhe: 2.673 ± 0.446
2.673GlyGly: 2.673 ± 0.513
1.485GlyHis: 1.485 ± 0.38
5.049GlyIle: 5.049 ± 0.572
4.826GlyLys: 4.826 ± 0.497
4.158GlyLeu: 4.158 ± 0.668
1.782GlyMet: 1.782 ± 0.371
4.009GlyAsn: 4.009 ± 0.572
0.371GlyPro: 0.371 ± 0.154
2.524GlyGln: 2.524 ± 0.312
2.821GlyArg: 2.821 ± 0.428
2.673GlySer: 2.673 ± 0.522
4.009GlyThr: 4.009 ± 0.525
4.9GlyVal: 4.9 ± 0.701
0.742GlyTrp: 0.742 ± 0.269
2.97GlyTyr: 2.97 ± 0.449
0.0GlyXaa: 0.0 ± 0.0
His
1.114HisAla: 1.114 ± 0.223
0.0HisCys: 0.0 ± 0.0
0.965HisAsp: 0.965 ± 0.224
1.336HisGlu: 1.336 ± 0.308
0.668HisPhe: 0.668 ± 0.192
1.188HisGly: 1.188 ± 0.248
0.297HisHis: 0.297 ± 0.145
1.188HisIle: 1.188 ± 0.296
1.188HisLys: 1.188 ± 0.267
1.708HisLeu: 1.708 ± 0.392
0.371HisMet: 0.371 ± 0.153
0.668HisAsn: 0.668 ± 0.23
0.965HisPro: 0.965 ± 0.334
1.188HisGln: 1.188 ± 0.362
0.742HisArg: 0.742 ± 0.255
1.188HisSer: 1.188 ± 0.32
0.891HisThr: 0.891 ± 0.251
1.114HisVal: 1.114 ± 0.243
0.223HisTrp: 0.223 ± 0.128
0.817HisTyr: 0.817 ± 0.344
0.0HisXaa: 0.0 ± 0.0
Ile
3.712IleAla: 3.712 ± 0.736
0.074IleCys: 0.074 ± 0.08
5.568IleAsp: 5.568 ± 0.633
7.796IleGlu: 7.796 ± 0.79
3.415IlePhe: 3.415 ± 0.631
5.494IleGly: 5.494 ± 0.752
1.188IleHis: 1.188 ± 0.338
3.193IleIle: 3.193 ± 0.473
7.87IleLys: 7.87 ± 0.845
4.529IleLeu: 4.529 ± 0.586
2.153IleMet: 2.153 ± 0.393
5.568IleAsn: 5.568 ± 0.901
2.376IlePro: 2.376 ± 0.321
3.193IleGln: 3.193 ± 0.46
3.489IleArg: 3.489 ± 0.608
3.935IleSer: 3.935 ± 0.592
4.083IleThr: 4.083 ± 0.609
4.158IleVal: 4.158 ± 0.575
0.891IleTrp: 0.891 ± 0.29
2.599IleTyr: 2.599 ± 0.562
0.0IleXaa: 0.0 ± 0.0
Lys
5.346LysAla: 5.346 ± 0.672
0.445LysCys: 0.445 ± 0.186
5.94LysAsp: 5.94 ± 0.655
7.796LysGlu: 7.796 ± 0.693
3.935LysPhe: 3.935 ± 0.422
5.197LysGly: 5.197 ± 0.654
2.005LysHis: 2.005 ± 0.46
6.385LysIle: 6.385 ± 0.809
8.018LysLys: 8.018 ± 0.871
6.014LysLeu: 6.014 ± 0.719
3.044LysMet: 3.044 ± 0.492
5.197LysAsn: 5.197 ± 0.581
3.044LysPro: 3.044 ± 0.499
4.232LysGln: 4.232 ± 0.659
4.232LysArg: 4.232 ± 0.534
5.568LysSer: 5.568 ± 0.64
5.865LysThr: 5.865 ± 0.647
5.271LysVal: 5.271 ± 0.607
1.039LysTrp: 1.039 ± 0.285
3.415LysTyr: 3.415 ± 0.529
0.0LysXaa: 0.0 ± 0.0
Leu
3.786LeuAla: 3.786 ± 0.761
0.297LeuCys: 0.297 ± 0.163
6.311LeuAsp: 6.311 ± 0.597
5.643LeuGlu: 5.643 ± 0.756
3.489LeuPhe: 3.489 ± 0.428
3.341LeuGly: 3.341 ± 0.641
1.114LeuHis: 1.114 ± 0.269
5.494LeuIle: 5.494 ± 0.599
7.499LeuLys: 7.499 ± 0.581
5.271LeuLeu: 5.271 ± 0.704
2.079LeuMet: 2.079 ± 0.449
4.9LeuAsn: 4.9 ± 0.454
2.747LeuPro: 2.747 ± 0.317
3.044LeuGln: 3.044 ± 0.49
3.341LeuArg: 3.341 ± 0.538
4.455LeuSer: 4.455 ± 0.578
4.677LeuThr: 4.677 ± 0.612
3.712LeuVal: 3.712 ± 0.469
0.297LeuTrp: 0.297 ± 0.219
4.009LeuTyr: 4.009 ± 0.626
0.0LeuXaa: 0.0 ± 0.0
Met
2.302MetAla: 2.302 ± 0.523
0.074MetCys: 0.074 ± 0.086
0.965MetAsp: 0.965 ± 0.253
1.559MetGlu: 1.559 ± 0.405
0.891MetPhe: 0.891 ± 0.233
1.262MetGly: 1.262 ± 0.272
0.371MetHis: 0.371 ± 0.151
1.708MetIle: 1.708 ± 0.326
1.856MetLys: 1.856 ± 0.345
2.45MetLeu: 2.45 ± 0.43
0.817MetMet: 0.817 ± 0.259
1.93MetAsn: 1.93 ± 0.344
1.039MetPro: 1.039 ± 0.269
1.262MetGln: 1.262 ± 0.391
1.485MetArg: 1.485 ± 0.36
1.559MetSer: 1.559 ± 0.464
3.044MetThr: 3.044 ± 0.436
0.891MetVal: 0.891 ± 0.203
0.594MetTrp: 0.594 ± 0.198
0.742MetTyr: 0.742 ± 0.232
0.0MetXaa: 0.0 ± 0.0
Asn
3.861AsnAla: 3.861 ± 0.481
0.297AsnCys: 0.297 ± 0.135
4.826AsnAsp: 4.826 ± 0.63
5.94AsnGlu: 5.94 ± 0.702
2.97AsnPhe: 2.97 ± 0.497
4.529AsnGly: 4.529 ± 0.715
1.114AsnHis: 1.114 ± 0.266
4.232AsnIle: 4.232 ± 0.521
5.568AsnLys: 5.568 ± 0.721
3.712AsnLeu: 3.712 ± 0.531
1.188AsnMet: 1.188 ± 0.336
5.42AsnAsn: 5.42 ± 0.929
2.747AsnPro: 2.747 ± 0.486
2.376AsnGln: 2.376 ± 0.45
2.524AsnArg: 2.524 ± 0.387
3.786AsnSer: 3.786 ± 0.45
3.564AsnThr: 3.564 ± 0.445
4.455AsnVal: 4.455 ± 0.53
0.668AsnTrp: 0.668 ± 0.206
2.747AsnTyr: 2.747 ± 0.408
0.0AsnXaa: 0.0 ± 0.0
Pro
1.336ProAla: 1.336 ± 0.288
0.0ProCys: 0.0 ± 0.0
1.633ProAsp: 1.633 ± 0.318
2.227ProGlu: 2.227 ± 0.384
1.411ProPhe: 1.411 ± 0.317
1.782ProGly: 1.782 ± 0.44
0.445ProHis: 0.445 ± 0.159
2.747ProIle: 2.747 ± 0.47
3.044ProLys: 3.044 ± 0.461
1.782ProLeu: 1.782 ± 0.422
0.52ProMet: 0.52 ± 0.205
2.153ProAsn: 2.153 ± 0.461
0.371ProPro: 0.371 ± 0.154
0.668ProGln: 0.668 ± 0.222
1.039ProArg: 1.039 ± 0.298
1.782ProSer: 1.782 ± 0.333
2.005ProThr: 2.005 ± 0.328
1.856ProVal: 1.856 ± 0.344
0.074ProTrp: 0.074 ± 0.075
1.708ProTyr: 1.708 ± 0.36
0.0ProXaa: 0.0 ± 0.0
Gln
3.267GlnAla: 3.267 ± 0.561
0.445GlnCys: 0.445 ± 0.175
1.93GlnAsp: 1.93 ± 0.409
2.599GlnGlu: 2.599 ± 0.477
1.559GlnPhe: 1.559 ± 0.377
2.302GlnGly: 2.302 ± 0.422
0.668GlnHis: 0.668 ± 0.271
2.97GlnIle: 2.97 ± 0.321
3.118GlnLys: 3.118 ± 0.462
2.599GlnLeu: 2.599 ± 0.439
1.114GlnMet: 1.114 ± 0.329
2.821GlnAsn: 2.821 ± 0.46
0.817GlnPro: 0.817 ± 0.242
1.93GlnGln: 1.93 ± 0.489
1.856GlnArg: 1.856 ± 0.421
2.524GlnSer: 2.524 ± 0.356
2.45GlnThr: 2.45 ± 0.42
2.005GlnVal: 2.005 ± 0.297
0.297GlnTrp: 0.297 ± 0.147
1.336GlnTyr: 1.336 ± 0.318
0.0GlnXaa: 0.0 ± 0.0
Arg
1.708ArgAla: 1.708 ± 0.338
0.297ArgCys: 0.297 ± 0.138
2.376ArgAsp: 2.376 ± 0.42
2.97ArgGlu: 2.97 ± 0.441
2.302ArgPhe: 2.302 ± 0.43
2.153ArgGly: 2.153 ± 0.348
1.039ArgHis: 1.039 ± 0.278
4.009ArgIle: 4.009 ± 0.48
4.158ArgLys: 4.158 ± 0.564
4.009ArgLeu: 4.009 ± 0.551
0.594ArgMet: 0.594 ± 0.21
3.044ArgAsn: 3.044 ± 0.49
1.188ArgPro: 1.188 ± 0.226
1.411ArgGln: 1.411 ± 0.331
1.782ArgArg: 1.782 ± 0.347
1.93ArgSer: 1.93 ± 0.388
2.153ArgThr: 2.153 ± 0.42
2.005ArgVal: 2.005 ± 0.338
0.371ArgTrp: 0.371 ± 0.181
2.599ArgTyr: 2.599 ± 0.536
0.0ArgXaa: 0.0 ± 0.0
Ser
4.158SerAla: 4.158 ± 0.636
0.223SerCys: 0.223 ± 0.119
4.529SerAsp: 4.529 ± 0.629
3.935SerGlu: 3.935 ± 0.507
2.599SerPhe: 2.599 ± 0.379
4.232SerGly: 4.232 ± 0.596
1.262SerHis: 1.262 ± 0.302
5.494SerIle: 5.494 ± 0.688
6.237SerLys: 6.237 ± 0.809
4.306SerLeu: 4.306 ± 0.496
1.782SerMet: 1.782 ± 0.298
3.638SerAsn: 3.638 ± 0.509
1.114SerPro: 1.114 ± 0.296
2.599SerGln: 2.599 ± 0.465
1.856SerArg: 1.856 ± 0.318
3.044SerSer: 3.044 ± 0.514
3.415SerThr: 3.415 ± 0.426
3.489SerVal: 3.489 ± 0.637
0.668SerTrp: 0.668 ± 0.223
2.302SerTyr: 2.302 ± 0.512
0.0SerXaa: 0.0 ± 0.0
Thr
3.267ThrAla: 3.267 ± 0.441
0.074ThrCys: 0.074 ± 0.06
3.489ThrAsp: 3.489 ± 0.448
3.861ThrGlu: 3.861 ± 0.578
2.302ThrPhe: 2.302 ± 0.465
3.712ThrGly: 3.712 ± 0.489
1.411ThrHis: 1.411 ± 0.285
5.791ThrIle: 5.791 ± 0.662
4.529ThrLys: 4.529 ± 0.581
4.826ThrLeu: 4.826 ± 0.68
1.188ThrMet: 1.188 ± 0.325
4.083ThrAsn: 4.083 ± 0.502
1.782ThrPro: 1.782 ± 0.396
2.97ThrGln: 2.97 ± 0.519
2.821ThrArg: 2.821 ± 0.529
4.306ThrSer: 4.306 ± 0.926
2.97ThrThr: 2.97 ± 0.49
3.341ThrVal: 3.341 ± 0.539
1.114ThrTrp: 1.114 ± 0.284
1.856ThrTyr: 1.856 ± 0.338
0.0ThrXaa: 0.0 ± 0.0
Val
3.044ValAla: 3.044 ± 0.741
0.371ValCys: 0.371 ± 0.147
4.529ValAsp: 4.529 ± 0.803
4.38ValGlu: 4.38 ± 0.57
1.856ValPhe: 1.856 ± 0.32
3.861ValGly: 3.861 ± 0.605
0.52ValHis: 0.52 ± 0.164
4.826ValIle: 4.826 ± 0.623
5.494ValLys: 5.494 ± 0.614
5.197ValLeu: 5.197 ± 0.789
1.336ValMet: 1.336 ± 0.294
3.489ValAsn: 3.489 ± 0.579
2.524ValPro: 2.524 ± 0.486
1.633ValGln: 1.633 ± 0.388
2.227ValArg: 2.227 ± 0.362
4.158ValSer: 4.158 ± 0.671
4.009ValThr: 4.009 ± 0.457
3.712ValVal: 3.712 ± 0.578
0.742ValTrp: 0.742 ± 0.199
2.079ValTyr: 2.079 ± 0.457
0.0ValXaa: 0.0 ± 0.0
Trp
0.594TrpAla: 0.594 ± 0.21
0.074TrpCys: 0.074 ± 0.067
0.52TrpAsp: 0.52 ± 0.211
0.891TrpGlu: 0.891 ± 0.271
0.371TrpPhe: 0.371 ± 0.155
0.52TrpGly: 0.52 ± 0.23
0.297TrpHis: 0.297 ± 0.136
0.817TrpIle: 0.817 ± 0.283
0.891TrpLys: 0.891 ± 0.239
1.336TrpLeu: 1.336 ± 0.326
0.223TrpMet: 0.223 ± 0.136
0.817TrpAsn: 0.817 ± 0.263
0.074TrpPro: 0.074 ± 0.07
0.371TrpGln: 0.371 ± 0.169
0.445TrpArg: 0.445 ± 0.189
1.039TrpSer: 1.039 ± 0.304
1.039TrpThr: 1.039 ± 0.229
0.965TrpVal: 0.965 ± 0.27
0.148TrpTrp: 0.148 ± 0.104
0.445TrpTyr: 0.445 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.376TyrAla: 2.376 ± 0.354
0.223TyrCys: 0.223 ± 0.139
1.856TyrAsp: 1.856 ± 0.457
3.638TyrGlu: 3.638 ± 0.599
1.856TyrPhe: 1.856 ± 0.383
2.227TyrGly: 2.227 ± 0.454
0.817TyrHis: 0.817 ± 0.249
3.193TyrIle: 3.193 ± 0.525
3.489TyrLys: 3.489 ± 0.552
3.712TyrLeu: 3.712 ± 0.546
0.668TyrMet: 0.668 ± 0.259
2.821TyrAsn: 2.821 ± 0.532
1.188TyrPro: 1.188 ± 0.363
1.336TyrGln: 1.336 ± 0.304
1.633TyrArg: 1.633 ± 0.461
2.97TyrSer: 2.97 ± 0.398
2.747TyrThr: 2.747 ± 0.438
3.193TyrVal: 3.193 ± 0.577
0.817TyrTrp: 0.817 ± 0.376
1.336TyrTyr: 1.336 ± 0.307
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (13470 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski