Amino acid dipeptide frequency for Staphylococcus phage 2638A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
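The comparison against the uniform expectation can be sketched as follows. This is a minimal illustration, assuming that a dipeptide counts as overrepresented whenever its frequency exceeds 1/400; the exact rule the page uses for the red and blue marking is not stated, and the function name `classify` is hypothetical.

```python
# Uniform expectation: 400 dipeptides built from the 20 standard amino acids.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25%


def classify(per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation.

    Assumption: a plain greater-than / less-than comparison; the page may
    apply a different threshold when colouring cells.
    """
    if per_mille > expected:
        return "overrepresented"
    if per_mille < expected:
        return "underrepresented"
    return "as expected"


# Three values copied from the table (per mille).
examples = {"AlaAla": 3.519, "CysCys": 0.0, "LeuLys": 8.994}
labels = {dipeptide: classify(value) for dipeptide, value in examples.items()}

for dipeptide, label in labels.items():
    print(dipeptide, examples[dipeptide], label)
```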

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
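To make the units concrete, here is a rough sketch of how dipeptide frequencies in per mille could be computed from protein sequences. It assumes overlapping dipeptides are counted within each protein and normalised by the total number of dipeptides; the database may normalise differently. The function name and the one-letter toy sequences are hypothetical, not taken from this proteome.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Count overlapping dipeptides within each sequence; return per mille values.

    Assumption: the denominator is the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in sequences:
        # Dipeptides never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dipeptide: 1000 * n / total for dipeptide, n in counts.items()}


toy_sequences = ["MKKLA", "AKK"]  # hypothetical sequences, one-letter codes
freqs = dipeptide_per_mille(toy_sequences)

# Converting a per mille value back to a plain fraction: multiply by 10^-3.
fraction_kk = freqs["KK"] * 1e-3
print(freqs["KK"], fraction_kk)
```

In this toy input "KK" accounts for 2 of the 6 dipeptides, so its per mille value corresponds to a fraction of one third after the conversion.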

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.519 ± 1.192
AlaCys: 0.235 ± 0.142
AlaAsp: 3.754 ± 0.644
AlaGlu: 3.989 ± 0.498
AlaPhe: 2.816 ± 0.426
AlaGly: 4.302 ± 0.769
AlaHis: 1.173 ± 0.323
AlaIle: 4.458 ± 0.477
AlaLys: 5.24 ± 0.79
AlaLeu: 6.726 ± 0.76
AlaMet: 1.642 ± 0.318
AlaAsn: 3.363 ± 0.52
AlaPro: 1.486 ± 0.281
AlaGln: 2.737 ± 0.646
AlaArg: 3.128 ± 0.38
AlaSer: 4.614 ± 0.676
AlaThr: 3.676 ± 0.543
AlaVal: 3.519 ± 0.549
AlaTrp: 0.626 ± 0.205
AlaTyr: 3.519 ± 0.581
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.156 ± 0.102
CysCys: 0.0 ± 0.0
CysAsp: 0.078 ± 0.065
CysGlu: 0.156 ± 0.11
CysPhe: 0.235 ± 0.133
CysGly: 0.391 ± 0.214
CysHis: 0.156 ± 0.114
CysIle: 0.469 ± 0.199
CysLys: 0.547 ± 0.251
CysLeu: 0.391 ± 0.165
CysMet: 0.156 ± 0.106
CysAsn: 0.235 ± 0.126
CysPro: 0.156 ± 0.115
CysGln: 0.078 ± 0.067
CysArg: 0.391 ± 0.183
CysSer: 0.156 ± 0.11
CysThr: 0.235 ± 0.121
CysVal: 0.391 ± 0.14
CysTrp: 0.078 ± 0.092
CysTyr: 0.078 ± 0.075
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.754 ± 0.476
AspCys: 0.078 ± 0.079
AspAsp: 4.38 ± 0.988
AspGlu: 5.709 ± 0.66
AspPhe: 3.128 ± 0.605
AspGly: 4.067 ± 0.717
AspHis: 0.626 ± 0.272
AspIle: 3.519 ± 0.652
AspLys: 7.117 ± 0.749
AspLeu: 5.318 ± 0.783
AspMet: 2.346 ± 0.505
AspAsn: 3.05 ± 0.561
AspPro: 1.721 ± 0.373
AspGln: 1.017 ± 0.258
AspArg: 2.112 ± 0.345
AspSer: 2.972 ± 0.568
AspThr: 4.145 ± 0.65
AspVal: 4.536 ± 0.741
AspTrp: 0.782 ± 0.197
AspTyr: 3.128 ± 0.544
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.631 ± 0.497
GluCys: 0.547 ± 0.243
GluAsp: 3.05 ± 0.585
GluGlu: 7.274 ± 1.107
GluPhe: 3.05 ± 0.496
GluGly: 3.441 ± 0.439
GluHis: 1.173 ± 0.388
GluIle: 4.849 ± 0.469
GluLys: 6.648 ± 1.091
GluLeu: 7.508 ± 0.892
GluMet: 1.955 ± 0.369
GluAsn: 4.927 ± 0.941
GluPro: 1.173 ± 0.278
GluGln: 3.05 ± 0.584
GluArg: 3.676 ± 0.706
GluSer: 4.145 ± 0.614
GluThr: 3.363 ± 0.452
GluVal: 4.849 ± 1.115
GluTrp: 1.33 ± 0.246
GluTyr: 3.207 ± 0.75
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.425 ± 0.502
PheCys: 0.313 ± 0.152
PheAsp: 3.05 ± 0.492
PheGlu: 3.05 ± 0.695
PhePhe: 1.799 ± 0.366
PheGly: 2.346 ± 0.366
PheHis: 0.156 ± 0.099
PheIle: 3.598 ± 0.496
PheLys: 4.38 ± 0.655
PheLeu: 1.955 ± 0.357
PheMet: 1.017 ± 0.215
PheAsn: 3.989 ± 0.72
PhePro: 0.704 ± 0.19
PheGln: 1.251 ± 0.4
PheArg: 1.408 ± 0.408
PheSer: 3.441 ± 0.43
PheThr: 2.346 ± 0.406
PheVal: 2.033 ± 0.488
PheTrp: 0.391 ± 0.155
PheTyr: 1.564 ± 0.311
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.223 ± 0.771
GlyCys: 0.313 ± 0.124
GlyAsp: 3.441 ± 0.403
GlyGlu: 2.972 ± 0.479
GlyPhe: 2.972 ± 0.46
GlyGly: 5.005 ± 1.001
GlyHis: 1.408 ± 0.309
GlyIle: 4.145 ± 0.541
GlyLys: 5.866 ± 0.721
GlyLeu: 6.022 ± 0.751
GlyMet: 0.704 ± 0.292
GlyAsn: 2.816 ± 0.454
GlyPro: 1.095 ± 0.237
GlyGln: 2.503 ± 0.5
GlyArg: 2.581 ± 0.479
GlySer: 3.911 ± 0.79
GlyThr: 3.754 ± 0.538
GlyVal: 3.832 ± 0.763
GlyTrp: 0.86 ± 0.313
GlyTyr: 1.877 ± 0.318
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.86 ± 0.264
HisCys: 0.313 ± 0.17
HisAsp: 1.173 ± 0.253
HisGlu: 1.095 ± 0.29
HisPhe: 0.704 ± 0.257
HisGly: 0.86 ± 0.232
HisHis: 0.626 ± 0.271
HisIle: 1.564 ± 0.273
HisLys: 1.095 ± 0.274
HisLeu: 1.799 ± 0.378
HisMet: 0.391 ± 0.171
HisAsn: 1.486 ± 0.69
HisPro: 0.626 ± 0.156
HisGln: 0.469 ± 0.188
HisArg: 0.626 ± 0.262
HisSer: 1.251 ± 0.403
HisThr: 0.782 ± 0.216
HisVal: 1.017 ± 0.286
HisTrp: 0.078 ± 0.074
HisTyr: 1.095 ± 0.262
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.223 ± 0.627
IleCys: 0.156 ± 0.098
IleAsp: 5.318 ± 0.763
IleGlu: 5.24 ± 0.937
IlePhe: 2.503 ± 0.543
IleGly: 2.581 ± 0.406
IleHis: 1.486 ± 0.326
IleIle: 4.302 ± 0.561
IleLys: 7.43 ± 0.982
IleLeu: 4.145 ± 0.541
IleMet: 1.251 ± 0.286
IleAsn: 5.24 ± 0.635
IlePro: 2.033 ± 0.387
IleGln: 1.955 ± 0.375
IleArg: 2.894 ± 0.484
IleSer: 4.849 ± 0.616
IleThr: 3.911 ± 0.367
IleVal: 4.771 ± 0.759
IleTrp: 0.469 ± 0.164
IleTyr: 2.894 ± 0.555
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.335 ± 0.675
LysCys: 0.156 ± 0.111
LysAsp: 6.257 ± 0.572
LysGlu: 6.413 ± 0.884
LysPhe: 3.598 ± 0.471
LysGly: 6.335 ± 0.942
LysHis: 2.425 ± 0.374
LysIle: 6.179 ± 0.407
LysLys: 6.1 ± 0.823
LysLeu: 7.586 ± 0.833
LysMet: 2.816 ± 0.461
LysAsn: 5.318 ± 0.748
LysPro: 2.033 ± 0.396
LysGln: 5.084 ± 0.599
LysArg: 4.771 ± 0.466
LysSer: 4.693 ± 0.961
LysThr: 5.397 ± 0.687
LysVal: 5.553 ± 0.593
LysTrp: 1.251 ± 0.356
LysTyr: 3.598 ± 0.739
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.084 ± 0.62
LeuCys: 0.547 ± 0.204
LeuAsp: 5.005 ± 0.784
LeuGlu: 6.726 ± 0.872
LeuPhe: 2.737 ± 0.466
LeuGly: 3.598 ± 0.456
LeuHis: 1.017 ± 0.291
LeuIle: 5.084 ± 0.585
LeuLys: 8.994 ± 0.861
LeuLeu: 6.726 ± 0.706
LeuMet: 2.19 ± 0.403
LeuAsn: 5.162 ± 0.707
LeuPro: 1.955 ± 0.493
LeuGln: 3.989 ± 0.666
LeuArg: 4.302 ± 0.578
LeuSer: 5.866 ± 0.842
LeuThr: 5.005 ± 0.475
LeuVal: 4.38 ± 0.608
LeuTrp: 0.939 ± 0.245
LeuTyr: 2.737 ± 0.533
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.486 ± 0.343
MetCys: 0.156 ± 0.098
MetAsp: 0.86 ± 0.256
MetGlu: 1.955 ± 0.392
MetPhe: 0.86 ± 0.239
MetGly: 1.408 ± 0.471
MetHis: 0.86 ± 0.261
MetIle: 1.721 ± 0.33
MetLys: 2.659 ± 0.506
MetLeu: 1.721 ± 0.303
MetMet: 0.782 ± 0.254
MetAsn: 1.33 ± 0.364
MetPro: 0.939 ± 0.249
MetGln: 1.408 ± 0.338
MetArg: 1.486 ± 0.323
MetSer: 2.737 ± 0.731
MetThr: 2.033 ± 0.316
MetVal: 1.017 ± 0.203
MetTrp: 0.313 ± 0.136
MetTyr: 0.939 ± 0.252
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.441 ± 0.484
AsnCys: 0.156 ± 0.114
AsnAsp: 4.38 ± 0.649
AsnGlu: 4.771 ± 0.837
AsnPhe: 1.721 ± 0.307
AsnGly: 4.849 ± 0.677
AsnHis: 0.782 ± 0.179
AsnIle: 3.676 ± 0.556
AsnLys: 5.005 ± 0.6
AsnLeu: 4.849 ± 0.437
AsnMet: 1.564 ± 0.35
AsnAsn: 3.363 ± 0.443
AsnPro: 2.268 ± 0.273
AsnGln: 2.816 ± 0.726
AsnArg: 2.972 ± 0.406
AsnSer: 3.676 ± 0.474
AsnThr: 3.128 ± 0.407
AsnVal: 4.067 ± 0.551
AsnTrp: 0.939 ± 0.267
AsnTyr: 3.05 ± 0.537
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.33 ± 0.318
ProCys: 0.078 ± 0.067
ProAsp: 1.486 ± 0.429
ProGlu: 2.112 ± 0.383
ProPhe: 1.017 ± 0.237
ProGly: 1.095 ± 0.297
ProHis: 0.469 ± 0.152
ProIle: 1.251 ± 0.379
ProLys: 2.737 ± 0.422
ProLeu: 1.955 ± 0.48
ProMet: 0.547 ± 0.183
ProAsn: 1.33 ± 0.316
ProPro: 0.782 ± 0.175
ProGln: 1.251 ± 0.374
ProArg: 1.095 ± 0.289
ProSer: 1.564 ± 0.303
ProThr: 0.782 ± 0.184
ProVal: 2.19 ± 0.446
ProTrp: 0.391 ± 0.157
ProTyr: 0.939 ± 0.271
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.05 ± 0.487
GlnCys: 0.235 ± 0.137
GlnAsp: 2.659 ± 0.511
GlnGlu: 1.955 ± 0.301
GlnPhe: 1.564 ± 0.274
GlnGly: 2.268 ± 0.408
GlnHis: 1.173 ± 0.623
GlnIle: 3.05 ± 0.535
GlnLys: 3.519 ± 0.535
GlnLeu: 2.972 ± 0.447
GlnMet: 1.095 ± 0.308
GlnAsn: 2.737 ± 0.421
GlnPro: 0.782 ± 0.283
GlnGln: 1.564 ± 0.479
GlnArg: 2.19 ± 0.426
GlnSer: 2.737 ± 0.41
GlnThr: 1.799 ± 0.302
GlnVal: 1.799 ± 0.321
GlnTrp: 0.391 ± 0.158
GlnTyr: 2.033 ± 0.313
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.894 ± 0.556
ArgCys: 0.078 ± 0.075
ArgAsp: 2.503 ± 0.474
ArgGlu: 3.676 ± 0.521
ArgPhe: 2.346 ± 0.337
ArgGly: 2.268 ± 0.356
ArgHis: 0.86 ± 0.332
ArgIle: 3.05 ± 0.523
ArgLys: 3.989 ± 0.594
ArgLeu: 4.458 ± 0.463
ArgMet: 0.939 ± 0.317
ArgAsn: 2.346 ± 0.415
ArgPro: 0.782 ± 0.207
ArgGln: 1.955 ± 0.482
ArgArg: 2.033 ± 0.474
ArgSer: 2.503 ± 0.534
ArgThr: 3.519 ± 0.612
ArgVal: 2.894 ± 0.457
ArgTrp: 0.547 ± 0.184
ArgTyr: 2.112 ± 0.288
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.536 ± 0.757
SerCys: 0.156 ± 0.115
SerAsp: 4.145 ± 0.9
SerGlu: 5.475 ± 0.883
SerPhe: 2.268 ± 0.418
SerGly: 4.614 ± 0.625
SerHis: 0.626 ± 0.19
SerIle: 4.771 ± 1.012
SerLys: 5.944 ± 0.648
SerLeu: 3.676 ± 0.59
SerMet: 2.659 ± 0.677
SerAsn: 4.067 ± 0.581
SerPro: 1.251 ± 0.317
SerGln: 2.19 ± 0.376
SerArg: 2.425 ± 0.604
SerSer: 3.754 ± 0.506
SerThr: 2.894 ± 0.613
SerVal: 3.989 ± 0.527
SerTrp: 1.017 ± 0.294
SerTyr: 2.19 ± 0.423
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.972 ± 0.549
ThrCys: 0.235 ± 0.117
ThrAsp: 3.989 ± 0.476
ThrGlu: 3.598 ± 0.61
ThrPhe: 3.519 ± 0.493
ThrGly: 3.363 ± 0.569
ThrHis: 1.408 ± 0.285
ThrIle: 4.067 ± 0.692
ThrLys: 4.693 ± 0.648
ThrLeu: 3.598 ± 0.439
ThrMet: 1.408 ± 0.624
ThrAsn: 3.441 ± 0.601
ThrPro: 2.033 ± 0.297
ThrGln: 1.721 ± 0.37
ThrArg: 2.503 ± 0.322
ThrSer: 3.285 ± 0.695
ThrThr: 3.285 ± 0.588
ThrVal: 3.519 ± 0.538
ThrTrp: 1.017 ± 0.302
ThrTyr: 1.799 ± 0.422
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.536 ± 0.698
ValCys: 0.547 ± 0.21
ValAsp: 3.598 ± 0.423
ValGlu: 4.223 ± 0.784
ValPhe: 2.425 ± 0.504
ValGly: 3.989 ± 0.536
ValHis: 0.547 ± 0.177
ValIle: 3.754 ± 0.435
ValLys: 5.944 ± 0.797
ValLeu: 5.318 ± 0.739
ValMet: 1.642 ± 0.261
ValAsn: 4.302 ± 0.609
ValPro: 1.33 ± 0.361
ValGln: 1.877 ± 0.344
ValArg: 3.05 ± 0.425
ValSer: 3.441 ± 0.527
ValThr: 3.207 ± 0.633
ValVal: 5.005 ± 1.093
ValTrp: 0.626 ± 0.229
ValTyr: 2.346 ± 0.429
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.939 ± 0.234
TrpCys: 0.0 ± 0.0
TrpAsp: 1.017 ± 0.386
TrpGlu: 0.782 ± 0.235
TrpPhe: 0.626 ± 0.219
TrpGly: 0.469 ± 0.135
TrpHis: 0.156 ± 0.114
TrpIle: 1.173 ± 0.266
TrpLys: 0.939 ± 0.248
TrpLeu: 1.251 ± 0.277
TrpMet: 0.235 ± 0.156
TrpAsn: 0.704 ± 0.287
TrpPro: 0.313 ± 0.22
TrpGln: 0.782 ± 0.163
TrpArg: 0.704 ± 0.214
TrpSer: 0.939 ± 0.257
TrpThr: 0.704 ± 0.204
TrpVal: 0.313 ± 0.157
TrpTrp: 0.156 ± 0.106
TrpTyr: 0.547 ± 0.181
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.128 ± 0.441
TyrCys: 0.235 ± 0.109
TyrAsp: 3.363 ± 0.651
TyrGlu: 3.676 ± 0.628
TyrPhe: 1.33 ± 0.334
TyrGly: 3.05 ± 0.442
TyrHis: 0.782 ± 0.323
TyrIle: 2.972 ± 0.672
TyrLys: 3.05 ± 0.441
TyrLeu: 4.067 ± 0.646
TyrMet: 1.251 ± 0.289
TyrAsn: 2.268 ± 0.498
TyrPro: 0.939 ± 0.282
TyrGln: 1.877 ± 0.306
TyrArg: 1.408 ± 0.303
TyrSer: 2.112 ± 0.324
TyrThr: 1.564 ± 0.517
TyrVal: 2.033 ± 0.553
TyrTrp: 0.547 ± 0.174
TyrTyr: 1.486 ± 0.407
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12787 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
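A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: proteins are resampled with replacement 100 times, and the reported error is taken to be the standard deviation of the resampled frequencies; the page does not say which statistic is actually used. The function names and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (per mille) across a list of sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation over resamples.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteome = ["MKKLA", "AKKE", "MLLKK", "GASTKK"]  # hypothetical sequences
error_kk = bootstrap_error(toy_proteome, "KK")
error_ww = bootstrap_error(toy_proteome, "WW")  # dipeptide absent from the input
print(per_mille(toy_proteome, "KK"), error_kk, error_ww)
```

Resampling whole proteins rather than individual dipeptides keeps residues from the same protein together, so the spread reflects protein-to-protein variation. A dipeptide that never occurs gets an error of zero, consistent with the 0.0 ± 0.0 entries in the table.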

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski