Amino acid dipeptide frequency for Staphylococcus phage SA13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
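As an illustration of what a dipeptide frequency is, the following is a minimal Python sketch that counts overlapping residue pairs within each protein and reports them in per mille. The function name, the one-letter sequence input, the toy sequences and the choice of normalising by the total number of counted pairs are all assumptions made for this example; the text does not state how the database itself normalises its values.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences.

    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the phage proteome.
toy = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The table on this page uses three-letter codes (for example LysGlu), so a one-letter to three-letter mapping would be needed to reproduce its labels.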

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
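The arithmetic behind the expected value can be sketched as follows. The comparison rule (strictly above or below the uniform expectation) is an assumption for illustration; the text does not say whether the colouring also takes the error estimates into account. The three example values are copied from the table on this page.

```python
# Uniform expectation: every dipeptide equally likely.
expected_permille = 1000.0 / (20 ** 2)   # 400 dipeptides -> 2.5 per mille (0.25%)
expected_with_xaa = 1000.0 / (21 ** 2)   # 441 dipeptides when Xaa is included


def classify(value_permille, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Values taken from the table (per mille).
examples = {"LysGlu": 8.524, "GluLeu": 8.068, "CysMet": 0.076}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_permille, round(expected_with_xaa, 3), labels)
```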

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
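A short sketch of the unit conversion, using hypothetical helper names; the sample value 6.317 is the AlaLys entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value_permille * 0.1


alalys_fraction = permille_to_fraction(6.317)
uniform_percent = permille_to_percent(2.5)
print(alalys_fraction, uniform_percent)
```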

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue. Second-residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.609 ± 0.223
AlaCys: 0.381 ± 0.137
AlaAsp: 2.512 ± 0.372
AlaGlu: 4.186 ± 0.415
AlaPhe: 3.044 ± 0.594
AlaGly: 4.034 ± 0.769
AlaHis: 1.218 ± 0.296
AlaIle: 4.871 ± 0.61
AlaLys: 6.317 ± 0.69
AlaLeu: 4.034 ± 0.755
AlaMet: 1.446 ± 0.411
AlaAsn: 3.044 ± 0.496
AlaPro: 2.207 ± 0.449
AlaGln: 2.892 ± 0.489
AlaArg: 2.435 ± 0.303
AlaSer: 4.338 ± 0.747
AlaThr: 4.11 ± 0.592
AlaVal: 3.805 ± 0.738
AlaTrp: 0.913 ± 0.314
AlaTyr: 2.588 ± 0.453
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.152 ± 0.11
CysCys: 0.0 ± 0.0
CysAsp: 0.228 ± 0.127
CysGlu: 0.228 ± 0.141
CysPhe: 0.381 ± 0.201
CysGly: 0.228 ± 0.108
CysHis: 0.0 ± 0.0
CysIle: 0.152 ± 0.099
CysLys: 0.609 ± 0.185
CysLeu: 0.228 ± 0.147
CysMet: 0.076 ± 0.095
CysAsn: 0.381 ± 0.192
CysPro: 0.457 ± 0.183
CysGln: 0.228 ± 0.111
CysArg: 0.228 ± 0.119
CysSer: 0.609 ± 0.18
CysThr: 0.228 ± 0.126
CysVal: 0.228 ± 0.131
CysTrp: 0.152 ± 0.101
CysTyr: 0.381 ± 0.147
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.958 ± 0.603
AspCys: 0.304 ± 0.168
AspAsp: 4.262 ± 0.729
AspGlu: 5.784 ± 0.726
AspPhe: 3.729 ± 0.599
AspGly: 4.643 ± 0.601
AspHis: 0.152 ± 0.106
AspIle: 5.252 ± 0.661
AspLys: 5.632 ± 0.709
AspLeu: 5.328 ± 0.592
AspMet: 2.055 ± 0.401
AspAsn: 3.805 ± 0.578
AspPro: 1.294 ± 0.289
AspGln: 1.446 ± 0.394
AspArg: 2.055 ± 0.385
AspSer: 3.958 ± 0.632
AspThr: 3.12 ± 0.435
AspVal: 3.729 ± 0.543
AspTrp: 0.761 ± 0.254
AspTyr: 2.74 ± 0.449
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.871 ± 0.722
GluCys: 0.457 ± 0.18
GluAsp: 3.882 ± 0.633
GluGlu: 5.175 ± 0.845
GluPhe: 3.501 ± 0.637
GluGly: 2.512 ± 0.407
GluHis: 1.37 ± 0.316
GluIle: 6.241 ± 0.867
GluLys: 5.937 ± 0.716
GluLeu: 8.068 ± 1.075
GluMet: 1.827 ± 0.314
GluAsn: 4.947 ± 0.723
GluPro: 1.522 ± 0.309
GluGln: 3.197 ± 0.475
GluArg: 3.501 ± 0.649
GluSer: 3.349 ± 0.557
GluThr: 3.425 ± 0.454
GluVal: 5.708 ± 0.558
GluTrp: 1.142 ± 0.264
GluTyr: 4.795 ± 0.689
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.827 ± 0.359
PheCys: 0.304 ± 0.138
PheAsp: 3.882 ± 0.397
PheGlu: 3.349 ± 0.603
PhePhe: 1.142 ± 0.293
PheGly: 2.74 ± 0.57
PheHis: 0.685 ± 0.219
PheIle: 3.577 ± 0.573
PheLys: 5.252 ± 0.621
PheLeu: 3.044 ± 0.484
PheMet: 0.913 ± 0.241
PheAsn: 2.968 ± 0.421
PhePro: 0.913 ± 0.324
PheGln: 0.989 ± 0.302
PheArg: 1.522 ± 0.271
PheSer: 2.968 ± 0.517
PheThr: 3.044 ± 0.516
PheVal: 2.512 ± 0.407
PheTrp: 0.381 ± 0.216
PheTyr: 1.903 ± 0.456
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.49 ± 0.673
GlyCys: 0.457 ± 0.196
GlyAsp: 3.501 ± 0.618
GlyGlu: 2.664 ± 0.461
GlyPhe: 2.74 ± 0.458
GlyGly: 2.968 ± 0.535
GlyHis: 1.522 ± 0.371
GlyIle: 4.11 ± 0.602
GlyLys: 5.099 ± 0.498
GlyLeu: 4.643 ± 0.707
GlyMet: 1.294 ± 0.282
GlyAsn: 2.892 ± 0.463
GlyPro: 0.381 ± 0.153
GlyGln: 2.435 ± 0.423
GlyArg: 2.588 ± 0.4
GlySer: 3.425 ± 0.591
GlyThr: 4.034 ± 0.522
GlyVal: 5.328 ± 0.774
GlyTrp: 1.142 ± 0.402
GlyTyr: 2.816 ± 0.656
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.218 ± 0.271
HisCys: 0.0 ± 0.0
HisAsp: 0.533 ± 0.184
HisGlu: 1.37 ± 0.366
HisPhe: 0.913 ± 0.278
HisGly: 1.218 ± 0.308
HisHis: 0.381 ± 0.157
HisIle: 1.446 ± 0.412
HisLys: 0.989 ± 0.257
HisLeu: 1.066 ± 0.291
HisMet: 0.457 ± 0.156
HisAsn: 1.218 ± 0.28
HisPro: 0.913 ± 0.273
HisGln: 0.381 ± 0.2
HisArg: 0.685 ± 0.203
HisSer: 0.685 ± 0.212
HisThr: 1.674 ± 0.338
HisVal: 0.685 ± 0.239
HisTrp: 0.0 ± 0.0
HisTyr: 0.685 ± 0.268
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.262 ± 0.694
IleCys: 0.152 ± 0.101
IleAsp: 5.937 ± 0.853
IleGlu: 6.013 ± 0.878
IlePhe: 2.74 ± 0.414
IleGly: 4.947 ± 0.858
IleHis: 1.218 ± 0.354
IleIle: 4.262 ± 0.563
IleLys: 7.611 ± 0.781
IleLeu: 3.882 ± 0.568
IleMet: 1.979 ± 0.449
IleAsn: 3.653 ± 0.579
IlePro: 2.588 ± 0.387
IleGln: 2.892 ± 0.381
IleArg: 3.197 ± 0.579
IleSer: 4.11 ± 0.588
IleThr: 4.947 ± 0.646
IleVal: 4.034 ± 0.487
IleTrp: 0.761 ± 0.335
IleTyr: 2.74 ± 0.563
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.252 ± 0.604
LysCys: 0.381 ± 0.187
LysAsp: 6.241 ± 0.648
LysGlu: 8.524 ± 0.906
LysPhe: 3.273 ± 0.453
LysGly: 5.937 ± 0.685
LysHis: 2.055 ± 0.421
LysIle: 6.241 ± 0.779
LysLys: 7.23 ± 0.845
LysLeu: 6.698 ± 0.857
LysMet: 2.664 ± 0.372
LysAsn: 5.175 ± 0.827
LysPro: 2.74 ± 0.507
LysGln: 4.643 ± 0.674
LysArg: 4.49 ± 0.543
LysSer: 5.404 ± 0.695
LysThr: 5.86 ± 0.58
LysVal: 5.099 ± 0.586
LysTrp: 0.837 ± 0.238
LysTyr: 4.034 ± 0.704
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.338 ± 0.621
LeuCys: 0.533 ± 0.197
LeuAsp: 5.252 ± 0.512
LeuGlu: 5.86 ± 1.007
LeuPhe: 3.729 ± 0.612
LeuGly: 3.349 ± 0.514
LeuHis: 1.218 ± 0.312
LeuIle: 4.719 ± 0.607
LeuLys: 7.154 ± 0.749
LeuLeu: 5.328 ± 0.68
LeuMet: 1.674 ± 0.371
LeuAsn: 5.86 ± 0.573
LeuPro: 2.055 ± 0.423
LeuGln: 2.968 ± 0.413
LeuArg: 3.044 ± 0.559
LeuSer: 4.795 ± 0.508
LeuThr: 5.175 ± 0.749
LeuVal: 4.034 ± 0.486
LeuTrp: 0.533 ± 0.244
LeuTyr: 3.805 ± 0.594
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.674 ± 0.495
MetCys: 0.0 ± 0.0
MetAsp: 1.294 ± 0.25
MetGlu: 1.598 ± 0.358
MetPhe: 1.218 ± 0.334
MetGly: 1.218 ± 0.244
MetHis: 0.533 ± 0.227
MetIle: 1.446 ± 0.386
MetLys: 2.055 ± 0.463
MetLeu: 2.207 ± 0.359
MetMet: 0.761 ± 0.226
MetAsn: 1.827 ± 0.376
MetPro: 1.066 ± 0.26
MetGln: 1.142 ± 0.411
MetArg: 0.685 ± 0.224
MetSer: 1.751 ± 0.465
MetThr: 2.055 ± 0.382
MetVal: 1.066 ± 0.243
MetTrp: 0.457 ± 0.184
MetTyr: 1.066 ± 0.333
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.11 ± 0.52
AsnCys: 0.533 ± 0.227
AsnAsp: 5.099 ± 0.613
AsnGlu: 4.49 ± 0.615
AsnPhe: 2.74 ± 0.449
AsnGly: 4.186 ± 0.714
AsnHis: 0.761 ± 0.248
AsnIle: 3.958 ± 0.595
AsnLys: 6.393 ± 0.863
AsnLeu: 3.577 ± 0.485
AsnMet: 1.37 ± 0.304
AsnAsn: 5.023 ± 0.859
AsnPro: 2.664 ± 0.434
AsnGln: 2.512 ± 0.433
AsnArg: 1.979 ± 0.31
AsnSer: 3.653 ± 0.528
AsnThr: 4.262 ± 0.553
AsnVal: 3.349 ± 0.599
AsnTrp: 0.913 ± 0.232
AsnTyr: 2.816 ± 0.517
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.446 ± 0.274
ProCys: 0.0 ± 0.0
ProAsp: 1.522 ± 0.36
ProGlu: 1.674 ± 0.344
ProPhe: 1.598 ± 0.306
ProGly: 1.674 ± 0.451
ProHis: 0.533 ± 0.217
ProIle: 2.207 ± 0.444
ProLys: 3.197 ± 0.466
ProLeu: 1.827 ± 0.372
ProMet: 0.989 ± 0.24
ProAsn: 2.131 ± 0.423
ProPro: 0.304 ± 0.121
ProGln: 0.989 ± 0.331
ProArg: 0.761 ± 0.198
ProSer: 1.979 ± 0.411
ProThr: 1.979 ± 0.336
ProVal: 1.522 ± 0.382
ProTrp: 0.152 ± 0.103
ProTyr: 1.142 ± 0.309
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.653 ± 0.548
GlnCys: 0.304 ± 0.156
GlnAsp: 1.979 ± 0.461
GlnGlu: 2.74 ± 0.467
GlnPhe: 1.751 ± 0.306
GlnGly: 2.588 ± 0.523
GlnHis: 0.837 ± 0.211
GlnIle: 2.892 ± 0.432
GlnLys: 3.044 ± 0.6
GlnLeu: 2.588 ± 0.434
GlnMet: 1.294 ± 0.337
GlnAsn: 2.74 ± 0.403
GlnPro: 1.751 ± 0.44
GlnGln: 2.283 ± 0.501
GlnArg: 1.903 ± 0.403
GlnSer: 1.827 ± 0.333
GlnThr: 1.598 ± 0.387
GlnVal: 2.512 ± 0.476
GlnTrp: 0.304 ± 0.167
GlnTyr: 1.522 ± 0.316
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.522 ± 0.342
ArgCys: 0.381 ± 0.142
ArgAsp: 2.435 ± 0.416
ArgGlu: 3.577 ± 0.582
ArgPhe: 2.131 ± 0.437
ArgGly: 1.827 ± 0.391
ArgHis: 1.066 ± 0.345
ArgIle: 3.425 ± 0.461
ArgLys: 3.577 ± 0.459
ArgLeu: 3.729 ± 0.56
ArgMet: 0.761 ± 0.248
ArgAsn: 2.435 ± 0.393
ArgPro: 1.218 ± 0.234
ArgGln: 1.903 ± 0.432
ArgArg: 1.598 ± 0.329
ArgSer: 1.674 ± 0.388
ArgThr: 2.055 ± 0.415
ArgVal: 1.979 ± 0.401
ArgTrp: 0.533 ± 0.2
ArgTyr: 2.131 ± 0.444
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.49 ± 0.571
SerCys: 0.228 ± 0.178
SerAsp: 4.643 ± 0.744
SerGlu: 4.11 ± 0.492
SerPhe: 2.512 ± 0.564
SerGly: 4.034 ± 0.645
SerHis: 0.609 ± 0.196
SerIle: 4.414 ± 0.677
SerLys: 5.784 ± 0.78
SerLeu: 3.577 ± 0.511
SerMet: 2.055 ± 0.354
SerAsn: 4.947 ± 0.605
SerPro: 0.685 ± 0.253
SerGln: 2.512 ± 0.477
SerArg: 2.131 ± 0.306
SerSer: 3.044 ± 0.512
SerThr: 3.501 ± 0.399
SerVal: 3.653 ± 0.615
SerTrp: 0.685 ± 0.241
SerTyr: 2.359 ± 0.439
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.034 ± 0.638
ThrCys: 0.0 ± 0.0
ThrAsp: 3.577 ± 0.435
ThrGlu: 4.414 ± 0.5
ThrPhe: 3.197 ± 0.488
ThrGly: 4.186 ± 0.542
ThrHis: 0.837 ± 0.206
ThrIle: 4.49 ± 0.653
ThrLys: 4.338 ± 0.692
ThrLeu: 5.86 ± 0.56
ThrMet: 0.913 ± 0.227
ThrAsn: 4.338 ± 0.661
ThrPro: 1.751 ± 0.412
ThrGln: 2.588 ± 0.581
ThrArg: 2.359 ± 0.449
ThrSer: 4.871 ± 0.856
ThrThr: 4.262 ± 0.677
ThrVal: 3.653 ± 0.516
ThrTrp: 0.989 ± 0.306
ThrTyr: 2.055 ± 0.328
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.643 ± 0.869
ValCys: 0.304 ± 0.144
ValAsp: 4.414 ± 0.837
ValGlu: 4.49 ± 0.646
ValPhe: 2.055 ± 0.404
ValGly: 3.349 ± 0.55
ValHis: 0.533 ± 0.23
ValIle: 4.49 ± 0.517
ValLys: 6.469 ± 0.643
ValLeu: 4.947 ± 0.642
ValMet: 1.294 ± 0.343
ValAsn: 3.349 ± 0.455
ValPro: 1.979 ± 0.356
ValGln: 1.218 ± 0.34
ValArg: 2.435 ± 0.4
ValSer: 3.882 ± 0.697
ValThr: 3.805 ± 0.59
ValVal: 3.653 ± 0.473
ValTrp: 0.913 ± 0.274
ValTyr: 2.512 ± 0.531
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.609 ± 0.245
TrpCys: 0.152 ± 0.111
TrpAsp: 0.609 ± 0.182
TrpGlu: 1.218 ± 0.273
TrpPhe: 0.457 ± 0.165
TrpGly: 0.761 ± 0.247
TrpHis: 0.304 ± 0.134
TrpIle: 0.685 ± 0.253
TrpLys: 1.066 ± 0.294
TrpLeu: 1.142 ± 0.273
TrpMet: 0.152 ± 0.094
TrpAsn: 0.761 ± 0.241
TrpPro: 0.076 ± 0.075
TrpGln: 0.913 ± 0.276
TrpArg: 0.381 ± 0.186
TrpSer: 0.685 ± 0.246
TrpThr: 1.066 ± 0.208
TrpVal: 0.837 ± 0.247
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.457 ± 0.178
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.827 ± 0.368
TyrCys: 0.304 ± 0.15
TyrAsp: 2.207 ± 0.427
TyrGlu: 3.958 ± 0.63
TyrPhe: 1.066 ± 0.327
TyrGly: 2.207 ± 0.52
TyrHis: 0.685 ± 0.235
TyrIle: 3.044 ± 0.451
TyrLys: 5.023 ± 0.75
TyrLeu: 3.653 ± 0.54
TyrMet: 1.066 ± 0.293
TyrAsn: 2.892 ± 0.494
TyrPro: 1.066 ± 0.329
TyrGln: 1.979 ± 0.326
TyrArg: 1.979 ± 0.455
TyrSer: 2.968 ± 0.612
TyrThr: 2.512 ± 0.419
TyrVal: 3.349 ± 0.531
TyrTrp: 0.761 ± 0.232
TyrTyr: 1.827 ± 0.394
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (13140 amino acids)
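To get a feel for how many occurrences lie behind a per mille value, the sketch below converts a few table entries into approximate counts. It assumes the denominator is the total residue count quoted above (13140); the text does not state the denominator, so this is an inference, although the chosen examples do land very close to whole numbers under it. The helper name is hypothetical.

```python
TOTAL_RESIDUES = 13140  # from "62 proteins (13140 amino acids)"


def approx_count(value_permille, denominator=TOTAL_RESIDUES):
    """Rough number of occurrences behind a per mille value (assumed denominator)."""
    return round(value_permille * 1e-3 * denominator)


# Values copied from the table (per mille).
for name, value in [("CysMet", 0.076), ("AlaAla", 0.609), ("LysGlu", 8.524)]:
    print(name, approx_count(value))
```

Under this assumption, the smallest nonzero entries (0.076‰) correspond to a single occurrence, which helps explain why their error estimates are as large as the values themselves.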

Note: The error has been estimated by bootstrapping (x100) at the protein level.
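A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting values is reported. Only "bootstrapping (x100) at the protein level" comes from the text; using the sample standard deviation as the error, the fixed seed, the function names and the toy sequences are assumptions for illustration.

```python
import random
from statistics import stdev


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs in all proteins."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return stdev(estimates)


# Toy one-letter sequences, not taken from the phage proteome.
toy = ["MKKLA", "MKA", "AAKK"]
err_kk = bootstrap_error(toy, "KK")
err_ww = bootstrap_error(toy, "WW")
print(dipeptide_permille(toy, "KK"), err_kk, err_ww)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. A dipeptide that never occurs gets an error of 0.0, as in the table.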

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski