Amino acid dipepetide frequency for Staphylococcus virus 92

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.492AlaAla: 0.492 ± 0.202
0.632AlaCys: 0.632 ± 0.184
2.458AlaAsp: 2.458 ± 0.312
3.441AlaGlu: 3.441 ± 0.431
2.598AlaPhe: 2.598 ± 0.465
3.511AlaGly: 3.511 ± 0.685
1.053AlaHis: 1.053 ± 0.3
5.337AlaIle: 5.337 ± 0.666
5.97AlaLys: 5.97 ± 0.617
4.776AlaLeu: 4.776 ± 0.701
1.545AlaMet: 1.545 ± 0.386
3.301AlaAsn: 3.301 ± 0.474
2.739AlaPro: 2.739 ± 0.46
2.528AlaGln: 2.528 ± 0.435
2.247AlaArg: 2.247 ± 0.346
4.073AlaSer: 4.073 ± 0.775
4.635AlaThr: 4.635 ± 0.678
3.16AlaVal: 3.16 ± 0.577
0.913AlaTrp: 0.913 ± 0.328
2.388AlaTyr: 2.388 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
0.281CysAla: 0.281 ± 0.141
0.14CysCys: 0.14 ± 0.092
0.351CysAsp: 0.351 ± 0.165
0.492CysGlu: 0.492 ± 0.196
0.211CysPhe: 0.211 ± 0.118
0.351CysGly: 0.351 ± 0.136
0.0CysHis: 0.0 ± 0.0
0.351CysIle: 0.351 ± 0.144
0.632CysLys: 0.632 ± 0.187
0.351CysLeu: 0.351 ± 0.155
0.281CysMet: 0.281 ± 0.13
0.492CysAsn: 0.492 ± 0.205
0.492CysPro: 0.492 ± 0.195
0.281CysGln: 0.281 ± 0.112
0.421CysArg: 0.421 ± 0.176
0.492CysSer: 0.492 ± 0.182
0.351CysThr: 0.351 ± 0.198
0.14CysVal: 0.14 ± 0.093
0.14CysTrp: 0.14 ± 0.087
0.351CysTyr: 0.351 ± 0.14
0.0CysXaa: 0.0 ± 0.0
Asp
3.863AspAla: 3.863 ± 0.575
0.281AspCys: 0.281 ± 0.134
4.565AspAsp: 4.565 ± 0.685
5.478AspGlu: 5.478 ± 0.738
3.231AspPhe: 3.231 ± 0.569
4.705AspGly: 4.705 ± 0.702
0.351AspHis: 0.351 ± 0.142
4.986AspIle: 4.986 ± 0.682
6.04AspLys: 6.04 ± 0.736
5.337AspLeu: 5.337 ± 0.459
1.615AspMet: 1.615 ± 0.299
3.652AspAsn: 3.652 ± 0.595
1.334AspPro: 1.334 ± 0.313
1.475AspGln: 1.475 ± 0.3
1.896AspArg: 1.896 ± 0.349
4.073AspSer: 4.073 ± 0.507
3.441AspThr: 3.441 ± 0.416
4.073AspVal: 4.073 ± 0.634
0.632AspTrp: 0.632 ± 0.223
2.528AspTyr: 2.528 ± 0.409
0.0AspXaa: 0.0 ± 0.0
Glu
4.495GluAla: 4.495 ± 0.595
0.562GluCys: 0.562 ± 0.196
4.284GluAsp: 4.284 ± 0.704
5.057GluGlu: 5.057 ± 0.756
2.669GluPhe: 2.669 ± 0.445
2.528GluGly: 2.528 ± 0.366
0.983GluHis: 0.983 ± 0.235
5.689GluIle: 5.689 ± 0.643
5.127GluLys: 5.127 ± 0.545
7.093GluLeu: 7.093 ± 1.006
2.037GluMet: 2.037 ± 0.448
4.354GluAsn: 4.354 ± 0.578
1.966GluPro: 1.966 ± 0.302
3.933GluGln: 3.933 ± 0.55
3.301GluArg: 3.301 ± 0.467
3.231GluSer: 3.231 ± 0.423
3.582GluThr: 3.582 ± 0.434
5.267GluVal: 5.267 ± 0.574
1.124GluTrp: 1.124 ± 0.253
4.354GluTyr: 4.354 ± 0.637
0.0GluXaa: 0.0 ± 0.0
Phe
1.966PheAla: 1.966 ± 0.323
0.562PheCys: 0.562 ± 0.181
3.722PheAsp: 3.722 ± 0.414
3.02PheGlu: 3.02 ± 0.389
1.405PhePhe: 1.405 ± 0.276
2.809PheGly: 2.809 ± 0.576
0.773PheHis: 0.773 ± 0.206
3.301PheIle: 3.301 ± 0.454
4.284PheLys: 4.284 ± 0.487
2.669PheLeu: 2.669 ± 0.399
0.983PheMet: 0.983 ± 0.228
2.739PheAsn: 2.739 ± 0.399
0.983PhePro: 0.983 ± 0.293
1.124PheGln: 1.124 ± 0.28
1.475PheArg: 1.475 ± 0.274
2.598PheSer: 2.598 ± 0.44
2.669PheThr: 2.669 ± 0.4
3.09PheVal: 3.09 ± 0.469
0.492PheTrp: 0.492 ± 0.188
2.037PheTyr: 2.037 ± 0.467
0.0PheXaa: 0.0 ± 0.0
Gly
4.495GlyAla: 4.495 ± 0.544
0.421GlyCys: 0.421 ± 0.165
3.582GlyAsp: 3.582 ± 0.537
2.739GlyGlu: 2.739 ± 0.478
2.458GlyPhe: 2.458 ± 0.41
2.598GlyGly: 2.598 ± 0.483
1.475GlyHis: 1.475 ± 0.333
4.354GlyIle: 4.354 ± 0.567
4.916GlyLys: 4.916 ± 0.528
4.635GlyLeu: 4.635 ± 0.709
1.124GlyMet: 1.124 ± 0.255
3.301GlyAsn: 3.301 ± 0.503
0.492GlyPro: 0.492 ± 0.179
2.318GlyGln: 2.318 ± 0.347
2.388GlyArg: 2.388 ± 0.392
3.16GlySer: 3.16 ± 0.448
3.722GlyThr: 3.722 ± 0.514
4.776GlyVal: 4.776 ± 0.74
1.124GlyTrp: 1.124 ± 0.33
2.879GlyTyr: 2.879 ± 0.415
0.0GlyXaa: 0.0 ± 0.0
His
0.983HisAla: 0.983 ± 0.275
0.14HisCys: 0.14 ± 0.094
0.702HisAsp: 0.702 ± 0.214
1.053HisGlu: 1.053 ± 0.259
0.913HisPhe: 0.913 ± 0.221
1.194HisGly: 1.194 ± 0.281
0.492HisHis: 0.492 ± 0.194
1.264HisIle: 1.264 ± 0.283
1.264HisLys: 1.264 ± 0.293
1.405HisLeu: 1.405 ± 0.318
0.211HisMet: 0.211 ± 0.109
1.124HisAsn: 1.124 ± 0.277
0.773HisPro: 0.773 ± 0.262
0.632HisGln: 0.632 ± 0.213
0.492HisArg: 0.492 ± 0.186
0.843HisSer: 0.843 ± 0.231
1.545HisThr: 1.545 ± 0.333
0.913HisVal: 0.913 ± 0.277
0.0HisTrp: 0.0 ± 0.0
0.843HisTyr: 0.843 ± 0.331
0.0HisXaa: 0.0 ± 0.0
Ile
4.495IleAla: 4.495 ± 0.529
0.281IleCys: 0.281 ± 0.127
6.672IleAsp: 6.672 ± 0.855
5.408IleGlu: 5.408 ± 0.76
3.09IlePhe: 3.09 ± 0.503
4.846IleGly: 4.846 ± 0.782
0.983IleHis: 0.983 ± 0.294
4.565IleIle: 4.565 ± 0.552
8.076IleLys: 8.076 ± 0.828
3.792IleLeu: 3.792 ± 0.383
1.686IleMet: 1.686 ± 0.295
4.424IleAsn: 4.424 ± 0.563
2.388IlePro: 2.388 ± 0.349
3.16IleGln: 3.16 ± 0.424
2.879IleArg: 2.879 ± 0.513
4.846IleSer: 4.846 ± 0.567
4.846IleThr: 4.846 ± 0.529
3.722IleVal: 3.722 ± 0.422
0.843IleTrp: 0.843 ± 0.303
2.95IleTyr: 2.95 ± 0.524
0.0IleXaa: 0.0 ± 0.0
Lys
4.776LysAla: 4.776 ± 0.494
0.492LysCys: 0.492 ± 0.204
5.97LysAsp: 5.97 ± 0.55
7.866LysGlu: 7.866 ± 0.791
3.582LysPhe: 3.582 ± 0.491
5.899LysGly: 5.899 ± 0.654
1.896LysHis: 1.896 ± 0.35
5.97LysIle: 5.97 ± 0.71
8.076LysLys: 8.076 ± 0.835
7.444LysLeu: 7.444 ± 0.764
2.528LysMet: 2.528 ± 0.365
5.478LysAsn: 5.478 ± 0.608
2.528LysPro: 2.528 ± 0.446
4.986LysGln: 4.986 ± 0.625
4.565LysArg: 4.565 ± 0.52
4.635LysSer: 4.635 ± 0.494
5.127LysThr: 5.127 ± 0.693
5.408LysVal: 5.408 ± 0.622
0.773LysTrp: 0.773 ± 0.209
4.003LysTyr: 4.003 ± 0.51
0.0LysXaa: 0.0 ± 0.0
Leu
4.144LeuAla: 4.144 ± 0.538
0.562LeuCys: 0.562 ± 0.208
4.565LeuAsp: 4.565 ± 0.509
5.408LeuGlu: 5.408 ± 0.72
3.301LeuPhe: 3.301 ± 0.437
3.16LeuGly: 3.16 ± 0.415
1.334LeuHis: 1.334 ± 0.329
4.776LeuIle: 4.776 ± 0.484
7.304LeuLys: 7.304 ± 0.792
5.408LeuLeu: 5.408 ± 0.663
2.318LeuMet: 2.318 ± 0.435
5.899LeuAsn: 5.899 ± 0.546
2.388LeuPro: 2.388 ± 0.478
2.879LeuGln: 2.879 ± 0.348
3.231LeuArg: 3.231 ± 0.588
5.478LeuSer: 5.478 ± 0.601
5.829LeuThr: 5.829 ± 0.638
4.916LeuVal: 4.916 ± 0.64
0.632LeuTrp: 0.632 ± 0.253
3.16LeuTyr: 3.16 ± 0.578
0.0LeuXaa: 0.0 ± 0.0
Met
1.545MetAla: 1.545 ± 0.471
0.0MetCys: 0.0 ± 0.0
1.405MetAsp: 1.405 ± 0.267
1.334MetGlu: 1.334 ± 0.27
1.124MetPhe: 1.124 ± 0.262
1.124MetGly: 1.124 ± 0.278
0.351MetHis: 0.351 ± 0.137
1.686MetIle: 1.686 ± 0.331
2.177MetLys: 2.177 ± 0.354
2.458MetLeu: 2.458 ± 0.353
0.983MetMet: 0.983 ± 0.274
2.177MetAsn: 2.177 ± 0.365
1.053MetPro: 1.053 ± 0.246
1.475MetGln: 1.475 ± 0.371
0.913MetArg: 0.913 ± 0.253
1.896MetSer: 1.896 ± 0.403
1.826MetThr: 1.826 ± 0.339
1.053MetVal: 1.053 ± 0.289
0.351MetTrp: 0.351 ± 0.143
0.913MetTyr: 0.913 ± 0.269
0.0MetXaa: 0.0 ± 0.0
Asn
4.424AsnAla: 4.424 ± 0.587
0.421AsnCys: 0.421 ± 0.176
4.916AsnAsp: 4.916 ± 0.602
4.846AsnGlu: 4.846 ± 0.688
2.458AsnPhe: 2.458 ± 0.45
3.933AsnGly: 3.933 ± 0.636
1.053AsnHis: 1.053 ± 0.3
4.424AsnIle: 4.424 ± 0.593
6.18AsnLys: 6.18 ± 0.712
3.582AsnLeu: 3.582 ± 0.508
1.756AsnMet: 1.756 ± 0.328
4.495AsnAsn: 4.495 ± 0.699
2.95AsnPro: 2.95 ± 0.405
2.669AsnGln: 2.669 ± 0.52
2.037AsnArg: 2.037 ± 0.349
3.441AsnSer: 3.441 ± 0.442
3.301AsnThr: 3.301 ± 0.489
3.933AsnVal: 3.933 ± 0.481
1.124AsnTrp: 1.124 ± 0.214
2.177AsnTyr: 2.177 ± 0.442
0.0AsnXaa: 0.0 ± 0.0
Pro
1.475ProAla: 1.475 ± 0.276
0.14ProCys: 0.14 ± 0.084
1.124ProAsp: 1.124 ± 0.294
1.686ProGlu: 1.686 ± 0.24
2.037ProPhe: 2.037 ± 0.321
1.545ProGly: 1.545 ± 0.455
0.492ProHis: 0.492 ± 0.173
2.037ProIle: 2.037 ± 0.401
3.511ProLys: 3.511 ± 0.529
2.247ProLeu: 2.247 ± 0.372
0.913ProMet: 0.913 ± 0.282
2.107ProAsn: 2.107 ± 0.367
0.913ProPro: 0.913 ± 0.235
1.334ProGln: 1.334 ± 0.315
1.124ProArg: 1.124 ± 0.237
2.177ProSer: 2.177 ± 0.513
2.037ProThr: 2.037 ± 0.356
1.686ProVal: 1.686 ± 0.377
0.211ProTrp: 0.211 ± 0.108
1.124ProTyr: 1.124 ± 0.278
0.0ProXaa: 0.0 ± 0.0
Gln
3.511GlnAla: 3.511 ± 0.515
0.421GlnCys: 0.421 ± 0.192
1.896GlnAsp: 1.896 ± 0.37
2.669GlnGlu: 2.669 ± 0.419
2.247GlnPhe: 2.247 ± 0.327
2.528GlnGly: 2.528 ± 0.418
1.053GlnHis: 1.053 ± 0.247
2.95GlnIle: 2.95 ± 0.332
2.739GlnLys: 2.739 ± 0.487
2.879GlnLeu: 2.879 ± 0.406
1.615GlnMet: 1.615 ± 0.397
2.95GlnAsn: 2.95 ± 0.349
1.545GlnPro: 1.545 ± 0.399
1.615GlnGln: 1.615 ± 0.416
2.177GlnArg: 2.177 ± 0.415
1.896GlnSer: 1.896 ± 0.36
1.826GlnThr: 1.826 ± 0.369
2.528GlnVal: 2.528 ± 0.47
0.351GlnTrp: 0.351 ± 0.171
1.475GlnTyr: 1.475 ± 0.336
0.0GlnXaa: 0.0 ± 0.0
Arg
1.545ArgAla: 1.545 ± 0.266
0.351ArgCys: 0.351 ± 0.147
2.669ArgAsp: 2.669 ± 0.481
2.809ArgGlu: 2.809 ± 0.372
2.318ArgPhe: 2.318 ± 0.392
2.388ArgGly: 2.388 ± 0.467
0.773ArgHis: 0.773 ± 0.235
3.231ArgIle: 3.231 ± 0.494
3.652ArgLys: 3.652 ± 0.496
3.792ArgLeu: 3.792 ± 0.597
0.843ArgMet: 0.843 ± 0.23
2.809ArgAsn: 2.809 ± 0.382
0.983ArgPro: 0.983 ± 0.236
2.107ArgGln: 2.107 ± 0.396
1.194ArgArg: 1.194 ± 0.325
1.405ArgSer: 1.405 ± 0.379
2.247ArgThr: 2.247 ± 0.424
2.388ArgVal: 2.388 ± 0.407
0.632ArgTrp: 0.632 ± 0.193
2.598ArgTyr: 2.598 ± 0.489
0.0ArgXaa: 0.0 ± 0.0
Ser
4.354SerAla: 4.354 ± 0.605
0.281SerCys: 0.281 ± 0.181
3.652SerAsp: 3.652 ± 0.437
3.652SerGlu: 3.652 ± 0.569
2.879SerPhe: 2.879 ± 0.389
4.354SerGly: 4.354 ± 0.595
0.843SerHis: 0.843 ± 0.205
4.565SerIle: 4.565 ± 0.65
5.829SerLys: 5.829 ± 0.572
4.144SerLeu: 4.144 ± 0.576
1.615SerMet: 1.615 ± 0.289
4.565SerAsn: 4.565 ± 0.528
1.475SerPro: 1.475 ± 0.38
2.318SerGln: 2.318 ± 0.517
1.966SerArg: 1.966 ± 0.22
3.301SerSer: 3.301 ± 0.511
3.441SerThr: 3.441 ± 0.393
3.582SerVal: 3.582 ± 0.54
0.562SerTrp: 0.562 ± 0.17
2.107SerTyr: 2.107 ± 0.315
0.0SerXaa: 0.0 ± 0.0
Thr
3.933ThrAla: 3.933 ± 0.565
0.211ThrCys: 0.211 ± 0.121
3.722ThrAsp: 3.722 ± 0.444
3.933ThrGlu: 3.933 ± 0.445
2.669ThrPhe: 2.669 ± 0.482
3.722ThrGly: 3.722 ± 0.567
1.053ThrHis: 1.053 ± 0.215
4.846ThrIle: 4.846 ± 0.603
4.565ThrLys: 4.565 ± 0.646
5.478ThrLeu: 5.478 ± 0.675
0.773ThrMet: 0.773 ± 0.206
4.003ThrAsn: 4.003 ± 0.618
1.826ThrPro: 1.826 ± 0.338
2.458ThrGln: 2.458 ± 0.476
2.669ThrArg: 2.669 ± 0.398
4.705ThrSer: 4.705 ± 0.729
4.284ThrThr: 4.284 ± 0.531
4.214ThrVal: 4.214 ± 0.513
0.913ThrTrp: 0.913 ± 0.29
2.458ThrTyr: 2.458 ± 0.346
0.0ThrXaa: 0.0 ± 0.0
Val
4.354ValAla: 4.354 ± 0.752
0.211ValCys: 0.211 ± 0.114
4.284ValAsp: 4.284 ± 0.78
5.899ValGlu: 5.899 ± 0.697
1.826ValPhe: 1.826 ± 0.364
2.95ValGly: 2.95 ± 0.456
0.843ValHis: 0.843 ± 0.229
5.267ValIle: 5.267 ± 0.673
6.742ValLys: 6.742 ± 0.544
5.057ValLeu: 5.057 ± 0.524
1.615ValMet: 1.615 ± 0.339
3.09ValAsn: 3.09 ± 0.419
2.107ValPro: 2.107 ± 0.348
1.053ValGln: 1.053 ± 0.33
2.458ValArg: 2.458 ± 0.413
3.441ValSer: 3.441 ± 0.526
4.284ValThr: 4.284 ± 0.64
3.933ValVal: 3.933 ± 0.476
0.632ValTrp: 0.632 ± 0.207
2.388ValTyr: 2.388 ± 0.471
0.0ValXaa: 0.0 ± 0.0
Trp
0.702TrpAla: 0.702 ± 0.236
0.14TrpCys: 0.14 ± 0.103
0.562TrpAsp: 0.562 ± 0.181
1.053TrpGlu: 1.053 ± 0.236
0.421TrpPhe: 0.421 ± 0.156
0.702TrpGly: 0.702 ± 0.288
0.421TrpHis: 0.421 ± 0.143
0.913TrpIle: 0.913 ± 0.309
0.983TrpLys: 0.983 ± 0.268
0.983TrpLeu: 0.983 ± 0.254
0.281TrpMet: 0.281 ± 0.13
0.702TrpAsn: 0.702 ± 0.226
0.07TrpPro: 0.07 ± 0.056
0.773TrpGln: 0.773 ± 0.238
0.421TrpArg: 0.421 ± 0.163
0.632TrpSer: 0.632 ± 0.218
0.843TrpThr: 0.843 ± 0.193
0.913TrpVal: 0.913 ± 0.217
0.0TrpTrp: 0.0 ± 0.0
0.562TrpTyr: 0.562 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.966TyrAla: 1.966 ± 0.366
0.351TyrCys: 0.351 ± 0.167
2.458TyrAsp: 2.458 ± 0.428
3.792TyrGlu: 3.792 ± 0.521
1.264TyrPhe: 1.264 ± 0.341
2.107TyrGly: 2.107 ± 0.473
0.562TyrHis: 0.562 ± 0.22
3.441TyrIle: 3.441 ± 0.568
3.933TyrLys: 3.933 ± 0.607
3.16TyrLeu: 3.16 ± 0.556
1.053TyrMet: 1.053 ± 0.287
2.528TyrAsn: 2.528 ± 0.454
0.913TyrPro: 0.913 ± 0.295
1.756TyrGln: 1.756 ± 0.303
2.879TyrArg: 2.879 ± 0.514
3.16TyrSer: 3.16 ± 0.582
2.598TyrThr: 2.598 ± 0.458
2.739TyrVal: 2.739 ± 0.416
0.632TyrTrp: 0.632 ± 0.162
2.107TyrTyr: 2.107 ± 0.537
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (14240 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski