Amino acid dipepetide frequency for Staphylococcus virus EW

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.327AlaAla: 1.327 ± 0.316
0.21AlaCys: 0.21 ± 0.111
2.934AlaAsp: 2.934 ± 0.47
3.773AlaGlu: 3.773 ± 0.522
3.004AlaPhe: 3.004 ± 0.563
2.794AlaGly: 2.794 ± 0.521
1.118AlaHis: 1.118 ± 0.234
4.751AlaIle: 4.751 ± 0.617
5.868AlaLys: 5.868 ± 0.61
4.681AlaLeu: 4.681 ± 0.955
1.258AlaMet: 1.258 ± 0.359
3.493AlaAsn: 3.493 ± 0.428
1.816AlaPro: 1.816 ± 0.297
2.375AlaGln: 2.375 ± 0.443
2.236AlaArg: 2.236 ± 0.342
3.633AlaSer: 3.633 ± 0.445
3.703AlaThr: 3.703 ± 0.562
4.262AlaVal: 4.262 ± 0.613
0.768AlaTrp: 0.768 ± 0.289
1.467AlaTyr: 1.467 ± 0.277
0.0AlaXaa: 0.0 ± 0.0
Cys
0.21CysAla: 0.21 ± 0.124
0.07CysCys: 0.07 ± 0.074
0.419CysAsp: 0.419 ± 0.155
0.0CysGlu: 0.0 ± 0.0
0.21CysPhe: 0.21 ± 0.122
0.419CysGly: 0.419 ± 0.15
0.07CysHis: 0.07 ± 0.059
0.419CysIle: 0.419 ± 0.197
0.559CysLys: 0.559 ± 0.173
0.419CysLeu: 0.419 ± 0.189
0.0CysMet: 0.0 ± 0.0
0.489CysAsn: 0.489 ± 0.176
0.14CysPro: 0.14 ± 0.136
0.349CysGln: 0.349 ± 0.14
0.419CysArg: 0.419 ± 0.145
0.21CysSer: 0.21 ± 0.163
0.419CysThr: 0.419 ± 0.203
0.489CysVal: 0.489 ± 0.206
0.14CysTrp: 0.14 ± 0.086
0.14CysTyr: 0.14 ± 0.098
0.0CysXaa: 0.0 ± 0.0
Asp
3.144AspAla: 3.144 ± 0.477
0.21AspCys: 0.21 ± 0.137
4.541AspAsp: 4.541 ± 0.769
4.331AspGlu: 4.331 ± 0.711
2.934AspPhe: 2.934 ± 0.366
4.052AspGly: 4.052 ± 0.468
0.699AspHis: 0.699 ± 0.205
4.401AspIle: 4.401 ± 0.668
5.729AspLys: 5.729 ± 0.578
5.17AspLeu: 5.17 ± 0.576
1.886AspMet: 1.886 ± 0.358
4.192AspAsn: 4.192 ± 0.642
1.816AspPro: 1.816 ± 0.349
1.537AspGln: 1.537 ± 0.309
2.375AspArg: 2.375 ± 0.427
3.493AspSer: 3.493 ± 0.516
3.214AspThr: 3.214 ± 0.461
4.96AspVal: 4.96 ± 0.596
0.768AspTrp: 0.768 ± 0.226
3.703AspTyr: 3.703 ± 0.644
0.0AspXaa: 0.0 ± 0.0
Glu
3.773GluAla: 3.773 ± 0.663
0.629GluCys: 0.629 ± 0.213
4.192GluAsp: 4.192 ± 0.66
3.842GluGlu: 3.842 ± 0.63
3.214GluPhe: 3.214 ± 0.508
3.773GluGly: 3.773 ± 0.451
1.118GluHis: 1.118 ± 0.311
3.703GluIle: 3.703 ± 0.454
4.96GluLys: 4.96 ± 0.595
6.986GluLeu: 6.986 ± 0.922
1.816GluMet: 1.816 ± 0.428
4.331GluAsn: 4.331 ± 0.552
1.397GluPro: 1.397 ± 0.276
3.004GluGln: 3.004 ± 0.459
3.633GluArg: 3.633 ± 0.617
3.912GluSer: 3.912 ± 0.521
4.052GluThr: 4.052 ± 0.513
4.751GluVal: 4.751 ± 0.532
0.838GluTrp: 0.838 ± 0.198
3.074GluTyr: 3.074 ± 0.527
0.0GluXaa: 0.0 ± 0.0
Phe
1.886PheAla: 1.886 ± 0.343
0.07PheCys: 0.07 ± 0.074
2.655PheAsp: 2.655 ± 0.369
3.144PheGlu: 3.144 ± 0.576
1.258PhePhe: 1.258 ± 0.295
2.864PheGly: 2.864 ± 0.469
0.908PheHis: 0.908 ± 0.231
3.912PheIle: 3.912 ± 0.764
3.004PheLys: 3.004 ± 0.422
2.864PheLeu: 2.864 ± 0.445
1.188PheMet: 1.188 ± 0.306
3.004PheAsn: 3.004 ± 0.334
0.978PhePro: 0.978 ± 0.222
1.607PheGln: 1.607 ± 0.354
1.118PheArg: 1.118 ± 0.275
2.236PheSer: 2.236 ± 0.405
3.144PheThr: 3.144 ± 0.442
2.655PheVal: 2.655 ± 0.547
0.279PheTrp: 0.279 ± 0.171
2.375PheTyr: 2.375 ± 0.42
0.0PheXaa: 0.0 ± 0.0
Gly
4.262GlyAla: 4.262 ± 0.876
0.279GlyCys: 0.279 ± 0.135
3.703GlyAsp: 3.703 ± 0.792
3.423GlyGlu: 3.423 ± 0.55
2.794GlyPhe: 2.794 ± 0.435
3.493GlyGly: 3.493 ± 0.763
0.978GlyHis: 0.978 ± 0.255
4.471GlyIle: 4.471 ± 0.539
4.611GlyLys: 4.611 ± 0.553
4.401GlyLeu: 4.401 ± 0.686
1.188GlyMet: 1.188 ± 0.35
3.144GlyAsn: 3.144 ± 0.599
1.607GlyPro: 1.607 ± 0.572
2.655GlyGln: 2.655 ± 0.462
2.655GlyArg: 2.655 ± 0.427
4.052GlySer: 4.052 ± 0.484
3.773GlyThr: 3.773 ± 0.696
5.1GlyVal: 5.1 ± 0.653
1.258GlyTrp: 1.258 ± 0.43
3.703GlyTyr: 3.703 ± 0.69
0.0GlyXaa: 0.0 ± 0.0
His
0.559HisAla: 0.559 ± 0.201
0.21HisCys: 0.21 ± 0.135
0.838HisAsp: 0.838 ± 0.221
0.978HisGlu: 0.978 ± 0.242
0.768HisPhe: 0.768 ± 0.239
1.118HisGly: 1.118 ± 0.229
0.419HisHis: 0.419 ± 0.146
1.118HisIle: 1.118 ± 0.307
1.816HisLys: 1.816 ± 0.351
1.607HisLeu: 1.607 ± 0.317
0.349HisMet: 0.349 ± 0.138
1.048HisAsn: 1.048 ± 0.253
0.838HisPro: 0.838 ± 0.243
0.699HisGln: 0.699 ± 0.225
0.699HisArg: 0.699 ± 0.217
1.327HisSer: 1.327 ± 0.304
1.747HisThr: 1.747 ± 0.365
1.048HisVal: 1.048 ± 0.37
0.0HisTrp: 0.0 ± 0.0
0.838HisTyr: 0.838 ± 0.244
0.0HisXaa: 0.0 ± 0.0
Ile
5.17IleAla: 5.17 ± 0.697
0.21IleCys: 0.21 ± 0.107
5.938IleAsp: 5.938 ± 0.822
6.288IleGlu: 6.288 ± 0.698
2.026IlePhe: 2.026 ± 0.398
4.681IleGly: 4.681 ± 0.776
1.327IleHis: 1.327 ± 0.324
4.122IleIle: 4.122 ± 0.556
7.405IleLys: 7.405 ± 0.795
4.82IleLeu: 4.82 ± 0.65
1.537IleMet: 1.537 ± 0.341
5.379IleAsn: 5.379 ± 0.67
2.585IlePro: 2.585 ± 0.368
2.515IleGln: 2.515 ± 0.426
3.283IleArg: 3.283 ± 0.431
3.982IleSer: 3.982 ± 0.742
4.96IleThr: 4.96 ± 0.558
3.773IleVal: 3.773 ± 0.468
0.699IleTrp: 0.699 ± 0.231
2.585IleTyr: 2.585 ± 0.401
0.0IleXaa: 0.0 ± 0.0
Lys
4.401LysAla: 4.401 ± 0.7
0.14LysCys: 0.14 ± 0.096
4.82LysAsp: 4.82 ± 0.539
6.008LysGlu: 6.008 ± 0.753
2.864LysPhe: 2.864 ± 0.413
6.846LysGly: 6.846 ± 0.861
1.607LysHis: 1.607 ± 0.349
5.729LysIle: 5.729 ± 0.542
5.799LysLys: 5.799 ± 0.832
6.567LysLeu: 6.567 ± 0.66
2.166LysMet: 2.166 ± 0.396
4.401LysAsn: 4.401 ± 0.528
3.144LysPro: 3.144 ± 0.644
4.262LysGln: 4.262 ± 0.737
5.24LysArg: 5.24 ± 0.736
5.659LysSer: 5.659 ± 0.709
5.379LysThr: 5.379 ± 0.682
4.96LysVal: 4.96 ± 0.568
1.118LysTrp: 1.118 ± 0.27
4.331LysTyr: 4.331 ± 0.52
0.0LysXaa: 0.0 ± 0.0
Leu
4.681LeuAla: 4.681 ± 0.664
0.768LeuCys: 0.768 ± 0.3
5.589LeuAsp: 5.589 ± 0.605
6.078LeuGlu: 6.078 ± 0.701
3.773LeuPhe: 3.773 ± 0.455
4.541LeuGly: 4.541 ± 0.708
1.118LeuHis: 1.118 ± 0.302
5.24LeuIle: 5.24 ± 0.637
8.314LeuLys: 8.314 ± 0.756
5.659LeuLeu: 5.659 ± 0.771
2.166LeuMet: 2.166 ± 0.404
5.449LeuAsn: 5.449 ± 0.629
1.886LeuPro: 1.886 ± 0.298
3.703LeuGln: 3.703 ± 0.412
3.423LeuArg: 3.423 ± 0.462
4.331LeuSer: 4.331 ± 0.472
3.912LeuThr: 3.912 ± 0.491
3.773LeuVal: 3.773 ± 0.496
0.699LeuTrp: 0.699 ± 0.297
3.144LeuTyr: 3.144 ± 0.552
0.0LeuXaa: 0.0 ± 0.0
Met
1.747MetAla: 1.747 ± 0.436
0.07MetCys: 0.07 ± 0.069
1.467MetAsp: 1.467 ± 0.35
0.978MetGlu: 0.978 ± 0.217
0.978MetPhe: 0.978 ± 0.25
1.118MetGly: 1.118 ± 0.286
0.419MetHis: 0.419 ± 0.153
1.537MetIle: 1.537 ± 0.358
1.886MetLys: 1.886 ± 0.399
2.445MetLeu: 2.445 ± 0.343
0.768MetMet: 0.768 ± 0.216
1.607MetAsn: 1.607 ± 0.37
0.978MetPro: 0.978 ± 0.266
1.118MetGln: 1.118 ± 0.362
1.258MetArg: 1.258 ± 0.256
1.467MetSer: 1.467 ± 0.384
2.515MetThr: 2.515 ± 0.393
1.467MetVal: 1.467 ± 0.365
0.489MetTrp: 0.489 ± 0.163
1.118MetTyr: 1.118 ± 0.277
0.0MetXaa: 0.0 ± 0.0
Asn
4.471AsnAla: 4.471 ± 0.498
0.489AsnCys: 0.489 ± 0.214
4.192AsnAsp: 4.192 ± 0.641
3.703AsnGlu: 3.703 ± 0.454
2.585AsnPhe: 2.585 ± 0.537
4.052AsnGly: 4.052 ± 0.519
1.258AsnHis: 1.258 ± 0.303
4.331AsnIle: 4.331 ± 0.665
6.288AsnLys: 6.288 ± 0.816
4.96AsnLeu: 4.96 ± 0.576
1.397AsnMet: 1.397 ± 0.256
4.751AsnAsn: 4.751 ± 0.613
2.794AsnPro: 2.794 ± 0.539
2.934AsnGln: 2.934 ± 0.527
2.375AsnArg: 2.375 ± 0.432
3.144AsnSer: 3.144 ± 0.577
3.633AsnThr: 3.633 ± 0.543
3.633AsnVal: 3.633 ± 0.516
0.978AsnTrp: 0.978 ± 0.255
2.585AsnTyr: 2.585 ± 0.414
0.0AsnXaa: 0.0 ± 0.0
Pro
1.607ProAla: 1.607 ± 0.304
0.279ProCys: 0.279 ± 0.187
1.816ProAsp: 1.816 ± 0.409
2.236ProGlu: 2.236 ± 0.326
1.607ProPhe: 1.607 ± 0.399
1.188ProGly: 1.188 ± 0.285
0.768ProHis: 0.768 ± 0.259
2.934ProIle: 2.934 ± 0.42
2.864ProLys: 2.864 ± 0.532
2.026ProLeu: 2.026 ± 0.359
0.978ProMet: 0.978 ± 0.38
2.166ProAsn: 2.166 ± 0.501
1.118ProPro: 1.118 ± 0.311
1.118ProGln: 1.118 ± 0.282
1.467ProArg: 1.467 ± 0.298
1.747ProSer: 1.747 ± 0.372
1.397ProThr: 1.397 ± 0.343
2.305ProVal: 2.305 ± 0.401
0.21ProTrp: 0.21 ± 0.126
1.677ProTyr: 1.677 ± 0.385
0.0ProXaa: 0.0 ± 0.0
Gln
2.515GlnAla: 2.515 ± 0.458
0.419GlnCys: 0.419 ± 0.17
2.445GlnAsp: 2.445 ± 0.536
3.423GlnGlu: 3.423 ± 0.651
1.816GlnPhe: 1.816 ± 0.397
2.375GlnGly: 2.375 ± 0.492
0.699GlnHis: 0.699 ± 0.283
3.842GlnIle: 3.842 ± 0.511
2.375GlnLys: 2.375 ± 0.448
3.912GlnLeu: 3.912 ± 0.621
1.327GlnMet: 1.327 ± 0.289
2.166GlnAsn: 2.166 ± 0.535
1.327GlnPro: 1.327 ± 0.248
2.096GlnGln: 2.096 ± 0.395
1.956GlnArg: 1.956 ± 0.331
3.214GlnSer: 3.214 ± 0.485
1.537GlnThr: 1.537 ± 0.305
2.515GlnVal: 2.515 ± 0.351
0.349GlnTrp: 0.349 ± 0.152
1.607GlnTyr: 1.607 ± 0.432
0.0GlnXaa: 0.0 ± 0.0
Arg
2.305ArgAla: 2.305 ± 0.463
0.279ArgCys: 0.279 ± 0.127
2.864ArgAsp: 2.864 ± 0.425
2.445ArgGlu: 2.445 ± 0.502
2.096ArgPhe: 2.096 ± 0.316
2.236ArgGly: 2.236 ± 0.422
0.978ArgHis: 0.978 ± 0.281
3.703ArgIle: 3.703 ± 0.47
3.773ArgLys: 3.773 ± 0.566
4.262ArgLeu: 4.262 ± 0.586
1.537ArgMet: 1.537 ± 0.319
2.934ArgAsn: 2.934 ± 0.395
1.048ArgPro: 1.048 ± 0.337
1.677ArgGln: 1.677 ± 0.354
1.327ArgArg: 1.327 ± 0.309
2.305ArgSer: 2.305 ± 0.397
2.096ArgThr: 2.096 ± 0.339
2.864ArgVal: 2.864 ± 0.532
0.768ArgTrp: 0.768 ± 0.228
1.886ArgTyr: 1.886 ± 0.372
0.0ArgXaa: 0.0 ± 0.0
Ser
2.864SerAla: 2.864 ± 0.539
0.14SerCys: 0.14 ± 0.089
3.423SerAsp: 3.423 ± 0.5
3.773SerGlu: 3.773 ± 0.487
2.096SerPhe: 2.096 ± 0.343
4.262SerGly: 4.262 ± 0.474
1.258SerHis: 1.258 ± 0.34
3.912SerIle: 3.912 ± 0.796
5.659SerLys: 5.659 ± 0.581
3.773SerLeu: 3.773 ± 0.448
1.537SerMet: 1.537 ± 0.311
4.611SerAsn: 4.611 ± 0.578
1.677SerPro: 1.677 ± 0.558
2.655SerGln: 2.655 ± 0.529
2.375SerArg: 2.375 ± 0.346
3.912SerSer: 3.912 ± 0.645
4.611SerThr: 4.611 ± 0.537
3.912SerVal: 3.912 ± 0.722
0.838SerTrp: 0.838 ± 0.225
2.655SerTyr: 2.655 ± 0.414
0.0SerXaa: 0.0 ± 0.0
Thr
3.633ThrAla: 3.633 ± 0.705
0.279ThrCys: 0.279 ± 0.115
3.912ThrAsp: 3.912 ± 0.55
3.493ThrGlu: 3.493 ± 0.43
2.934ThrPhe: 2.934 ± 0.528
3.703ThrGly: 3.703 ± 0.621
1.537ThrHis: 1.537 ± 0.253
4.331ThrIle: 4.331 ± 0.566
4.401ThrLys: 4.401 ± 0.613
5.1ThrLeu: 5.1 ± 0.653
1.467ThrMet: 1.467 ± 0.272
3.423ThrAsn: 3.423 ± 0.537
2.725ThrPro: 2.725 ± 0.567
2.655ThrGln: 2.655 ± 0.401
2.096ThrArg: 2.096 ± 0.43
3.703ThrSer: 3.703 ± 0.47
4.681ThrThr: 4.681 ± 1.101
4.611ThrVal: 4.611 ± 0.732
0.838ThrTrp: 0.838 ± 0.241
2.655ThrTyr: 2.655 ± 0.425
0.0ThrXaa: 0.0 ± 0.0
Val
3.703ValAla: 3.703 ± 0.499
0.419ValCys: 0.419 ± 0.15
4.471ValAsp: 4.471 ± 0.546
5.1ValGlu: 5.1 ± 0.685
2.026ValPhe: 2.026 ± 0.462
4.471ValGly: 4.471 ± 0.696
0.768ValHis: 0.768 ± 0.222
6.078ValIle: 6.078 ± 0.603
5.659ValLys: 5.659 ± 0.673
4.471ValLeu: 4.471 ± 0.755
1.677ValMet: 1.677 ± 0.311
3.842ValAsn: 3.842 ± 0.577
2.236ValPro: 2.236 ± 0.359
2.655ValGln: 2.655 ± 0.433
2.725ValArg: 2.725 ± 0.371
4.541ValSer: 4.541 ± 0.688
3.912ValThr: 3.912 ± 0.484
4.751ValVal: 4.751 ± 0.607
0.838ValTrp: 0.838 ± 0.304
1.607ValTyr: 1.607 ± 0.342
0.0ValXaa: 0.0 ± 0.0
Trp
0.768TrpAla: 0.768 ± 0.289
0.07TrpCys: 0.07 ± 0.059
0.629TrpAsp: 0.629 ± 0.208
0.838TrpGlu: 0.838 ± 0.24
0.489TrpPhe: 0.489 ± 0.192
0.629TrpGly: 0.629 ± 0.286
0.279TrpHis: 0.279 ± 0.129
1.118TrpIle: 1.118 ± 0.295
0.559TrpLys: 0.559 ± 0.178
1.118TrpLeu: 1.118 ± 0.287
0.21TrpMet: 0.21 ± 0.132
0.908TrpAsn: 0.908 ± 0.415
0.14TrpPro: 0.14 ± 0.1
0.419TrpGln: 0.419 ± 0.222
0.838TrpArg: 0.838 ± 0.214
0.699TrpSer: 0.699 ± 0.219
1.188TrpThr: 1.188 ± 0.323
1.118TrpVal: 1.118 ± 0.272
0.21TrpTrp: 0.21 ± 0.126
0.349TrpTyr: 0.349 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.445TyrAla: 2.445 ± 0.414
0.349TyrCys: 0.349 ± 0.154
2.375TyrAsp: 2.375 ± 0.404
3.144TyrGlu: 3.144 ± 0.569
1.677TyrPhe: 1.677 ± 0.348
2.934TyrGly: 2.934 ± 0.491
0.699TyrHis: 0.699 ± 0.256
3.633TyrIle: 3.633 ± 0.423
3.912TyrLys: 3.912 ± 0.578
3.004TyrLeu: 3.004 ± 0.365
0.908TyrMet: 0.908 ± 0.241
3.214TyrAsn: 3.214 ± 0.377
1.258TyrPro: 1.258 ± 0.352
1.886TyrGln: 1.886 ± 0.342
1.886TyrArg: 1.886 ± 0.353
2.375TyrSer: 2.375 ± 0.326
2.305TyrThr: 2.305 ± 0.424
3.004TyrVal: 3.004 ± 0.387
0.489TyrTrp: 0.489 ± 0.168
2.166TyrTyr: 2.166 ± 0.566
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (14315 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski