Amino acid dipepetide frequency for Staphylococcus virus 47

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.512AlaAla: 2.512 ± 0.653
0.279AlaCys: 0.279 ± 0.128
2.791AlaAsp: 2.791 ± 0.417
4.326AlaGlu: 4.326 ± 0.488
1.465AlaPhe: 1.465 ± 0.277
3.559AlaGly: 3.559 ± 0.774
0.907AlaHis: 0.907 ± 0.244
4.675AlaIle: 4.675 ± 0.747
6.21AlaLys: 6.21 ± 1.068
5.373AlaLeu: 5.373 ± 0.656
1.256AlaMet: 1.256 ± 0.251
3.908AlaAsn: 3.908 ± 0.782
1.326AlaPro: 1.326 ± 0.334
1.465AlaGln: 1.465 ± 0.291
2.652AlaArg: 2.652 ± 0.361
4.605AlaSer: 4.605 ± 0.755
3.349AlaThr: 3.349 ± 0.499
3.14AlaVal: 3.14 ± 0.47
1.396AlaTrp: 1.396 ± 0.367
2.582AlaTyr: 2.582 ± 0.453
0.0AlaXaa: 0.0 ± 0.0
Cys
0.209CysAla: 0.209 ± 0.115
0.14CysCys: 0.14 ± 0.105
0.14CysAsp: 0.14 ± 0.103
0.349CysGlu: 0.349 ± 0.203
0.349CysPhe: 0.349 ± 0.151
0.14CysGly: 0.14 ± 0.08
0.14CysHis: 0.14 ± 0.099
0.628CysIle: 0.628 ± 0.194
0.698CysLys: 0.698 ± 0.231
1.047CysLeu: 1.047 ± 0.315
0.279CysMet: 0.279 ± 0.201
0.349CysAsn: 0.349 ± 0.163
0.07CysPro: 0.07 ± 0.069
0.209CysGln: 0.209 ± 0.113
0.349CysArg: 0.349 ± 0.17
0.279CysSer: 0.279 ± 0.135
0.349CysThr: 0.349 ± 0.156
0.209CysVal: 0.209 ± 0.129
0.0CysTrp: 0.0 ± 0.0
0.279CysTyr: 0.279 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
3.0AspAla: 3.0 ± 0.542
0.419AspCys: 0.419 ± 0.189
3.489AspAsp: 3.489 ± 0.548
5.024AspGlu: 5.024 ± 0.818
3.28AspPhe: 3.28 ± 0.416
3.419AspGly: 3.419 ± 0.511
0.698AspHis: 0.698 ± 0.164
5.233AspIle: 5.233 ± 0.528
6.908AspLys: 6.908 ± 0.585
5.652AspLeu: 5.652 ± 0.542
2.093AspMet: 2.093 ± 0.371
3.0AspAsn: 3.0 ± 0.356
0.977AspPro: 0.977 ± 0.309
1.186AspGln: 1.186 ± 0.236
1.884AspArg: 1.884 ± 0.338
3.559AspSer: 3.559 ± 0.558
3.768AspThr: 3.768 ± 0.441
4.047AspVal: 4.047 ± 0.403
0.768AspTrp: 0.768 ± 0.237
2.861AspTyr: 2.861 ± 0.504
0.0AspXaa: 0.0 ± 0.0
Glu
4.536GluAla: 4.536 ± 0.498
0.628GluCys: 0.628 ± 0.243
4.396GluAsp: 4.396 ± 0.712
7.117GluGlu: 7.117 ± 1.076
3.0GluPhe: 3.0 ± 0.443
3.559GluGly: 3.559 ± 0.51
1.116GluHis: 1.116 ± 0.305
5.792GluIle: 5.792 ± 0.928
8.583GluLys: 8.583 ± 0.787
7.676GluLeu: 7.676 ± 0.816
2.652GluMet: 2.652 ± 0.39
5.164GluAsn: 5.164 ± 0.629
1.186GluPro: 1.186 ± 0.301
3.14GluGln: 3.14 ± 0.544
3.21GluArg: 3.21 ± 0.624
3.349GluSer: 3.349 ± 0.482
4.396GluThr: 4.396 ± 0.625
3.628GluVal: 3.628 ± 0.428
0.837GluTrp: 0.837 ± 0.18
2.791GluTyr: 2.791 ± 0.551
0.0GluXaa: 0.0 ± 0.0
Phe
1.884PheAla: 1.884 ± 0.349
0.419PheCys: 0.419 ± 0.193
3.07PheAsp: 3.07 ± 0.342
2.721PheGlu: 2.721 ± 0.488
1.256PhePhe: 1.256 ± 0.277
3.489PheGly: 3.489 ± 0.542
0.419PheHis: 0.419 ± 0.15
3.0PheIle: 3.0 ± 0.55
4.187PheLys: 4.187 ± 0.583
2.372PheLeu: 2.372 ± 0.363
1.186PheMet: 1.186 ± 0.253
3.14PheAsn: 3.14 ± 0.504
0.698PhePro: 0.698 ± 0.262
0.907PheGln: 0.907 ± 0.185
1.047PheArg: 1.047 ± 0.226
2.303PheSer: 2.303 ± 0.424
2.024PheThr: 2.024 ± 0.359
1.814PheVal: 1.814 ± 0.361
0.279PheTrp: 0.279 ± 0.128
2.163PheTyr: 2.163 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
3.908GlyAla: 3.908 ± 0.989
0.14GlyCys: 0.14 ± 0.086
4.326GlyAsp: 4.326 ± 0.47
3.489GlyGlu: 3.489 ± 0.498
2.442GlyPhe: 2.442 ± 0.337
4.326GlyGly: 4.326 ± 1.176
1.326GlyHis: 1.326 ± 0.332
3.698GlyIle: 3.698 ± 0.518
5.792GlyLys: 5.792 ± 0.57
5.582GlyLeu: 5.582 ± 0.985
1.535GlyMet: 1.535 ± 0.342
3.07GlyAsn: 3.07 ± 0.473
0.907GlyPro: 0.907 ± 0.215
1.744GlyGln: 1.744 ± 0.358
2.372GlyArg: 2.372 ± 0.419
3.28GlySer: 3.28 ± 0.553
3.349GlyThr: 3.349 ± 0.504
4.257GlyVal: 4.257 ± 0.676
0.837GlyTrp: 0.837 ± 0.241
2.721GlyTyr: 2.721 ± 0.523
0.0GlyXaa: 0.0 ± 0.0
His
0.977HisAla: 0.977 ± 0.267
0.07HisCys: 0.07 ± 0.069
0.558HisAsp: 0.558 ± 0.242
1.186HisGlu: 1.186 ± 0.258
0.907HisPhe: 0.907 ± 0.173
1.326HisGly: 1.326 ± 0.265
0.488HisHis: 0.488 ± 0.2
1.465HisIle: 1.465 ± 0.402
1.256HisLys: 1.256 ± 0.268
1.535HisLeu: 1.535 ± 0.289
0.279HisMet: 0.279 ± 0.155
1.116HisAsn: 1.116 ± 0.346
0.907HisPro: 0.907 ± 0.219
0.837HisGln: 0.837 ± 0.247
0.768HisArg: 0.768 ± 0.21
1.186HisSer: 1.186 ± 0.232
0.977HisThr: 0.977 ± 0.219
0.977HisVal: 0.977 ± 0.246
0.279HisTrp: 0.279 ± 0.13
1.047HisTyr: 1.047 ± 0.224
0.0HisXaa: 0.0 ± 0.0
Ile
4.047IleAla: 4.047 ± 0.6
0.419IleCys: 0.419 ± 0.171
5.024IleAsp: 5.024 ± 0.648
6.21IleGlu: 6.21 ± 0.682
2.372IlePhe: 2.372 ± 0.532
2.861IleGly: 2.861 ± 0.541
1.814IleHis: 1.814 ± 0.337
4.396IleIle: 4.396 ± 0.649
7.745IleLys: 7.745 ± 0.834
5.024IleLeu: 5.024 ± 0.761
1.605IleMet: 1.605 ± 0.37
4.815IleAsn: 4.815 ± 0.507
2.652IlePro: 2.652 ± 0.441
1.744IleGln: 1.744 ± 0.335
3.419IleArg: 3.419 ± 0.433
5.024IleSer: 5.024 ± 0.543
3.838IleThr: 3.838 ± 0.544
4.047IleVal: 4.047 ± 0.577
0.628IleTrp: 0.628 ± 0.209
3.14IleTyr: 3.14 ± 0.574
0.0IleXaa: 0.0 ± 0.0
Lys
7.676LysAla: 7.676 ± 1.087
0.209LysCys: 0.209 ± 0.125
4.954LysAsp: 4.954 ± 0.528
8.722LysGlu: 8.722 ± 0.978
2.233LysPhe: 2.233 ± 0.341
5.094LysGly: 5.094 ± 0.873
2.024LysHis: 2.024 ± 0.414
6.908LysIle: 6.908 ± 0.796
8.234LysLys: 8.234 ± 0.961
9.281LysLeu: 9.281 ± 0.862
2.652LysMet: 2.652 ± 0.395
5.861LysAsn: 5.861 ± 0.724
2.442LysPro: 2.442 ± 0.396
4.605LysGln: 4.605 ± 0.528
3.698LysArg: 3.698 ± 0.645
6.071LysSer: 6.071 ± 1.216
5.652LysThr: 5.652 ± 0.654
5.373LysVal: 5.373 ± 0.632
1.884LysTrp: 1.884 ± 0.365
5.094LysTyr: 5.094 ± 0.591
0.0LysXaa: 0.0 ± 0.0
Leu
4.257LeuAla: 4.257 ± 0.825
0.698LeuCys: 0.698 ± 0.218
5.024LeuAsp: 5.024 ± 0.831
7.048LeuGlu: 7.048 ± 0.662
2.582LeuPhe: 2.582 ± 0.334
4.536LeuGly: 4.536 ± 0.907
1.186LeuHis: 1.186 ± 0.366
5.443LeuIle: 5.443 ± 0.717
8.862LeuLys: 8.862 ± 1.105
7.397LeuLeu: 7.397 ± 0.78
2.163LeuMet: 2.163 ± 0.37
6.071LeuAsn: 6.071 ± 0.654
3.21LeuPro: 3.21 ± 0.521
3.698LeuGln: 3.698 ± 0.581
3.349LeuArg: 3.349 ± 0.42
5.722LeuSer: 5.722 ± 0.734
5.861LeuThr: 5.861 ± 0.699
4.047LeuVal: 4.047 ± 0.524
0.419LeuTrp: 0.419 ± 0.19
3.908LeuTyr: 3.908 ± 0.732
0.0LeuXaa: 0.0 ± 0.0
Met
1.396MetAla: 1.396 ± 0.316
0.14MetCys: 0.14 ± 0.089
0.977MetAsp: 0.977 ± 0.289
1.396MetGlu: 1.396 ± 0.306
0.837MetPhe: 0.837 ± 0.261
1.605MetGly: 1.605 ± 0.485
0.628MetHis: 0.628 ± 0.226
1.047MetIle: 1.047 ± 0.211
2.791MetLys: 2.791 ± 0.504
2.093MetLeu: 2.093 ± 0.468
0.279MetMet: 0.279 ± 0.125
2.163MetAsn: 2.163 ± 0.483
1.047MetPro: 1.047 ± 0.255
1.326MetGln: 1.326 ± 0.326
0.907MetArg: 0.907 ± 0.225
2.233MetSer: 2.233 ± 0.463
2.582MetThr: 2.582 ± 0.426
0.907MetVal: 0.907 ± 0.204
0.279MetTrp: 0.279 ± 0.118
0.977MetTyr: 0.977 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
3.419AsnAla: 3.419 ± 0.547
0.279AsnCys: 0.279 ± 0.157
4.187AsnAsp: 4.187 ± 0.487
4.675AsnGlu: 4.675 ± 0.654
1.605AsnPhe: 1.605 ± 0.267
3.977AsnGly: 3.977 ± 0.552
1.047AsnHis: 1.047 ± 0.307
4.187AsnIle: 4.187 ± 0.488
6.978AsnLys: 6.978 ± 0.76
5.513AsnLeu: 5.513 ± 0.672
0.768AsnMet: 0.768 ± 0.18
3.768AsnAsn: 3.768 ± 0.569
2.372AsnPro: 2.372 ± 0.321
2.512AsnGln: 2.512 ± 0.458
2.861AsnArg: 2.861 ± 0.451
3.977AsnSer: 3.977 ± 0.562
4.257AsnThr: 4.257 ± 0.497
3.28AsnVal: 3.28 ± 0.613
0.977AsnTrp: 0.977 ± 0.293
2.303AsnTyr: 2.303 ± 0.412
0.0AsnXaa: 0.0 ± 0.0
Pro
1.396ProAla: 1.396 ± 0.342
0.209ProCys: 0.209 ± 0.111
1.396ProAsp: 1.396 ± 0.343
2.093ProGlu: 2.093 ± 0.387
1.116ProPhe: 1.116 ± 0.236
1.605ProGly: 1.605 ± 0.337
0.349ProHis: 0.349 ± 0.13
1.814ProIle: 1.814 ± 0.327
2.093ProLys: 2.093 ± 0.429
2.233ProLeu: 2.233 ± 0.416
0.628ProMet: 0.628 ± 0.235
1.744ProAsn: 1.744 ± 0.354
0.558ProPro: 0.558 ± 0.19
1.396ProGln: 1.396 ± 0.363
1.116ProArg: 1.116 ± 0.273
2.233ProSer: 2.233 ± 0.377
1.744ProThr: 1.744 ± 0.363
1.396ProVal: 1.396 ± 0.371
0.349ProTrp: 0.349 ± 0.132
1.047ProTyr: 1.047 ± 0.275
0.0ProXaa: 0.0 ± 0.0
Gln
2.372GlnAla: 2.372 ± 0.345
0.349GlnCys: 0.349 ± 0.169
2.372GlnAsp: 2.372 ± 0.392
2.791GlnGlu: 2.791 ± 0.446
1.465GlnPhe: 1.465 ± 0.274
2.093GlnGly: 2.093 ± 0.414
0.628GlnHis: 0.628 ± 0.169
2.652GlnIle: 2.652 ± 0.528
3.28GlnLys: 3.28 ± 0.431
3.07GlnLeu: 3.07 ± 0.458
0.907GlnMet: 0.907 ± 0.26
1.465GlnAsn: 1.465 ± 0.268
1.047GlnPro: 1.047 ± 0.278
1.116GlnGln: 1.116 ± 0.32
2.163GlnArg: 2.163 ± 0.41
2.024GlnSer: 2.024 ± 0.331
1.396GlnThr: 1.396 ± 0.384
2.512GlnVal: 2.512 ± 0.506
0.558GlnTrp: 0.558 ± 0.178
1.605GlnTyr: 1.605 ± 0.331
0.0GlnXaa: 0.0 ± 0.0
Arg
2.303ArgAla: 2.303 ± 0.443
0.14ArgCys: 0.14 ± 0.115
3.419ArgAsp: 3.419 ± 0.389
2.582ArgGlu: 2.582 ± 0.472
2.442ArgPhe: 2.442 ± 0.377
2.024ArgGly: 2.024 ± 0.341
0.698ArgHis: 0.698 ± 0.216
3.489ArgIle: 3.489 ± 0.455
3.908ArgLys: 3.908 ± 0.439
3.698ArgLeu: 3.698 ± 0.63
0.907ArgMet: 0.907 ± 0.212
2.442ArgAsn: 2.442 ± 0.377
0.558ArgPro: 0.558 ± 0.28
1.256ArgGln: 1.256 ± 0.284
1.675ArgArg: 1.675 ± 0.309
1.954ArgSer: 1.954 ± 0.318
2.233ArgThr: 2.233 ± 0.368
2.233ArgVal: 2.233 ± 0.364
0.419ArgTrp: 0.419 ± 0.175
2.233ArgTyr: 2.233 ± 0.536
0.0ArgXaa: 0.0 ± 0.0
Ser
4.326SerAla: 4.326 ± 0.706
0.279SerCys: 0.279 ± 0.142
4.605SerAsp: 4.605 ± 0.501
4.396SerGlu: 4.396 ± 0.543
2.931SerPhe: 2.931 ± 0.519
4.257SerGly: 4.257 ± 0.791
0.977SerHis: 0.977 ± 0.222
4.257SerIle: 4.257 ± 0.545
6.35SerLys: 6.35 ± 1.052
3.908SerLeu: 3.908 ± 0.563
2.024SerMet: 2.024 ± 0.339
4.466SerAsn: 4.466 ± 0.543
1.744SerPro: 1.744 ± 0.379
2.442SerGln: 2.442 ± 0.344
2.233SerArg: 2.233 ± 0.379
4.745SerSer: 4.745 ± 0.786
3.419SerThr: 3.419 ± 0.515
4.047SerVal: 4.047 ± 0.533
0.837SerTrp: 0.837 ± 0.231
1.814SerTyr: 1.814 ± 0.334
0.0SerXaa: 0.0 ± 0.0
Thr
3.559ThrAla: 3.559 ± 0.508
0.279ThrCys: 0.279 ± 0.131
3.28ThrAsp: 3.28 ± 0.46
4.117ThrGlu: 4.117 ± 0.597
3.0ThrPhe: 3.0 ± 0.369
4.605ThrGly: 4.605 ± 0.587
1.465ThrHis: 1.465 ± 0.314
4.466ThrIle: 4.466 ± 0.556
5.513ThrLys: 5.513 ± 0.858
4.326ThrLeu: 4.326 ± 0.552
1.047ThrMet: 1.047 ± 0.277
3.559ThrAsn: 3.559 ± 0.584
2.163ThrPro: 2.163 ± 0.33
1.884ThrGln: 1.884 ± 0.287
2.303ThrArg: 2.303 ± 0.405
3.28ThrSer: 3.28 ± 0.47
2.931ThrThr: 2.931 ± 0.578
4.536ThrVal: 4.536 ± 0.505
0.488ThrTrp: 0.488 ± 0.251
2.303ThrTyr: 2.303 ± 0.391
0.0ThrXaa: 0.0 ± 0.0
Val
3.0ValAla: 3.0 ± 0.44
0.558ValCys: 0.558 ± 0.245
3.908ValAsp: 3.908 ± 0.554
4.954ValGlu: 4.954 ± 0.557
2.233ValPhe: 2.233 ± 0.38
3.559ValGly: 3.559 ± 0.595
1.047ValHis: 1.047 ± 0.196
4.117ValIle: 4.117 ± 0.508
4.954ValLys: 4.954 ± 0.536
4.885ValLeu: 4.885 ± 0.497
1.465ValMet: 1.465 ± 0.287
3.28ValAsn: 3.28 ± 0.581
1.396ValPro: 1.396 ± 0.304
2.093ValGln: 2.093 ± 0.356
2.372ValArg: 2.372 ± 0.403
4.187ValSer: 4.187 ± 0.579
3.559ValThr: 3.559 ± 0.588
3.21ValVal: 3.21 ± 0.512
0.488ValTrp: 0.488 ± 0.175
2.093ValTyr: 2.093 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
0.698TrpAla: 0.698 ± 0.175
0.0TrpCys: 0.0 ± 0.0
0.558TrpAsp: 0.558 ± 0.192
0.837TrpGlu: 0.837 ± 0.312
1.186TrpPhe: 1.186 ± 0.305
0.419TrpGly: 0.419 ± 0.177
0.0TrpHis: 0.0 ± 0.0
0.837TrpIle: 0.837 ± 0.281
0.837TrpLys: 0.837 ± 0.276
1.047TrpLeu: 1.047 ± 0.2
0.419TrpMet: 0.419 ± 0.177
0.977TrpAsn: 0.977 ± 0.22
0.279TrpPro: 0.279 ± 0.189
0.488TrpGln: 0.488 ± 0.142
0.419TrpArg: 0.419 ± 0.162
1.047TrpSer: 1.047 ± 0.412
0.768TrpThr: 0.768 ± 0.208
0.907TrpVal: 0.907 ± 0.261
0.14TrpTrp: 0.14 ± 0.104
0.558TrpTyr: 0.558 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.303TyrAla: 2.303 ± 0.29
0.558TyrCys: 0.558 ± 0.19
2.791TyrAsp: 2.791 ± 0.615
2.721TyrGlu: 2.721 ± 0.522
1.675TyrPhe: 1.675 ± 0.359
2.652TyrGly: 2.652 ± 0.505
1.186TyrHis: 1.186 ± 0.29
2.721TyrIle: 2.721 ± 0.533
3.489TyrLys: 3.489 ± 0.482
3.768TyrLeu: 3.768 ± 0.582
1.396TyrMet: 1.396 ± 0.277
2.582TyrAsn: 2.582 ± 0.578
0.977TyrPro: 0.977 ± 0.282
1.814TyrGln: 1.814 ± 0.292
1.814TyrArg: 1.814 ± 0.439
3.0TyrSer: 3.0 ± 0.491
2.652TyrThr: 2.652 ± 0.481
2.791TyrVal: 2.791 ± 0.483
0.558TyrTrp: 0.558 ± 0.155
1.465TyrTyr: 1.465 ± 0.412
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (14332 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski