Amino acid dipepetide frequency for Staphylococcus virus 77

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.875AlaAla: 1.875 ± 0.696
0.234AlaCys: 0.234 ± 0.165
2.343AlaAsp: 2.343 ± 0.354
4.218AlaGlu: 4.218 ± 0.672
2.734AlaPhe: 2.734 ± 0.579
3.984AlaGly: 3.984 ± 0.689
0.703AlaHis: 0.703 ± 0.237
4.999AlaIle: 4.999 ± 0.562
4.843AlaLys: 4.843 ± 0.553
4.609AlaLeu: 4.609 ± 0.476
1.64AlaMet: 1.64 ± 0.388
2.812AlaAsn: 2.812 ± 0.503
1.64AlaPro: 1.64 ± 0.281
2.734AlaGln: 2.734 ± 0.588
2.734AlaArg: 2.734 ± 0.384
3.515AlaSer: 3.515 ± 0.649
3.125AlaThr: 3.125 ± 0.716
3.281AlaVal: 3.281 ± 0.513
0.781AlaTrp: 0.781 ± 0.219
2.656AlaTyr: 2.656 ± 0.396
0.0AlaXaa: 0.0 ± 0.0
Cys
0.234CysAla: 0.234 ± 0.129
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.391CysGlu: 0.391 ± 0.202
0.391CysPhe: 0.391 ± 0.209
0.391CysGly: 0.391 ± 0.19
0.156CysHis: 0.156 ± 0.101
0.859CysIle: 0.859 ± 0.242
0.547CysLys: 0.547 ± 0.211
0.234CysLeu: 0.234 ± 0.137
0.0CysMet: 0.0 ± 0.0
0.312CysAsn: 0.312 ± 0.128
0.156CysPro: 0.156 ± 0.11
0.156CysGln: 0.156 ± 0.098
0.234CysArg: 0.234 ± 0.19
0.234CysSer: 0.234 ± 0.123
0.234CysThr: 0.234 ± 0.119
0.234CysVal: 0.234 ± 0.143
0.234CysTrp: 0.234 ± 0.153
0.234CysTyr: 0.234 ± 0.131
0.0CysXaa: 0.0 ± 0.0
Asp
3.125AspAla: 3.125 ± 0.39
0.469AspCys: 0.469 ± 0.188
4.452AspAsp: 4.452 ± 0.862
5.155AspGlu: 5.155 ± 0.915
3.203AspPhe: 3.203 ± 0.51
4.843AspGly: 4.843 ± 0.7
0.547AspHis: 0.547 ± 0.217
5.624AspIle: 5.624 ± 0.655
5.546AspLys: 5.546 ± 0.759
5.624AspLeu: 5.624 ± 0.701
2.187AspMet: 2.187 ± 0.399
4.14AspAsn: 4.14 ± 0.605
1.562AspPro: 1.562 ± 0.325
0.781AspGln: 0.781 ± 0.218
2.187AspArg: 2.187 ± 0.524
3.437AspSer: 3.437 ± 0.514
2.89AspThr: 2.89 ± 0.531
3.671AspVal: 3.671 ± 0.582
0.703AspTrp: 0.703 ± 0.226
2.812AspTyr: 2.812 ± 0.493
0.0AspXaa: 0.0 ± 0.0
Glu
4.296GluAla: 4.296 ± 0.704
0.625GluCys: 0.625 ± 0.207
4.14GluAsp: 4.14 ± 0.622
6.327GluGlu: 6.327 ± 0.989
3.281GluPhe: 3.281 ± 0.427
2.812GluGly: 2.812 ± 0.627
1.094GluHis: 1.094 ± 0.256
6.874GluIle: 6.874 ± 0.788
7.968GluLys: 7.968 ± 0.973
7.108GluLeu: 7.108 ± 0.801
2.421GluMet: 2.421 ± 0.451
4.843GluAsn: 4.843 ± 0.548
1.562GluPro: 1.562 ± 0.291
3.281GluGln: 3.281 ± 0.503
4.296GluArg: 4.296 ± 0.644
3.671GluSer: 3.671 ± 0.568
3.984GluThr: 3.984 ± 0.862
4.765GluVal: 4.765 ± 0.598
0.859GluTrp: 0.859 ± 0.179
3.984GluTyr: 3.984 ± 0.648
0.0GluXaa: 0.0 ± 0.0
Phe
3.203PheAla: 3.203 ± 0.589
0.234PheCys: 0.234 ± 0.131
2.656PheAsp: 2.656 ± 0.387
3.203PheGlu: 3.203 ± 0.432
1.172PhePhe: 1.172 ± 0.291
2.5PheGly: 2.5 ± 0.32
0.703PheHis: 0.703 ± 0.208
3.281PheIle: 3.281 ± 0.499
4.062PheLys: 4.062 ± 0.633
2.421PheLeu: 2.421 ± 0.374
1.484PheMet: 1.484 ± 0.392
3.515PheAsn: 3.515 ± 0.441
0.937PhePro: 0.937 ± 0.297
0.703PheGln: 0.703 ± 0.205
1.718PheArg: 1.718 ± 0.308
3.828PheSer: 3.828 ± 0.798
2.656PheThr: 2.656 ± 0.487
2.812PheVal: 2.812 ± 0.501
0.078PheTrp: 0.078 ± 0.083
1.562PheTyr: 1.562 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
2.812GlyAla: 2.812 ± 0.678
0.312GlyCys: 0.312 ± 0.167
3.203GlyAsp: 3.203 ± 0.567
3.984GlyGlu: 3.984 ± 0.797
2.734GlyPhe: 2.734 ± 0.48
3.828GlyGly: 3.828 ± 0.995
1.562GlyHis: 1.562 ± 0.385
4.687GlyIle: 4.687 ± 0.812
6.483GlyLys: 6.483 ± 0.66
4.452GlyLeu: 4.452 ± 0.951
1.328GlyMet: 1.328 ± 0.308
3.203GlyAsn: 3.203 ± 0.591
1.328GlyPro: 1.328 ± 0.347
1.875GlyGln: 1.875 ± 0.505
2.265GlyArg: 2.265 ± 0.475
3.125GlySer: 3.125 ± 0.519
2.578GlyThr: 2.578 ± 0.471
3.671GlyVal: 3.671 ± 0.64
0.781GlyTrp: 0.781 ± 0.289
3.359GlyTyr: 3.359 ± 0.592
0.0GlyXaa: 0.0 ± 0.0
His
1.25HisAla: 1.25 ± 0.374
0.0HisCys: 0.0 ± 0.0
0.781HisAsp: 0.781 ± 0.263
0.937HisGlu: 0.937 ± 0.231
1.406HisPhe: 1.406 ± 0.32
0.859HisGly: 0.859 ± 0.294
0.312HisHis: 0.312 ± 0.189
1.797HisIle: 1.797 ± 0.382
1.406HisLys: 1.406 ± 0.381
1.328HisLeu: 1.328 ± 0.265
0.156HisMet: 0.156 ± 0.098
0.937HisAsn: 0.937 ± 0.302
0.547HisPro: 0.547 ± 0.163
0.703HisGln: 0.703 ± 0.198
0.312HisArg: 0.312 ± 0.169
0.703HisSer: 0.703 ± 0.191
0.859HisThr: 0.859 ± 0.321
1.172HisVal: 1.172 ± 0.34
0.078HisTrp: 0.078 ± 0.078
1.094HisTyr: 1.094 ± 0.234
0.0HisXaa: 0.0 ± 0.0
Ile
5.312IleAla: 5.312 ± 0.865
0.312IleCys: 0.312 ± 0.134
6.015IleAsp: 6.015 ± 0.751
7.499IleGlu: 7.499 ± 0.714
3.203IlePhe: 3.203 ± 0.504
3.515IleGly: 3.515 ± 0.614
1.562IleHis: 1.562 ± 0.439
5.468IleIle: 5.468 ± 0.73
10.233IleLys: 10.233 ± 0.848
4.687IleLeu: 4.687 ± 0.582
1.718IleMet: 1.718 ± 0.339
6.483IleAsn: 6.483 ± 0.888
2.421IlePro: 2.421 ± 0.509
2.968IleGln: 2.968 ± 0.419
3.281IleArg: 3.281 ± 0.539
4.921IleSer: 4.921 ± 0.593
4.374IleThr: 4.374 ± 0.648
5.155IleVal: 5.155 ± 0.6
0.859IleTrp: 0.859 ± 0.426
2.343IleTyr: 2.343 ± 0.521
0.0IleXaa: 0.0 ± 0.0
Lys
6.093LysAla: 6.093 ± 0.618
0.547LysCys: 0.547 ± 0.197
6.093LysAsp: 6.093 ± 0.572
9.061LysGlu: 9.061 ± 0.969
4.531LysPhe: 4.531 ± 0.584
5.468LysGly: 5.468 ± 0.857
1.797LysHis: 1.797 ± 0.443
7.421LysIle: 7.421 ± 0.699
8.749LysLys: 8.749 ± 0.904
8.514LysLeu: 8.514 ± 0.807
2.421LysMet: 2.421 ± 0.468
5.78LysAsn: 5.78 ± 0.897
2.656LysPro: 2.656 ± 0.492
4.452LysGln: 4.452 ± 0.533
4.374LysArg: 4.374 ± 0.519
4.14LysSer: 4.14 ± 0.631
5.39LysThr: 5.39 ± 0.806
6.171LysVal: 6.171 ± 0.737
1.406LysTrp: 1.406 ± 0.259
4.687LysTyr: 4.687 ± 0.705
0.0LysXaa: 0.0 ± 0.0
Leu
3.359LeuAla: 3.359 ± 0.483
0.547LeuCys: 0.547 ± 0.2
4.765LeuAsp: 4.765 ± 0.535
6.249LeuGlu: 6.249 ± 0.765
3.203LeuPhe: 3.203 ± 0.447
3.828LeuGly: 3.828 ± 0.601
1.25LeuHis: 1.25 ± 0.323
5.937LeuIle: 5.937 ± 0.809
8.749LeuLys: 8.749 ± 0.911
6.718LeuLeu: 6.718 ± 0.872
1.562LeuMet: 1.562 ± 0.404
6.249LeuAsn: 6.249 ± 0.636
2.343LeuPro: 2.343 ± 0.437
2.656LeuGln: 2.656 ± 0.321
3.359LeuArg: 3.359 ± 0.505
5.937LeuSer: 5.937 ± 0.728
3.984LeuThr: 3.984 ± 0.603
3.984LeuVal: 3.984 ± 0.495
0.625LeuTrp: 0.625 ± 0.225
3.671LeuTyr: 3.671 ± 0.628
0.0LeuXaa: 0.0 ± 0.0
Met
1.015MetAla: 1.015 ± 0.339
0.156MetCys: 0.156 ± 0.125
1.562MetAsp: 1.562 ± 0.333
1.172MetGlu: 1.172 ± 0.346
0.937MetPhe: 0.937 ± 0.185
1.406MetGly: 1.406 ± 0.5
0.391MetHis: 0.391 ± 0.229
2.031MetIle: 2.031 ± 0.402
2.343MetLys: 2.343 ± 0.501
2.265MetLeu: 2.265 ± 0.339
0.703MetMet: 0.703 ± 0.231
2.265MetAsn: 2.265 ± 0.435
0.937MetPro: 0.937 ± 0.274
1.25MetGln: 1.25 ± 0.336
1.25MetArg: 1.25 ± 0.275
1.64MetSer: 1.64 ± 0.328
1.953MetThr: 1.953 ± 0.422
1.094MetVal: 1.094 ± 0.245
0.391MetTrp: 0.391 ± 0.159
0.937MetTyr: 0.937 ± 0.301
0.0MetXaa: 0.0 ± 0.0
Asn
3.671AsnAla: 3.671 ± 0.59
0.312AsnCys: 0.312 ± 0.161
4.374AsnAsp: 4.374 ± 0.604
3.828AsnGlu: 3.828 ± 0.552
1.718AsnPhe: 1.718 ± 0.537
5.155AsnGly: 5.155 ± 0.564
0.937AsnHis: 0.937 ± 0.243
4.452AsnIle: 4.452 ± 0.486
8.28AsnLys: 8.28 ± 1.085
4.609AsnLeu: 4.609 ± 0.518
2.031AsnMet: 2.031 ± 0.299
5.155AsnAsn: 5.155 ± 0.665
2.968AsnPro: 2.968 ± 0.333
2.5AsnGln: 2.5 ± 0.459
2.968AsnArg: 2.968 ± 0.553
3.515AsnSer: 3.515 ± 0.504
3.671AsnThr: 3.671 ± 0.502
3.828AsnVal: 3.828 ± 0.664
0.859AsnTrp: 0.859 ± 0.374
2.265AsnTyr: 2.265 ± 0.362
0.0AsnXaa: 0.0 ± 0.0
Pro
1.015ProAla: 1.015 ± 0.274
0.0ProCys: 0.0 ± 0.0
1.484ProAsp: 1.484 ± 0.255
2.5ProGlu: 2.5 ± 0.426
1.562ProPhe: 1.562 ± 0.287
0.859ProGly: 0.859 ± 0.271
0.469ProHis: 0.469 ± 0.159
2.5ProIle: 2.5 ± 0.428
3.359ProLys: 3.359 ± 0.71
1.562ProLeu: 1.562 ± 0.401
0.781ProMet: 0.781 ± 0.184
1.25ProAsn: 1.25 ± 0.323
0.937ProPro: 0.937 ± 0.198
1.015ProGln: 1.015 ± 0.254
0.859ProArg: 0.859 ± 0.184
2.109ProSer: 2.109 ± 0.426
1.875ProThr: 1.875 ± 0.462
1.484ProVal: 1.484 ± 0.298
0.312ProTrp: 0.312 ± 0.163
1.094ProTyr: 1.094 ± 0.286
0.0ProXaa: 0.0 ± 0.0
Gln
3.046GlnAla: 3.046 ± 0.484
0.312GlnCys: 0.312 ± 0.142
1.875GlnAsp: 1.875 ± 0.453
2.656GlnGlu: 2.656 ± 0.638
1.406GlnPhe: 1.406 ± 0.348
2.187GlnGly: 2.187 ± 0.412
0.937GlnHis: 0.937 ± 0.222
2.578GlnIle: 2.578 ± 0.359
3.281GlnLys: 3.281 ± 0.547
2.578GlnLeu: 2.578 ± 0.421
0.781GlnMet: 0.781 ± 0.251
2.421GlnAsn: 2.421 ± 0.442
1.015GlnPro: 1.015 ± 0.238
1.562GlnGln: 1.562 ± 0.447
1.875GlnArg: 1.875 ± 0.313
1.718GlnSer: 1.718 ± 0.385
1.797GlnThr: 1.797 ± 0.326
2.109GlnVal: 2.109 ± 0.451
0.391GlnTrp: 0.391 ± 0.156
1.562GlnTyr: 1.562 ± 0.276
0.0GlnXaa: 0.0 ± 0.0
Arg
2.5ArgAla: 2.5 ± 0.377
0.234ArgCys: 0.234 ± 0.134
2.812ArgAsp: 2.812 ± 0.431
4.296ArgGlu: 4.296 ± 0.624
1.64ArgPhe: 1.64 ± 0.303
1.953ArgGly: 1.953 ± 0.389
0.703ArgHis: 0.703 ± 0.237
3.593ArgIle: 3.593 ± 0.634
3.749ArgLys: 3.749 ± 0.604
3.984ArgLeu: 3.984 ± 0.44
1.406ArgMet: 1.406 ± 0.298
3.203ArgAsn: 3.203 ± 0.382
0.625ArgPro: 0.625 ± 0.222
1.328ArgGln: 1.328 ± 0.432
1.875ArgArg: 1.875 ± 0.392
1.172ArgSer: 1.172 ± 0.237
1.718ArgThr: 1.718 ± 0.486
2.343ArgVal: 2.343 ± 0.369
0.469ArgTrp: 0.469 ± 0.167
2.343ArgTyr: 2.343 ± 0.607
0.0ArgXaa: 0.0 ± 0.0
Ser
3.437SerAla: 3.437 ± 0.535
0.156SerCys: 0.156 ± 0.116
4.531SerAsp: 4.531 ± 0.586
4.999SerGlu: 4.999 ± 0.623
2.421SerPhe: 2.421 ± 0.5
3.828SerGly: 3.828 ± 0.711
0.859SerHis: 0.859 ± 0.199
5.702SerIle: 5.702 ± 0.956
4.452SerLys: 4.452 ± 0.572
3.984SerLeu: 3.984 ± 0.561
1.406SerMet: 1.406 ± 0.283
4.296SerAsn: 4.296 ± 0.566
1.094SerPro: 1.094 ± 0.297
2.109SerGln: 2.109 ± 0.443
1.64SerArg: 1.64 ± 0.307
3.437SerSer: 3.437 ± 0.49
3.203SerThr: 3.203 ± 0.556
2.578SerVal: 2.578 ± 0.353
0.156SerTrp: 0.156 ± 0.099
2.031SerTyr: 2.031 ± 0.367
0.0SerXaa: 0.0 ± 0.0
Thr
3.203ThrAla: 3.203 ± 0.694
0.234ThrCys: 0.234 ± 0.116
4.218ThrAsp: 4.218 ± 0.835
3.203ThrGlu: 3.203 ± 0.415
2.343ThrPhe: 2.343 ± 0.435
3.593ThrGly: 3.593 ± 0.681
1.094ThrHis: 1.094 ± 0.298
4.609ThrIle: 4.609 ± 0.647
4.14ThrLys: 4.14 ± 0.517
4.609ThrLeu: 4.609 ± 0.53
0.937ThrMet: 0.937 ± 0.209
2.968ThrAsn: 2.968 ± 0.533
1.953ThrPro: 1.953 ± 0.299
1.406ThrGln: 1.406 ± 0.333
2.109ThrArg: 2.109 ± 0.463
3.593ThrSer: 3.593 ± 0.602
2.812ThrThr: 2.812 ± 0.547
3.125ThrVal: 3.125 ± 0.522
0.781ThrTrp: 0.781 ± 0.212
2.734ThrTyr: 2.734 ± 0.53
0.0ThrXaa: 0.0 ± 0.0
Val
3.359ValAla: 3.359 ± 0.585
0.156ValCys: 0.156 ± 0.122
3.671ValAsp: 3.671 ± 0.504
4.452ValGlu: 4.452 ± 0.726
2.031ValPhe: 2.031 ± 0.389
3.437ValGly: 3.437 ± 0.566
0.859ValHis: 0.859 ± 0.248
4.765ValIle: 4.765 ± 0.582
6.483ValLys: 6.483 ± 0.498
4.999ValLeu: 4.999 ± 0.474
1.484ValMet: 1.484 ± 0.329
3.515ValAsn: 3.515 ± 0.414
1.328ValPro: 1.328 ± 0.325
1.953ValGln: 1.953 ± 0.44
2.187ValArg: 2.187 ± 0.37
2.812ValSer: 2.812 ± 0.496
3.828ValThr: 3.828 ± 0.516
3.593ValVal: 3.593 ± 0.55
0.547ValTrp: 0.547 ± 0.255
2.578ValTyr: 2.578 ± 0.404
0.0ValXaa: 0.0 ± 0.0
Trp
0.469TrpAla: 0.469 ± 0.169
0.0TrpCys: 0.0 ± 0.0
0.859TrpAsp: 0.859 ± 0.235
0.937TrpGlu: 0.937 ± 0.216
0.781TrpPhe: 0.781 ± 0.197
0.625TrpGly: 0.625 ± 0.218
0.0TrpHis: 0.0 ± 0.0
1.172TrpIle: 1.172 ± 0.262
0.937TrpLys: 0.937 ± 0.275
0.937TrpLeu: 0.937 ± 0.229
0.391TrpMet: 0.391 ± 0.144
0.781TrpAsn: 0.781 ± 0.269
0.078TrpPro: 0.078 ± 0.065
0.469TrpGln: 0.469 ± 0.171
0.391TrpArg: 0.391 ± 0.173
0.547TrpSer: 0.547 ± 0.264
0.391TrpThr: 0.391 ± 0.129
0.625TrpVal: 0.625 ± 0.156
0.078TrpTrp: 0.078 ± 0.066
0.469TrpTyr: 0.469 ± 0.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.187TyrAla: 2.187 ± 0.315
0.391TyrCys: 0.391 ± 0.185
3.125TyrAsp: 3.125 ± 0.634
3.125TyrGlu: 3.125 ± 0.514
2.031TyrPhe: 2.031 ± 0.482
2.5TyrGly: 2.5 ± 0.562
0.703TyrHis: 0.703 ± 0.225
3.984TyrIle: 3.984 ± 0.675
4.14TyrLys: 4.14 ± 0.553
3.828TyrLeu: 3.828 ± 0.615
0.781TyrMet: 0.781 ± 0.26
2.968TyrAsn: 2.968 ± 0.355
0.937TyrPro: 0.937 ± 0.234
2.187TyrGln: 2.187 ± 0.359
2.031TyrArg: 2.031 ± 0.403
2.265TyrSer: 2.265 ± 0.448
2.343TyrThr: 2.343 ± 0.401
2.343TyrVal: 2.343 ± 0.415
0.547TyrTrp: 0.547 ± 0.187
0.937TyrTyr: 0.937 ± 0.288
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (12803 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski