Amino acid dipeptide frequency for Staphylococcus virus phi12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
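A minimal sketch of how such a comparison could be made is given here. It is illustrative only and is not the database's own code: the function names are hypothetical, sequences use one-letter residue codes rather than the three-letter codes of the table, and dividing by the total number of counted pairs is an assumption (the database may normalise differently).

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Pairs are overlapping windows of two residues inside each protein;
    a pair never spans the boundary between two proteins.
    Normalising by the number of counted pairs is an assumption.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked red in the table
    if freq_permille < expected:
        return "under"  # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    toy_proteome = ["AKAK", "KA"]  # hypothetical toy sequences
    for pair, freq in sorted(dipeptide_permille(toy_proteome).items()):
        print(pair, round(freq, 3), classify(freq))
```

With the values from the table, `classify(8.712)` (GluLys) falls above the 2.5‰ expectation and `classify(0.251)` (AlaCys) falls below it.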

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
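As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and exist only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    glu_lys = 8.712  # GluLys value taken from the table, in per mille
    print(permille_to_fraction(glu_lys))
    print(permille_to_percent(glu_lys))
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰, which is the number the table values should be compared against.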

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.597AlaAla: 2.597 ± 0.859
0.251AlaCys: 0.251 ± 0.141
2.513AlaAsp: 2.513 ± 0.623
4.44AlaGlu: 4.44 ± 0.592
1.843AlaPhe: 1.843 ± 0.408
3.854AlaGly: 3.854 ± 0.862
1.005AlaHis: 1.005 ± 0.287
4.44AlaIle: 4.44 ± 0.886
6.786AlaLys: 6.786 ± 1.274
5.361AlaLeu: 5.361 ± 0.813
1.089AlaMet: 1.089 ± 0.272
4.608AlaAsn: 4.608 ± 0.97
1.675AlaPro: 1.675 ± 0.401
1.34AlaGln: 1.34 ± 0.387
2.681AlaArg: 2.681 ± 0.379
4.859AlaSer: 4.859 ± 0.793
3.435AlaThr: 3.435 ± 0.587
3.183AlaVal: 3.183 ± 0.471
1.173AlaTrp: 1.173 ± 0.43
2.513AlaTyr: 2.513 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
0.084CysAla: 0.084 ± 0.086
0.0CysCys: 0.0 ± 0.0
0.084CysAsp: 0.084 ± 0.071
0.419CysGlu: 0.419 ± 0.252
0.251CysPhe: 0.251 ± 0.201
0.168CysGly: 0.168 ± 0.108
0.168CysHis: 0.168 ± 0.149
0.419CysIle: 0.419 ± 0.179
0.586CysLys: 0.586 ± 0.274
0.586CysLeu: 0.586 ± 0.212
0.251CysMet: 0.251 ± 0.21
0.168CysAsn: 0.168 ± 0.14
0.084CysPro: 0.084 ± 0.093
0.168CysGln: 0.168 ± 0.126
0.419CysArg: 0.419 ± 0.25
0.335CysSer: 0.335 ± 0.182
0.251CysThr: 0.251 ± 0.153
0.168CysVal: 0.168 ± 0.11
0.0CysTrp: 0.0 ± 0.0
0.335CysTyr: 0.335 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
2.681AspAla: 2.681 ± 0.71
0.419AspCys: 0.419 ± 0.222
4.105AspAsp: 4.105 ± 0.774
4.943AspGlu: 4.943 ± 0.903
3.267AspPhe: 3.267 ± 0.591
4.021AspGly: 4.021 ± 0.759
0.754AspHis: 0.754 ± 0.244
6.032AspIle: 6.032 ± 0.826
6.618AspLys: 6.618 ± 0.751
6.367AspLeu: 6.367 ± 0.679
1.843AspMet: 1.843 ± 0.419
2.848AspAsn: 2.848 ± 0.511
0.838AspPro: 0.838 ± 0.214
0.922AspGln: 0.922 ± 0.261
2.178AspArg: 2.178 ± 0.433
3.435AspSer: 3.435 ± 0.675
3.602AspThr: 3.602 ± 0.479
4.272AspVal: 4.272 ± 0.601
0.754AspTrp: 0.754 ± 0.283
2.681AspTyr: 2.681 ± 0.578
0.0AspXaa: 0.0 ± 0.0
Glu
4.272GluAla: 4.272 ± 0.6
0.251GluCys: 0.251 ± 0.16
4.524GluAsp: 4.524 ± 0.745
6.367GluGlu: 6.367 ± 1.22
2.765GluPhe: 2.765 ± 0.467
3.854GluGly: 3.854 ± 0.633
0.922GluHis: 0.922 ± 0.253
5.11GluIle: 5.11 ± 0.904
8.712GluLys: 8.712 ± 1.013
7.958GluLeu: 7.958 ± 0.886
2.597GluMet: 2.597 ± 0.404
4.943GluAsn: 4.943 ± 0.685
1.257GluPro: 1.257 ± 0.436
3.1GluGln: 3.1 ± 0.566
3.351GluArg: 3.351 ± 0.783
3.435GluSer: 3.435 ± 0.564
4.021GluThr: 4.021 ± 0.631
3.77GluVal: 3.77 ± 0.51
0.922GluTrp: 0.922 ± 0.239
2.681GluTyr: 2.681 ± 0.743
0.0GluXaa: 0.0 ± 0.0
Phe
1.675PheAla: 1.675 ± 0.385
0.168PheCys: 0.168 ± 0.135
3.1PheAsp: 3.1 ± 0.562
2.848PheGlu: 2.848 ± 0.579
0.838PhePhe: 0.838 ± 0.221
3.1PheGly: 3.1 ± 0.651
0.586PheHis: 0.586 ± 0.199
3.1PheIle: 3.1 ± 0.622
4.356PheLys: 4.356 ± 0.575
2.094PheLeu: 2.094 ± 0.368
0.922PheMet: 0.922 ± 0.262
3.686PheAsn: 3.686 ± 0.577
0.838PhePro: 0.838 ± 0.329
1.089PheGln: 1.089 ± 0.26
1.089PheArg: 1.089 ± 0.278
2.094PheSer: 2.094 ± 0.414
1.843PheThr: 1.843 ± 0.401
1.927PheVal: 1.927 ± 0.489
0.168PheTrp: 0.168 ± 0.108
2.011PheTyr: 2.011 ± 0.447
0.0PheXaa: 0.0 ± 0.0
Gly
4.44GlyAla: 4.44 ± 1.091
0.251GlyCys: 0.251 ± 0.156
4.608GlyAsp: 4.608 ± 0.424
4.021GlyGlu: 4.021 ± 0.445
2.513GlyPhe: 2.513 ± 0.352
5.361GlyGly: 5.361 ± 1.32
1.257GlyHis: 1.257 ± 0.361
3.77GlyIle: 3.77 ± 0.664
5.613GlyLys: 5.613 ± 0.671
5.948GlyLeu: 5.948 ± 1.132
1.424GlyMet: 1.424 ± 0.478
3.518GlyAsn: 3.518 ± 0.567
1.005GlyPro: 1.005 ± 0.262
1.759GlyGln: 1.759 ± 0.438
2.178GlyArg: 2.178 ± 0.505
3.686GlySer: 3.686 ± 0.759
3.77GlyThr: 3.77 ± 0.681
4.524GlyVal: 4.524 ± 0.795
0.838GlyTrp: 0.838 ± 0.286
2.765GlyTyr: 2.765 ± 0.6
0.0GlyXaa: 0.0 ± 0.0
His
1.005HisAla: 1.005 ± 0.25
0.084HisCys: 0.084 ± 0.093
0.838HisAsp: 0.838 ± 0.329
0.838HisGlu: 0.838 ± 0.209
0.586HisPhe: 0.586 ± 0.2
1.424HisGly: 1.424 ± 0.344
0.251HisHis: 0.251 ± 0.146
1.424HisIle: 1.424 ± 0.471
1.173HisLys: 1.173 ± 0.291
1.508HisLeu: 1.508 ± 0.356
0.419HisMet: 0.419 ± 0.227
0.922HisAsn: 0.922 ± 0.264
1.005HisPro: 1.005 ± 0.225
0.586HisGln: 0.586 ± 0.185
0.67HisArg: 0.67 ± 0.26
1.089HisSer: 1.089 ± 0.298
1.005HisThr: 1.005 ± 0.313
0.838HisVal: 0.838 ± 0.286
0.168HisTrp: 0.168 ± 0.123
1.005HisTyr: 1.005 ± 0.322
0.0HisXaa: 0.0 ± 0.0
Ile
4.105IleAla: 4.105 ± 0.634
0.419IleCys: 0.419 ± 0.26
5.361IleAsp: 5.361 ± 0.799
5.529IleGlu: 5.529 ± 0.67
2.681IlePhe: 2.681 ± 0.658
3.1IleGly: 3.1 ± 0.579
1.843IleHis: 1.843 ± 0.441
4.943IleIle: 4.943 ± 0.941
7.623IleLys: 7.623 ± 0.827
4.775IleLeu: 4.775 ± 0.966
1.592IleMet: 1.592 ± 0.405
5.278IleAsn: 5.278 ± 0.611
2.848IlePro: 2.848 ± 0.418
2.011IleGln: 2.011 ± 0.375
3.518IleArg: 3.518 ± 0.517
4.775IleSer: 4.775 ± 0.651
4.105IleThr: 4.105 ± 0.586
3.77IleVal: 3.77 ± 0.713
0.67IleTrp: 0.67 ± 0.274
2.513IleTyr: 2.513 ± 0.494
0.0IleXaa: 0.0 ± 0.0
Lys
8.796LysAla: 8.796 ± 1.297
0.335LysCys: 0.335 ± 0.169
5.278LysAsp: 5.278 ± 0.674
8.294LysGlu: 8.294 ± 0.942
2.094LysPhe: 2.094 ± 0.379
5.864LysGly: 5.864 ± 0.953
1.675LysHis: 1.675 ± 0.356
6.367LysIle: 6.367 ± 0.825
7.204LysLys: 7.204 ± 1.237
8.712LysLeu: 8.712 ± 0.944
2.429LysMet: 2.429 ± 0.44
6.115LysAsn: 6.115 ± 0.68
2.094LysPro: 2.094 ± 0.452
5.278LysGln: 5.278 ± 0.525
3.183LysArg: 3.183 ± 0.865
5.948LysSer: 5.948 ± 1.558
5.361LysThr: 5.361 ± 0.759
5.194LysVal: 5.194 ± 0.751
2.011LysTrp: 2.011 ± 0.519
5.361LysTyr: 5.361 ± 0.966
0.0LysXaa: 0.0 ± 0.0
Leu
4.524LeuAla: 4.524 ± 0.946
0.586LeuCys: 0.586 ± 0.231
5.529LeuAsp: 5.529 ± 1.1
6.367LeuGlu: 6.367 ± 0.779
2.765LeuPhe: 2.765 ± 0.378
4.775LeuGly: 4.775 ± 1.111
1.257LeuHis: 1.257 ± 0.433
6.032LeuIle: 6.032 ± 0.783
9.047LeuLys: 9.047 ± 1.292
6.786LeuLeu: 6.786 ± 0.814
1.927LeuMet: 1.927 ± 0.367
5.78LeuAsn: 5.78 ± 0.719
3.267LeuPro: 3.267 ± 0.704
3.183LeuGln: 3.183 ± 0.672
3.016LeuArg: 3.016 ± 0.525
5.11LeuSer: 5.11 ± 0.716
5.445LeuThr: 5.445 ± 0.802
4.105LeuVal: 4.105 ± 0.578
0.419LeuTrp: 0.419 ± 0.188
3.351LeuTyr: 3.351 ± 0.88
0.0LeuXaa: 0.0 ± 0.0
Met
1.173MetAla: 1.173 ± 0.292
0.168MetCys: 0.168 ± 0.126
1.089MetAsp: 1.089 ± 0.396
1.257MetGlu: 1.257 ± 0.387
0.67MetPhe: 0.67 ± 0.211
1.675MetGly: 1.675 ± 0.591
0.419MetHis: 0.419 ± 0.209
1.257MetIle: 1.257 ± 0.191
2.932MetLys: 2.932 ± 0.564
1.843MetLeu: 1.843 ± 0.445
0.419MetMet: 0.419 ± 0.193
1.927MetAsn: 1.927 ± 0.479
0.922MetPro: 0.922 ± 0.206
1.424MetGln: 1.424 ± 0.434
1.173MetArg: 1.173 ± 0.299
2.346MetSer: 2.346 ± 0.573
2.429MetThr: 2.429 ± 0.399
1.173MetVal: 1.173 ± 0.235
0.168MetTrp: 0.168 ± 0.103
0.922MetTyr: 0.922 ± 0.249
0.0MetXaa: 0.0 ± 0.0
Asn
3.77AsnAla: 3.77 ± 0.6
0.251AsnCys: 0.251 ± 0.176
4.189AsnAsp: 4.189 ± 0.642
4.356AsnGlu: 4.356 ± 0.72
1.675AsnPhe: 1.675 ± 0.311
4.859AsnGly: 4.859 ± 0.723
0.922AsnHis: 0.922 ± 0.33
4.608AsnIle: 4.608 ± 0.55
6.702AsnLys: 6.702 ± 0.852
5.194AsnLeu: 5.194 ± 0.533
1.089AsnMet: 1.089 ± 0.265
3.937AsnAsn: 3.937 ± 0.7
2.513AsnPro: 2.513 ± 0.371
2.765AsnGln: 2.765 ± 0.514
2.848AsnArg: 2.848 ± 0.486
4.608AsnSer: 4.608 ± 0.641
4.021AsnThr: 4.021 ± 0.48
3.686AsnVal: 3.686 ± 0.614
1.005AsnTrp: 1.005 ± 0.374
2.681AsnTyr: 2.681 ± 0.427
0.0AsnXaa: 0.0 ± 0.0
Pro
1.173ProAla: 1.173 ± 0.289
0.335ProCys: 0.335 ± 0.145
1.424ProAsp: 1.424 ± 0.402
2.262ProGlu: 2.262 ± 0.584
1.592ProPhe: 1.592 ± 0.4
1.592ProGly: 1.592 ± 0.406
0.168ProHis: 0.168 ± 0.132
1.843ProIle: 1.843 ± 0.388
2.178ProLys: 2.178 ± 0.444
2.346ProLeu: 2.346 ± 0.586
0.586ProMet: 0.586 ± 0.251
2.178ProAsn: 2.178 ± 0.467
0.838ProPro: 0.838 ± 0.286
1.34ProGln: 1.34 ± 0.363
1.173ProArg: 1.173 ± 0.26
2.262ProSer: 2.262 ± 0.41
2.011ProThr: 2.011 ± 0.463
1.257ProVal: 1.257 ± 0.414
0.168ProTrp: 0.168 ± 0.116
1.089ProTyr: 1.089 ± 0.312
0.0ProXaa: 0.0 ± 0.0
Gln
2.848GlnAla: 2.848 ± 0.384
0.168GlnCys: 0.168 ± 0.126
2.094GlnAsp: 2.094 ± 0.393
2.932GlnGlu: 2.932 ± 0.457
1.592GlnPhe: 1.592 ± 0.306
2.011GlnGly: 2.011 ± 0.375
0.67GlnHis: 0.67 ± 0.187
2.765GlnIle: 2.765 ± 0.617
3.351GlnLys: 3.351 ± 0.41
3.351GlnLeu: 3.351 ± 0.538
1.005GlnMet: 1.005 ± 0.314
1.675GlnAsn: 1.675 ± 0.324
1.005GlnPro: 1.005 ± 0.264
0.922GlnGln: 0.922 ± 0.359
2.346GlnArg: 2.346 ± 0.437
2.262GlnSer: 2.262 ± 0.368
1.257GlnThr: 1.257 ± 0.452
2.346GlnVal: 2.346 ± 0.494
0.335GlnTrp: 0.335 ± 0.195
1.675GlnTyr: 1.675 ± 0.414
0.0GlnXaa: 0.0 ± 0.0
Arg
2.429ArgAla: 2.429 ± 0.557
0.084ArgCys: 0.084 ± 0.085
2.932ArgAsp: 2.932 ± 0.377
2.681ArgGlu: 2.681 ± 0.566
1.843ArgPhe: 1.843 ± 0.336
2.346ArgGly: 2.346 ± 0.465
0.67ArgHis: 0.67 ± 0.242
3.435ArgIle: 3.435 ± 0.544
4.021ArgLys: 4.021 ± 0.532
3.1ArgLeu: 3.1 ± 0.598
1.089ArgMet: 1.089 ± 0.253
2.597ArgAsn: 2.597 ± 0.419
0.754ArgPro: 0.754 ± 0.332
1.257ArgGln: 1.257 ± 0.28
1.592ArgArg: 1.592 ± 0.433
1.927ArgSer: 1.927 ± 0.385
2.346ArgThr: 2.346 ± 0.476
2.011ArgVal: 2.011 ± 0.355
0.335ArgTrp: 0.335 ± 0.147
2.513ArgTyr: 2.513 ± 0.633
0.0ArgXaa: 0.0 ± 0.0
Ser
4.272SerAla: 4.272 ± 0.853
0.335SerCys: 0.335 ± 0.197
4.943SerAsp: 4.943 ± 0.572
4.44SerGlu: 4.44 ± 0.639
2.765SerPhe: 2.765 ± 0.55
4.44SerGly: 4.44 ± 0.806
0.922SerHis: 0.922 ± 0.238
4.105SerIle: 4.105 ± 0.438
6.032SerLys: 6.032 ± 1.395
4.44SerLeu: 4.44 ± 0.549
2.011SerMet: 2.011 ± 0.423
4.859SerAsn: 4.859 ± 0.588
1.759SerPro: 1.759 ± 0.417
3.016SerGln: 3.016 ± 0.413
2.011SerArg: 2.011 ± 0.418
4.691SerSer: 4.691 ± 0.852
3.016SerThr: 3.016 ± 0.505
4.272SerVal: 4.272 ± 0.717
0.922SerTrp: 0.922 ± 0.273
1.675SerTyr: 1.675 ± 0.36
0.0SerXaa: 0.0 ± 0.0
Thr
3.518ThrAla: 3.518 ± 0.647
0.168ThrCys: 0.168 ± 0.127
3.267ThrAsp: 3.267 ± 0.618
4.356ThrGlu: 4.356 ± 0.544
2.681ThrPhe: 2.681 ± 0.431
4.775ThrGly: 4.775 ± 0.662
1.508ThrHis: 1.508 ± 0.412
4.105ThrIle: 4.105 ± 0.63
5.278ThrLys: 5.278 ± 0.853
3.937ThrLeu: 3.937 ± 0.529
1.005ThrMet: 1.005 ± 0.26
3.77ThrAsn: 3.77 ± 0.653
2.513ThrPro: 2.513 ± 0.415
1.592ThrGln: 1.592 ± 0.298
1.843ThrArg: 1.843 ± 0.342
3.267ThrSer: 3.267 ± 0.499
2.597ThrThr: 2.597 ± 0.508
3.77ThrVal: 3.77 ± 0.501
0.419ThrTrp: 0.419 ± 0.194
2.262ThrTyr: 2.262 ± 0.499
0.0ThrXaa: 0.0 ± 0.0
Val
3.183ValAla: 3.183 ± 0.577
0.168ValCys: 0.168 ± 0.132
3.602ValAsp: 3.602 ± 0.559
5.278ValGlu: 5.278 ± 0.604
2.513ValPhe: 2.513 ± 0.546
3.351ValGly: 3.351 ± 0.544
0.922ValHis: 0.922 ± 0.22
4.189ValIle: 4.189 ± 0.641
4.775ValLys: 4.775 ± 0.616
4.44ValLeu: 4.44 ± 0.504
1.424ValMet: 1.424 ± 0.382
3.518ValAsn: 3.518 ± 0.473
1.508ValPro: 1.508 ± 0.406
2.262ValGln: 2.262 ± 0.357
2.262ValArg: 2.262 ± 0.438
4.775ValSer: 4.775 ± 0.692
3.016ValThr: 3.016 ± 0.672
3.016ValVal: 3.016 ± 0.552
0.419ValTrp: 0.419 ± 0.219
2.262ValTyr: 2.262 ± 0.536
0.0ValXaa: 0.0 ± 0.0
Trp
0.67TrpAla: 0.67 ± 0.202
0.0TrpCys: 0.0 ± 0.0
0.503TrpAsp: 0.503 ± 0.233
0.922TrpGlu: 0.922 ± 0.357
1.34TrpPhe: 1.34 ± 0.374
0.251TrpGly: 0.251 ± 0.128
0.0TrpHis: 0.0 ± 0.0
0.754TrpIle: 0.754 ± 0.294
0.754TrpLys: 0.754 ± 0.27
1.089TrpLeu: 1.089 ± 0.259
0.419TrpMet: 0.419 ± 0.203
0.922TrpAsn: 0.922 ± 0.272
0.084TrpPro: 0.084 ± 0.071
0.503TrpGln: 0.503 ± 0.182
0.419TrpArg: 0.419 ± 0.197
0.922TrpSer: 0.922 ± 0.396
0.503TrpThr: 0.503 ± 0.178
0.754TrpVal: 0.754 ± 0.201
0.168TrpTrp: 0.168 ± 0.136
0.67TrpTyr: 0.67 ± 0.258
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.429TyrAla: 2.429 ± 0.331
0.503TyrCys: 0.503 ± 0.19
2.597TyrAsp: 2.597 ± 0.659
2.262TyrGlu: 2.262 ± 0.508
1.257TyrPhe: 1.257 ± 0.387
2.429TyrGly: 2.429 ± 0.551
0.922TyrHis: 0.922 ± 0.353
2.765TyrIle: 2.765 ± 0.632
3.854TyrLys: 3.854 ± 0.622
3.518TyrLeu: 3.518 ± 0.669
1.759TyrMet: 1.759 ± 0.317
2.429TyrAsn: 2.429 ± 0.56
1.005TyrPro: 1.005 ± 0.309
2.094TyrGln: 2.094 ± 0.413
2.011TyrArg: 2.011 ± 0.538
3.1TyrSer: 3.1 ± 0.46
2.597TyrThr: 2.597 ± 0.526
2.848TyrVal: 2.848 ± 0.608
0.586TyrTrp: 0.586 ± 0.197
1.592TyrTyr: 1.592 ± 0.622
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11938 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
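A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the procedure actually used by the database: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those estimates is taken as the error. The function names, the fixed seed and the choice of standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each resample draws len(sequences) whole proteins with replacement,
    so the protein (not the residue) is the unit of resampling.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AKAKAK", "GGGG", "AKGG"]  # hypothetical toy sequences
    print(pair_permille(toy_proteome, "AK"))
    print(bootstrap_error(toy_proteome, "AK"))
```

Resampling whole proteins keeps the pairs within each protein together, so the spread reflects how much the estimate depends on which proteins are in the proteome.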

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski