Amino acid dipeptide frequency for Streptococcus phage Javan112

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
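The counting described above can be sketched in Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille`, the toy sequences, the use of overlapping pairs that never span two proteins, and the normalization by the total number of pairs are all assumptions, since the exact procedure used by the database is not given on this page.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for the 400 standard dipeptides, in per mille.
uniform = 1000.0 / len(STANDARD) ** 2

toy = ["MKKLAE", "MKEL"]  # made-up sequences, not taken from this proteome
freqs = dipeptide_per_mille(toy)
print(uniform, freqs["MK"])
```

With real data, `proteins` would be the list of protein sequences of the proteome, and each resulting value could be compared against `uniform`.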

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
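As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform 2.5‰ baseline mentioned above. The helper name `describe` and the "over"/"under" labels are illustrative assumptions; the two input values are taken from the table (GluLys and CysHis).

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille for each of 400 dipeptides


def describe(per_mille):
    """Convert a per mille value and compare it with the uniform baseline."""
    fraction = per_mille * 1e-3          # per mille -> fraction
    percent = per_mille / 10.0           # per mille -> percent
    ratio = per_mille / UNIFORM_PER_MILLE
    if ratio > 1:
        label = "over"
    elif ratio < 1:
        label = "under"
    else:
        label = "as expected"
    return fraction, percent, ratio, label


for name, value in [("GluLys", 8.304), ("CysHis", 0.06)]:
    fraction, percent, ratio, label = describe(value)
    print(f"{name}: {fraction:.6f} ({percent:.4f}%), {ratio:.2f}x uniform, {label}")
```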

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency table (values in ‰, frequency ± error). Entries are grouped by the first residue; within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.152 ± 0.676
AlaCys: 0.662 ± 0.173
AlaAsp: 4.212 ± 0.548
AlaGlu: 4.152 ± 0.48
AlaPhe: 2.768 ± 0.421
AlaGly: 5.295 ± 0.536
AlaHis: 0.903 ± 0.168
AlaIle: 5.596 ± 0.834
AlaLys: 5.115 ± 0.439
AlaLeu: 4.934 ± 0.84
AlaMet: 1.444 ± 0.331
AlaAsn: 3.189 ± 0.528
AlaPro: 1.384 ± 0.3
AlaGln: 2.287 ± 0.376
AlaArg: 2.948 ± 0.44
AlaSer: 3.49 ± 0.379
AlaThr: 2.467 ± 0.421
AlaVal: 3.67 ± 0.481
AlaTrp: 0.602 ± 0.146
AlaTyr: 2.708 ± 0.466
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.301 ± 0.131
CysCys: 0.361 ± 0.12
CysAsp: 0.722 ± 0.183
CysGlu: 0.963 ± 0.226
CysPhe: 0.602 ± 0.204
CysGly: 1.023 ± 0.254
CysHis: 0.06 ± 0.055
CysIle: 0.602 ± 0.188
CysLys: 0.842 ± 0.225
CysLeu: 1.023 ± 0.213
CysMet: 0.12 ± 0.1
CysAsn: 0.602 ± 0.152
CysPro: 0.481 ± 0.13
CysGln: 0.421 ± 0.187
CysArg: 0.602 ± 0.186
CysSer: 0.662 ± 0.146
CysThr: 0.782 ± 0.221
CysVal: 0.722 ± 0.218
CysTrp: 0.06 ± 0.058
CysTyr: 0.662 ± 0.185
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.393 ± 0.473
AspCys: 0.842 ± 0.266
AspAsp: 2.828 ± 0.467
AspGlu: 5.536 ± 0.54
AspPhe: 3.009 ± 0.387
AspGly: 4.754 ± 0.625
AspHis: 0.842 ± 0.247
AspIle: 5.777 ± 0.476
AspLys: 5.897 ± 0.463
AspLeu: 4.693 ± 0.636
AspMet: 1.504 ± 0.308
AspAsn: 2.768 ± 0.46
AspPro: 1.926 ± 0.271
AspGln: 0.602 ± 0.17
AspArg: 3.309 ± 0.388
AspSer: 3.43 ± 0.431
AspThr: 3.069 ± 0.489
AspVal: 3.189 ± 0.456
AspTrp: 0.903 ± 0.209
AspTyr: 2.467 ± 0.379
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.61 ± 0.39
GluCys: 0.903 ± 0.18
GluAsp: 4.152 ± 0.483
GluGlu: 7.582 ± 0.943
GluPhe: 3.009 ± 0.418
GluGly: 5.054 ± 0.491
GluHis: 0.963 ± 0.212
GluIle: 7.522 ± 0.711
GluLys: 8.304 ± 0.602
GluLeu: 8.183 ± 0.558
GluMet: 3.069 ± 0.372
GluAsn: 5.295 ± 0.497
GluPro: 1.564 ± 0.342
GluGln: 2.287 ± 0.34
GluArg: 3.189 ± 0.568
GluSer: 3.731 ± 0.486
GluThr: 3.67 ± 0.397
GluVal: 5.476 ± 0.657
GluTrp: 0.963 ± 0.206
GluTyr: 3.189 ± 0.461
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.527 ± 0.443
PheCys: 0.421 ± 0.182
PheAsp: 3.249 ± 0.471
PheGlu: 3.37 ± 0.376
PhePhe: 1.203 ± 0.243
PheGly: 2.648 ± 0.401
PheHis: 0.782 ± 0.277
PheIle: 3.069 ± 0.485
PheLys: 3.61 ± 0.465
PheLeu: 3.189 ± 0.449
PheMet: 1.023 ± 0.261
PheAsn: 2.166 ± 0.394
PhePro: 1.143 ± 0.282
PheGln: 0.782 ± 0.191
PheArg: 1.926 ± 0.359
PheSer: 2.407 ± 0.234
PheThr: 1.625 ± 0.328
PheVal: 2.467 ± 0.413
PheTrp: 0.481 ± 0.153
PheTyr: 1.805 ± 0.34
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.911 ± 0.574
GlyCys: 0.662 ± 0.187
GlyAsp: 3.49 ± 0.462
GlyGlu: 4.272 ± 0.38
GlyPhe: 2.407 ± 0.357
GlyGly: 4.272 ± 0.616
GlyHis: 1.203 ± 0.233
GlyIle: 5.115 ± 0.445
GlyLys: 6.92 ± 0.568
GlyLeu: 5.476 ± 0.603
GlyMet: 1.926 ± 0.276
GlyAsn: 3.67 ± 0.62
GlyPro: 0.782 ± 0.206
GlyGln: 1.264 ± 0.234
GlyArg: 2.708 ± 0.398
GlySer: 3.069 ± 0.513
GlyThr: 4.332 ± 0.402
GlyVal: 3.61 ± 0.5
GlyTrp: 0.662 ± 0.158
GlyTyr: 2.768 ± 0.386
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.782 ± 0.189
HisCys: 0.0 ± 0.0
HisAsp: 0.903 ± 0.228
HisGlu: 0.842 ± 0.237
HisPhe: 0.602 ± 0.166
HisGly: 1.324 ± 0.29
HisHis: 0.481 ± 0.165
HisIle: 1.324 ± 0.318
HisLys: 1.143 ± 0.224
HisLeu: 1.745 ± 0.258
HisMet: 0.301 ± 0.137
HisAsn: 0.662 ± 0.217
HisPro: 0.903 ± 0.192
HisGln: 0.481 ± 0.173
HisArg: 0.662 ± 0.192
HisSer: 1.083 ± 0.271
HisThr: 0.662 ± 0.208
HisVal: 0.903 ± 0.202
HisTrp: 0.181 ± 0.09
HisTyr: 0.842 ± 0.218
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.777 ± 0.615
IleCys: 1.264 ± 0.215
IleAsp: 5.355 ± 0.514
IleGlu: 7.1 ± 0.729
IlePhe: 3.851 ± 0.396
IleGly: 3.67 ± 0.492
IleHis: 1.264 ± 0.25
IleIle: 5.355 ± 0.635
IleLys: 7.04 ± 0.829
IleLeu: 6.378 ± 0.603
IleMet: 1.865 ± 0.466
IleAsn: 4.754 ± 0.544
IlePro: 2.648 ± 0.365
IleGln: 2.166 ± 0.388
IleArg: 3.37 ± 0.499
IleSer: 5.837 ± 0.47
IleThr: 4.633 ± 0.573
IleVal: 3.791 ± 0.467
IleTrp: 1.444 ± 0.308
IleTyr: 2.708 ± 0.442
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.656 ± 0.571
LysCys: 1.203 ± 0.243
LysAsp: 6.378 ± 0.57
LysGlu: 8.123 ± 0.634
LysPhe: 3.49 ± 0.374
LysGly: 4.934 ± 0.495
LysHis: 0.963 ± 0.244
LysIle: 6.98 ± 0.629
LysLys: 7.522 ± 0.627
LysLeu: 7.1 ± 0.71
LysMet: 1.865 ± 0.391
LysAsn: 3.911 ± 0.527
LysPro: 3.129 ± 0.52
LysGln: 3.61 ± 0.455
LysArg: 3.67 ± 0.471
LysSer: 5.957 ± 0.6
LysThr: 4.272 ± 0.542
LysVal: 5.415 ± 0.458
LysTrp: 1.264 ± 0.283
LysTyr: 3.911 ± 0.508
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.453 ± 0.517
LeuCys: 0.842 ± 0.24
LeuAsp: 5.536 ± 0.617
LeuGlu: 7.762 ± 0.87
LeuPhe: 3.069 ± 0.47
LeuGly: 4.994 ± 0.498
LeuHis: 0.903 ± 0.281
LeuIle: 6.86 ± 0.792
LeuLys: 6.559 ± 0.749
LeuLeu: 6.92 ± 0.459
LeuMet: 2.407 ± 0.474
LeuAsn: 4.092 ± 0.439
LeuPro: 2.948 ± 0.361
LeuGln: 2.708 ± 0.442
LeuArg: 3.069 ± 0.4
LeuSer: 7.522 ± 0.604
LeuThr: 3.971 ± 0.419
LeuVal: 4.814 ± 0.443
LeuTrp: 0.542 ± 0.141
LeuTyr: 3.49 ± 0.468
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.564 ± 0.262
MetCys: 0.361 ± 0.13
MetAsp: 1.865 ± 0.37
MetGlu: 2.287 ± 0.356
MetPhe: 0.782 ± 0.24
MetGly: 1.504 ± 0.308
MetHis: 0.181 ± 0.101
MetIle: 1.564 ± 0.303
MetLys: 2.648 ± 0.351
MetLeu: 2.046 ± 0.323
MetMet: 0.602 ± 0.187
MetAsn: 1.926 ± 0.27
MetPro: 0.722 ± 0.178
MetGln: 0.722 ± 0.229
MetArg: 1.023 ± 0.232
MetSer: 2.046 ± 0.272
MetThr: 1.564 ± 0.342
MetVal: 1.625 ± 0.31
MetTrp: 0.241 ± 0.127
MetTyr: 1.143 ± 0.277
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.332 ± 0.572
AsnCys: 0.602 ± 0.208
AsnAsp: 3.37 ± 0.506
AsnGlu: 3.971 ± 0.582
AsnPhe: 1.986 ± 0.368
AsnGly: 5.175 ± 0.522
AsnHis: 1.264 ± 0.283
AsnIle: 4.693 ± 0.579
AsnLys: 4.453 ± 0.441
AsnLeu: 3.851 ± 0.417
AsnMet: 1.324 ± 0.277
AsnAsn: 2.467 ± 0.398
AsnPro: 1.865 ± 0.314
AsnGln: 1.685 ± 0.324
AsnArg: 2.768 ± 0.505
AsnSer: 3.49 ± 0.601
AsnThr: 3.009 ± 0.482
AsnVal: 2.708 ± 0.335
AsnTrp: 0.301 ± 0.154
AsnTyr: 1.926 ± 0.418
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.023 ± 0.256
ProCys: 0.421 ± 0.19
ProAsp: 1.625 ± 0.273
ProGlu: 2.347 ± 0.38
ProPhe: 1.625 ± 0.301
ProGly: 1.083 ± 0.224
ProHis: 0.662 ± 0.195
ProIle: 2.226 ± 0.371
ProLys: 2.347 ± 0.371
ProLeu: 1.625 ± 0.325
ProMet: 1.083 ± 0.243
ProAsn: 2.046 ± 0.363
ProPro: 1.203 ± 0.242
ProGln: 1.203 ± 0.248
ProArg: 1.143 ± 0.23
ProSer: 2.287 ± 0.337
ProThr: 2.046 ± 0.341
ProVal: 1.504 ± 0.287
ProTrp: 0.361 ± 0.15
ProTyr: 1.865 ± 0.372
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.347 ± 0.331
GlnCys: 0.421 ± 0.148
GlnAsp: 1.324 ± 0.276
GlnGlu: 2.046 ± 0.437
GlnPhe: 1.143 ± 0.182
GlnGly: 1.384 ± 0.287
GlnHis: 0.662 ± 0.207
GlnIle: 2.407 ± 0.29
GlnLys: 3.249 ± 0.341
GlnLeu: 2.166 ± 0.363
GlnMet: 1.203 ± 0.324
GlnAsn: 2.046 ± 0.309
GlnPro: 1.203 ± 0.236
GlnGln: 1.324 ± 0.305
GlnArg: 1.203 ± 0.274
GlnSer: 1.384 ± 0.34
GlnThr: 0.963 ± 0.253
GlnVal: 1.745 ± 0.295
GlnTrp: 0.481 ± 0.161
GlnTyr: 0.782 ± 0.185
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.587 ± 0.441
ArgCys: 0.662 ± 0.193
ArgAsp: 2.527 ± 0.352
ArgGlu: 3.309 ± 0.339
ArgPhe: 1.625 ± 0.271
ArgGly: 1.625 ± 0.269
ArgHis: 0.542 ± 0.185
ArgIle: 3.49 ± 0.531
ArgLys: 4.092 ± 0.446
ArgLeu: 3.49 ± 0.466
ArgMet: 1.745 ± 0.286
ArgAsn: 2.226 ± 0.353
ArgPro: 1.143 ± 0.279
ArgGln: 1.625 ± 0.346
ArgArg: 1.865 ± 0.439
ArgSer: 2.166 ± 0.358
ArgThr: 2.648 ± 0.351
ArgVal: 2.948 ± 0.496
ArgTrp: 0.662 ± 0.231
ArgTyr: 1.384 ± 0.224
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.453 ± 0.592
SerCys: 0.301 ± 0.11
SerAsp: 4.272 ± 0.522
SerGlu: 4.633 ± 0.489
SerPhe: 2.467 ± 0.441
SerGly: 4.814 ± 0.48
SerHis: 1.264 ± 0.291
SerIle: 5.295 ± 0.707
SerLys: 5.054 ± 0.581
SerLeu: 5.957 ± 0.6
SerMet: 1.865 ± 0.456
SerAsn: 3.129 ± 0.47
SerPro: 1.685 ± 0.281
SerGln: 1.685 ± 0.276
SerArg: 1.865 ± 0.3
SerSer: 5.656 ± 0.917
SerThr: 3.49 ± 0.48
SerVal: 3.791 ± 0.471
SerTrp: 0.963 ± 0.267
SerTyr: 2.648 ± 0.54
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.309 ± 0.593
ThrCys: 0.301 ± 0.117
ThrAsp: 3.009 ± 0.453
ThrGlu: 4.032 ± 0.492
ThrPhe: 1.564 ± 0.32
ThrGly: 3.129 ± 0.39
ThrHis: 1.023 ± 0.252
ThrIle: 4.814 ± 0.524
ThrLys: 4.573 ± 0.494
ThrLeu: 4.393 ± 0.438
ThrMet: 1.143 ± 0.206
ThrAsn: 3.189 ± 0.462
ThrPro: 1.805 ± 0.318
ThrGln: 0.842 ± 0.203
ThrArg: 1.805 ± 0.352
ThrSer: 3.851 ± 0.538
ThrThr: 2.828 ± 0.438
ThrVal: 3.791 ± 0.693
ThrTrp: 0.421 ± 0.181
ThrTyr: 1.865 ± 0.304
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.43 ± 0.415
ValCys: 0.481 ± 0.155
ValAsp: 4.212 ± 0.528
ValGlu: 4.633 ± 0.511
ValPhe: 2.226 ± 0.453
ValGly: 2.407 ± 0.54
ValHis: 0.842 ± 0.201
ValIle: 4.152 ± 0.565
ValLys: 5.054 ± 0.564
ValLeu: 5.415 ± 0.489
ValMet: 0.842 ± 0.201
ValAsn: 3.791 ± 0.493
ValPro: 1.685 ± 0.24
ValGln: 1.865 ± 0.352
ValArg: 2.888 ± 0.375
ValSer: 4.092 ± 0.49
ValThr: 3.129 ± 0.403
ValVal: 3.37 ± 0.391
ValTrp: 0.842 ± 0.229
ValTyr: 2.888 ± 0.438
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.662 ± 0.196
TrpCys: 0.301 ± 0.155
TrpAsp: 0.602 ± 0.157
TrpGlu: 1.023 ± 0.258
TrpPhe: 0.481 ± 0.169
TrpGly: 0.722 ± 0.196
TrpHis: 0.481 ± 0.169
TrpIle: 0.301 ± 0.138
TrpLys: 0.782 ± 0.176
TrpLeu: 1.203 ± 0.283
TrpMet: 0.421 ± 0.132
TrpAsn: 0.842 ± 0.249
TrpPro: 0.12 ± 0.091
TrpGln: 0.421 ± 0.117
TrpArg: 0.662 ± 0.182
TrpSer: 0.722 ± 0.193
TrpThr: 0.421 ± 0.145
TrpVal: 0.782 ± 0.241
TrpTrp: 0.06 ± 0.058
TrpTyr: 0.662 ± 0.188
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.708 ± 0.432
TyrCys: 0.602 ± 0.125
TyrAsp: 2.106 ± 0.377
TyrGlu: 3.731 ± 0.426
TyrPhe: 1.986 ± 0.345
TyrGly: 2.708 ± 0.461
TyrHis: 0.542 ± 0.154
TyrIle: 3.129 ± 0.506
TyrLys: 4.212 ± 0.55
TyrLeu: 3.67 ± 0.461
TyrMet: 0.602 ± 0.204
TyrAsn: 2.467 ± 0.417
TyrPro: 1.384 ± 0.295
TyrGln: 1.504 ± 0.192
TyrArg: 1.685 ± 0.34
TyrSer: 2.407 ± 0.35
TyrThr: 2.106 ± 0.381
TyrVal: 1.926 ± 0.317
TyrTrp: 0.241 ± 0.112
TyrTyr: 1.805 ± 0.442
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 65 proteins (16620 amino acids)
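
For readers who want to work with the table programmatically, entries of the form `AlaAla: 4.152 ± 0.676` can be parsed with a short regular expression. The sketch below is an illustration only: the function name `parse_entry` is hypothetical, and the pattern also tolerates a leading copy of the value fused to the label (as in `4.152AlaAla: 4.152 ± 0.676`), which is an assumption about how the page text may be copied.

```python
import re

# Optional fused leading number, two three-letter residue codes,
# then "value ± error".
ENTRY = re.compile(
    r"^(?:[\d.]+)?([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)$"
)


def parse_entry(line):
    """Return (first, second, value, error) or None if the line is not an entry."""
    match = ENTRY.match(line.strip())
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


lines = ["Glu", "GluLys: 8.304 ± 0.602", "4.152AlaAla: 4.152 ± 0.676"]
entries = [e for e in map(parse_entry, lines) if e is not None]
# Sort by frequency, highest first.
entries.sort(key=lambda e: e[2], reverse=True)
print(entries)
```

Row label lines such as `Glu` do not match the pattern and are skipped.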

Note: The error has been estimated by bootstrapping (×100) at the protein level.
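
A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those estimates is taken as the error. The function names, the toy sequences, the fixed seed, and the choice of standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = sum(
        sum(1 for i in range(len(s) - 1) if s[i:i + 2] == pair)
        for s in proteins
    )
    total = sum(max(len(s) - 1, 0) for s in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKKLAE", "MKEL", "AEKK"]  # made-up sequences
print(pair_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

A dipeptide that never occurs gets an estimate of 0 in every resample, so its error is also 0, which is consistent with the `0.0 ± 0.0` entries for Xaa in the table.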

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski