Amino acid dipeptide frequency for Yersinia phage vB_Yen_X1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
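As an illustration of the counting described above, here is a minimal Python sketch. It assumes one-letter sequences as input, maps any non-standard letter to X (Xaa), and normalizes by the number of overlapping residue pairs; the exact normalization used by the database is not stated here, so treat that choice as an assumption. The function names are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Expectation if all 20 x 20 dipeptides were equally likely: 1/400 = 0.25 %
UNIFORM_EXPECTATION = 1 / 400


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for all 441 dipeptides.

    Pairs are overlapping and counted within each protein only, so no
    dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(fraction, expected=UNIFORM_EXPECTATION):
    """Label a dipeptide relative to the uniform expectation."""
    if fraction > expected:
        return "over"
    if fraction < expected:
        return "under"
    return "as expected"
```

For example, AlaAla at 8.529‰ (a fraction of 0.008529) lies above the 0.25% expectation, while AlaCys at 0.525‰ lies below it.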

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions.
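The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value / 10


def fraction_to_per_mille(fraction):
    """Convert a plain fraction to per mille."""
    return fraction * 1e3
```

With these helpers, the AlaAla value of 8.529‰ corresponds to a fraction of about 0.008529, and the uniform expectation of 1/400 corresponds to 2.5‰.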

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.529 ± 0.809
AlaCys: 0.525 ± 0.16
AlaAsp: 4.789 ± 0.537
AlaGlu: 4.658 ± 0.456
AlaPhe: 2.69 ± 0.489
AlaGly: 7.676 ± 0.76
AlaHis: 0.656 ± 0.182
AlaIle: 6.167 ± 0.565
AlaLys: 6.102 ± 0.683
AlaLeu: 6.561 ± 0.689
AlaMet: 3.28 ± 0.478
AlaAsn: 4.133 ± 0.566
AlaPro: 2.887 ± 0.489
AlaGln: 3.149 ± 0.49
AlaArg: 3.871 ± 0.52
AlaSer: 4.855 ± 0.559
AlaThr: 4.133 ± 0.479
AlaVal: 6.167 ± 0.666
AlaTrp: 1.509 ± 0.304
AlaTyr: 2.624 ± 0.465
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.853 ± 0.242
CysCys: 0.394 ± 0.192
CysAsp: 0.59 ± 0.177
CysGlu: 0.59 ± 0.235
CysPhe: 0.394 ± 0.145
CysGly: 0.525 ± 0.237
CysHis: 0.262 ± 0.168
CysIle: 0.328 ± 0.15
CysLys: 0.853 ± 0.226
CysLeu: 0.525 ± 0.181
CysMet: 0.197 ± 0.105
CysAsn: 0.459 ± 0.162
CysPro: 0.328 ± 0.196
CysGln: 0.262 ± 0.123
CysArg: 0.722 ± 0.2
CysSer: 0.459 ± 0.178
CysThr: 0.656 ± 0.22
CysVal: 0.853 ± 0.299
CysTrp: 0.394 ± 0.202
CysTyr: 0.459 ± 0.177
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.233 ± 0.683
AspCys: 0.262 ± 0.13
AspAsp: 4.789 ± 0.581
AspGlu: 5.511 ± 0.622
AspPhe: 3.608 ± 0.365
AspGly: 6.102 ± 0.81
AspHis: 0.853 ± 0.274
AspIle: 4.789 ± 0.675
AspLys: 4.265 ± 0.417
AspLeu: 4.921 ± 0.604
AspMet: 2.428 ± 0.455
AspAsn: 3.084 ± 0.365
AspPro: 2.165 ± 0.402
AspGln: 1.378 ± 0.233
AspArg: 3.084 ± 0.36
AspSer: 3.674 ± 0.438
AspThr: 3.543 ± 0.49
AspVal: 4.068 ± 0.479
AspTrp: 1.181 ± 0.256
AspTyr: 2.624 ± 0.42
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.905 ± 0.511
GluCys: 0.59 ± 0.18
GluAsp: 4.396 ± 0.568
GluGlu: 6.102 ± 0.727
GluPhe: 2.887 ± 0.431
GluGly: 4.068 ± 0.472
GluHis: 1.181 ± 0.247
GluIle: 5.445 ± 0.589
GluLys: 6.102 ± 0.6
GluLeu: 7.676 ± 0.629
GluMet: 2.952 ± 0.415
GluAsn: 3.412 ± 0.445
GluPro: 1.706 ± 0.342
GluGln: 3.149 ± 0.403
GluArg: 4.789 ± 0.543
GluSer: 3.74 ± 0.585
GluThr: 3.28 ± 0.517
GluVal: 4.986 ± 0.451
GluTrp: 1.443 ± 0.358
GluTyr: 2.559 ± 0.423
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.756 ± 0.443
PheCys: 0.459 ± 0.156
PheAsp: 3.674 ± 0.501
PheGlu: 2.756 ± 0.485
PhePhe: 0.984 ± 0.362
PheGly: 2.756 ± 0.5
PheHis: 0.722 ± 0.189
PheIle: 2.493 ± 0.439
PheLys: 3.28 ± 0.485
PheLeu: 1.706 ± 0.363
PheMet: 1.181 ± 0.281
PheAsn: 2.099 ± 0.303
PhePro: 0.984 ± 0.31
PheGln: 0.787 ± 0.227
PheArg: 2.034 ± 0.347
PheSer: 1.64 ± 0.256
PheThr: 2.362 ± 0.427
PheVal: 2.099 ± 0.344
PheTrp: 0.656 ± 0.175
PheTyr: 0.919 ± 0.232
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.511 ± 0.647
GlyCys: 1.115 ± 0.282
GlyAsp: 5.052 ± 0.597
GlyGlu: 4.724 ± 0.478
GlyPhe: 2.952 ± 0.517
GlyGly: 6.167 ± 0.739
GlyHis: 1.05 ± 0.261
GlyIle: 3.805 ± 0.525
GlyLys: 6.758 ± 0.641
GlyLeu: 5.117 ± 0.541
GlyMet: 2.034 ± 0.35
GlyAsn: 3.346 ± 0.538
GlyPro: 1.312 ± 0.297
GlyGln: 1.903 ± 0.418
GlyArg: 2.952 ± 0.452
GlySer: 4.724 ± 0.645
GlyThr: 3.871 ± 0.45
GlyVal: 5.577 ± 0.638
GlyTrp: 1.312 ± 0.307
GlyTyr: 2.624 ± 0.371
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.181 ± 0.296
HisCys: 0.131 ± 0.1
HisAsp: 1.181 ± 0.255
HisGlu: 1.378 ± 0.239
HisPhe: 0.984 ± 0.235
HisGly: 1.247 ± 0.309
HisHis: 0.525 ± 0.193
HisIle: 1.509 ± 0.306
HisLys: 1.247 ± 0.278
HisLeu: 1.247 ± 0.219
HisMet: 0.459 ± 0.147
HisAsn: 0.984 ± 0.287
HisPro: 1.181 ± 0.241
HisGln: 0.394 ± 0.17
HisArg: 0.722 ± 0.213
HisSer: 1.181 ± 0.32
HisThr: 0.656 ± 0.307
HisVal: 1.443 ± 0.307
HisTrp: 0.262 ± 0.121
HisTyr: 0.853 ± 0.233
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.577 ± 0.586
IleCys: 1.181 ± 0.293
IleAsp: 5.774 ± 0.69
IleGlu: 6.889 ± 0.733
IlePhe: 1.771 ± 0.34
IleGly: 3.74 ± 0.475
IleHis: 1.247 ± 0.248
IleIle: 3.674 ± 0.528
IleLys: 4.789 ± 0.646
IleLeu: 3.871 ± 0.522
IleMet: 2.165 ± 0.32
IleAsn: 3.674 ± 0.53
IlePro: 1.706 ± 0.314
IleGln: 2.69 ± 0.547
IleArg: 2.821 ± 0.387
IleSer: 3.28 ± 0.447
IleThr: 3.346 ± 0.473
IleVal: 4.33 ± 0.462
IleTrp: 0.787 ± 0.206
IleTyr: 2.296 ± 0.321
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.495 ± 0.592
LysCys: 0.853 ± 0.209
LysAsp: 4.724 ± 0.43
LysGlu: 5.905 ± 0.655
LysPhe: 2.493 ± 0.287
LysGly: 4.527 ± 0.497
LysHis: 1.968 ± 0.367
LysIle: 3.936 ± 0.315
LysLys: 4.461 ± 0.632
LysLeu: 5.249 ± 0.514
LysMet: 2.493 ± 0.377
LysAsn: 3.084 ± 0.558
LysPro: 3.215 ± 0.525
LysGln: 3.084 ± 0.427
LysArg: 3.543 ± 0.378
LysSer: 3.871 ± 0.553
LysThr: 3.608 ± 0.505
LysVal: 5.642 ± 0.702
LysTrp: 1.247 ± 0.284
LysTyr: 2.099 ± 0.436
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.954 ± 0.631
LeuCys: 0.722 ± 0.217
LeuAsp: 4.789 ± 0.55
LeuGlu: 5.905 ± 0.628
LeuPhe: 2.099 ± 0.326
LeuGly: 4.921 ± 0.553
LeuHis: 1.247 ± 0.281
LeuIle: 5.577 ± 0.636
LeuLys: 5.642 ± 0.625
LeuLeu: 5.052 ± 0.615
LeuMet: 2.165 ± 0.388
LeuAsn: 3.215 ± 0.512
LeuPro: 2.69 ± 0.364
LeuGln: 3.149 ± 0.435
LeuArg: 3.871 ± 0.491
LeuSer: 4.396 ± 0.508
LeuThr: 5.38 ± 0.538
LeuVal: 4.068 ± 0.413
LeuTrp: 0.722 ± 0.22
LeuTyr: 1.837 ± 0.319
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.149 ± 0.39
MetCys: 0.066 ± 0.065
MetAsp: 1.312 ± 0.324
MetGlu: 2.362 ± 0.38
MetPhe: 0.853 ± 0.231
MetGly: 1.903 ± 0.437
MetHis: 1.05 ± 0.308
MetIle: 3.412 ± 0.442
MetLys: 1.968 ± 0.379
MetLeu: 2.756 ± 0.412
MetMet: 0.722 ± 0.248
MetAsn: 1.64 ± 0.295
MetPro: 0.656 ± 0.227
MetGln: 0.919 ± 0.254
MetArg: 2.821 ± 0.389
MetSer: 1.837 ± 0.352
MetThr: 1.64 ± 0.356
MetVal: 1.706 ± 0.276
MetTrp: 0.131 ± 0.081
MetTyr: 1.181 ± 0.314
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.724 ± 0.51
AsnCys: 0.525 ± 0.206
AsnAsp: 3.215 ± 0.43
AsnGlu: 2.69 ± 0.372
AsnPhe: 1.378 ± 0.317
AsnGly: 4.921 ± 0.572
AsnHis: 1.64 ± 0.316
AsnIle: 2.624 ± 0.399
AsnLys: 3.346 ± 0.461
AsnLeu: 3.215 ± 0.493
AsnMet: 1.378 ± 0.25
AsnAsn: 2.624 ± 0.494
AsnPro: 1.968 ± 0.328
AsnGln: 1.247 ± 0.394
AsnArg: 2.296 ± 0.46
AsnSer: 1.837 ± 0.345
AsnThr: 2.69 ± 0.368
AsnVal: 2.952 ± 0.372
AsnTrp: 0.722 ± 0.197
AsnTyr: 1.509 ± 0.248
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.215 ± 0.501
ProCys: 0.066 ± 0.069
ProAsp: 2.756 ± 0.446
ProGlu: 3.018 ± 0.448
ProPhe: 0.984 ± 0.206
ProGly: 0.197 ± 0.108
ProHis: 0.525 ± 0.201
ProIle: 1.903 ± 0.293
ProLys: 1.771 ± 0.31
ProLeu: 2.887 ± 0.427
ProMet: 0.722 ± 0.196
ProAsn: 2.034 ± 0.372
ProPro: 1.575 ± 0.511
ProGln: 1.903 ± 0.282
ProArg: 1.64 ± 0.335
ProSer: 1.64 ± 0.366
ProThr: 1.968 ± 0.448
ProVal: 2.887 ± 0.44
ProTrp: 0.656 ± 0.177
ProTyr: 1.247 ± 0.24
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.412 ± 0.519
GlnCys: 0.525 ± 0.178
GlnAsp: 1.378 ± 0.307
GlnGlu: 3.018 ± 0.471
GlnPhe: 1.312 ± 0.281
GlnGly: 2.034 ± 0.347
GlnHis: 0.525 ± 0.165
GlnIle: 2.296 ± 0.407
GlnLys: 1.968 ± 0.34
GlnLeu: 2.362 ± 0.453
GlnMet: 1.509 ± 0.344
GlnAsn: 1.247 ± 0.253
GlnPro: 1.115 ± 0.317
GlnGln: 2.165 ± 0.429
GlnArg: 2.034 ± 0.274
GlnSer: 1.575 ± 0.326
GlnThr: 1.312 ± 0.275
GlnVal: 2.559 ± 0.313
GlnTrp: 0.787 ± 0.223
GlnTyr: 1.378 ± 0.271
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.346 ± 0.438
ArgCys: 0.59 ± 0.282
ArgAsp: 3.28 ± 0.423
ArgGlu: 4.593 ± 0.637
ArgPhe: 1.575 ± 0.249
ArgGly: 3.215 ± 0.464
ArgHis: 0.59 ± 0.175
ArgIle: 3.805 ± 0.509
ArgLys: 4.002 ± 0.477
ArgLeu: 4.921 ± 0.616
ArgMet: 1.575 ± 0.286
ArgAsn: 2.231 ± 0.372
ArgPro: 1.706 ± 0.305
ArgGln: 1.706 ± 0.378
ArgArg: 2.559 ± 0.423
ArgSer: 2.756 ± 0.479
ArgThr: 2.428 ± 0.424
ArgVal: 3.674 ± 0.467
ArgTrp: 0.459 ± 0.165
ArgTyr: 0.919 ± 0.195
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.936 ± 0.459
SerCys: 0.394 ± 0.146
SerAsp: 4.133 ± 0.472
SerGlu: 4.002 ± 0.493
SerPhe: 2.099 ± 0.307
SerGly: 4.461 ± 0.69
SerHis: 0.722 ± 0.205
SerIle: 3.215 ± 0.661
SerLys: 3.346 ± 0.503
SerLeu: 4.199 ± 0.552
SerMet: 1.771 ± 0.317
SerAsn: 2.034 ± 0.346
SerPro: 1.968 ± 0.366
SerGln: 1.968 ± 0.295
SerArg: 2.69 ± 0.44
SerSer: 2.165 ± 0.291
SerThr: 2.821 ± 0.415
SerVal: 4.986 ± 0.515
SerTrp: 0.459 ± 0.168
SerTyr: 1.247 ± 0.265
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.658 ± 0.564
ThrCys: 0.459 ± 0.136
ThrAsp: 3.543 ± 0.391
ThrGlu: 4.461 ± 0.656
ThrPhe: 2.362 ± 0.398
ThrGly: 5.052 ± 0.519
ThrHis: 0.853 ± 0.296
ThrIle: 3.346 ± 0.542
ThrLys: 3.871 ± 0.523
ThrLeu: 4.133 ± 0.383
ThrMet: 0.919 ± 0.226
ThrAsn: 2.099 ± 0.352
ThrPro: 2.296 ± 0.45
ThrGln: 1.575 ± 0.335
ThrArg: 1.903 ± 0.339
ThrSer: 2.428 ± 0.413
ThrThr: 3.477 ± 0.471
ThrVal: 3.936 ± 0.47
ThrTrp: 0.59 ± 0.2
ThrTyr: 2.165 ± 0.362
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.052 ± 0.527
ValCys: 0.656 ± 0.176
ValAsp: 5.38 ± 0.595
ValGlu: 4.986 ± 0.557
ValPhe: 2.69 ± 0.494
ValGly: 4.855 ± 0.659
ValHis: 1.509 ± 0.271
ValIle: 4.593 ± 0.468
ValLys: 5.577 ± 0.5
ValLeu: 4.002 ± 0.487
ValMet: 2.428 ± 0.344
ValAsn: 3.674 ± 0.633
ValPro: 2.428 ± 0.53
ValGln: 1.903 ± 0.358
ValArg: 3.28 ± 0.462
ValSer: 4.33 ± 0.436
ValThr: 4.461 ± 0.599
ValVal: 4.33 ± 0.571
ValTrp: 0.919 ± 0.263
ValTyr: 2.231 ± 0.459
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.181 ± 0.213
TrpCys: 0.262 ± 0.121
TrpAsp: 1.115 ± 0.236
TrpGlu: 0.853 ± 0.261
TrpPhe: 1.05 ± 0.262
TrpGly: 0.984 ± 0.239
TrpHis: 0.656 ± 0.246
TrpIle: 0.722 ± 0.23
TrpLys: 1.05 ± 0.215
TrpLeu: 1.378 ± 0.242
TrpMet: 0.59 ± 0.184
TrpAsn: 0.722 ± 0.186
TrpPro: 0.525 ± 0.177
TrpGln: 0.394 ± 0.168
TrpArg: 0.59 ± 0.196
TrpSer: 0.853 ± 0.245
TrpThr: 0.459 ± 0.163
TrpVal: 0.853 ± 0.227
TrpTrp: 0.197 ± 0.103
TrpTyr: 0.394 ± 0.151
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.296 ± 0.353
TyrCys: 0.328 ± 0.188
TyrAsp: 2.69 ± 0.477
TyrGlu: 1.968 ± 0.408
TyrPhe: 1.181 ± 0.23
TyrGly: 2.559 ± 0.42
TyrHis: 0.984 ± 0.268
TyrIle: 1.968 ± 0.358
TyrLys: 2.099 ± 0.332
TyrLeu: 2.624 ± 0.48
TyrMet: 1.05 ± 0.256
TyrAsn: 1.706 ± 0.324
TyrPro: 1.181 ± 0.278
TyrGln: 0.787 ± 0.178
TyrArg: 1.706 ± 0.31
TyrSer: 1.443 ± 0.301
TyrThr: 2.034 ± 0.35
TyrVal: 2.165 ± 0.381
TyrTrp: 0.394 ± 0.201
TyrTyr: 0.853 ± 0.23
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 88 proteins (15243 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
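A protein-level bootstrap of this kind could look roughly like the Python sketch below. It assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation across replicates; neither detail is confirmed by this page, and the function names are hypothetical.

```python
import random
import statistics


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of overlapping residue pairs equal to `dipeptide`."""
    pairs = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            pairs += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return hits / pairs if pairs else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the dipeptide fraction over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so resampling happens at the protein level, not the residue level.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_fraction(sample, dipeptide))
    return statistics.stdev(estimates)
```

A dipeptide that never occurs yields the same estimate (zero) in every replicate, so its error is zero as well, which matches the 0.0 ± 0.0 entries in the Xaa rows and columns.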

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski