Amino acid dipepetide frequency for Yersinia virus L413C

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.758AlaAla: 9.758 ± 1.777
0.623AlaCys: 0.623 ± 0.283
6.021AlaAsp: 6.021 ± 0.938
5.71AlaGlu: 5.71 ± 0.78
2.907AlaPhe: 2.907 ± 0.527
7.993AlaGly: 7.993 ± 1.012
1.453AlaHis: 1.453 ± 0.384
3.737AlaIle: 3.737 ± 0.795
5.502AlaLys: 5.502 ± 0.793
9.966AlaLeu: 9.966 ± 1.258
2.284AlaMet: 2.284 ± 0.423
2.388AlaAsn: 2.388 ± 0.473
4.256AlaPro: 4.256 ± 0.716
3.53AlaGln: 3.53 ± 0.699
4.879AlaArg: 4.879 ± 0.854
8.097AlaSer: 8.097 ± 0.851
5.917AlaThr: 5.917 ± 0.83
7.578AlaVal: 7.578 ± 0.908
1.661AlaTrp: 1.661 ± 0.369
2.18AlaTyr: 2.18 ± 0.427
0.0AlaXaa: 0.0 ± 0.0
Cys
1.038CysAla: 1.038 ± 0.355
0.104CysCys: 0.104 ± 0.115
0.934CysAsp: 0.934 ± 0.285
0.311CysGlu: 0.311 ± 0.179
0.208CysPhe: 0.208 ± 0.15
0.415CysGly: 0.415 ± 0.206
0.104CysHis: 0.104 ± 0.101
0.311CysIle: 0.311 ± 0.195
0.311CysLys: 0.311 ± 0.204
0.623CysLeu: 0.623 ± 0.264
0.208CysMet: 0.208 ± 0.137
0.208CysAsn: 0.208 ± 0.154
0.519CysPro: 0.519 ± 0.241
0.727CysGln: 0.727 ± 0.348
0.934CysArg: 0.934 ± 0.295
0.519CysSer: 0.519 ± 0.252
0.934CysThr: 0.934 ± 0.32
0.519CysVal: 0.519 ± 0.255
0.104CysTrp: 0.104 ± 0.111
0.311CysTyr: 0.311 ± 0.178
0.0CysXaa: 0.0 ± 0.0
Asp
6.54AspAla: 6.54 ± 0.792
0.311AspCys: 0.311 ± 0.192
3.322AspAsp: 3.322 ± 0.592
3.633AspGlu: 3.633 ± 0.767
2.699AspPhe: 2.699 ± 0.851
5.087AspGly: 5.087 ± 0.66
0.519AspHis: 0.519 ± 0.265
4.256AspIle: 4.256 ± 0.867
2.284AspLys: 2.284 ± 0.491
5.087AspLeu: 5.087 ± 0.697
0.727AspMet: 0.727 ± 0.291
2.18AspAsn: 2.18 ± 0.524
1.661AspPro: 1.661 ± 0.429
1.453AspGln: 1.453 ± 0.532
2.388AspArg: 2.388 ± 0.58
2.595AspSer: 2.595 ± 0.449
3.945AspThr: 3.945 ± 0.675
3.633AspVal: 3.633 ± 0.499
1.038AspTrp: 1.038 ± 0.401
2.491AspTyr: 2.491 ± 0.553
0.0AspXaa: 0.0 ± 0.0
Glu
5.19GluAla: 5.19 ± 0.671
0.519GluCys: 0.519 ± 0.246
2.699GluAsp: 2.699 ± 0.516
4.049GluGlu: 4.049 ± 0.65
1.972GluPhe: 1.972 ± 0.462
2.699GluGly: 2.699 ± 0.524
1.246GluHis: 1.246 ± 0.355
2.595GluIle: 2.595 ± 0.552
4.568GluLys: 4.568 ± 0.705
8.409GluLeu: 8.409 ± 0.969
2.388GluMet: 2.388 ± 0.511
3.01GluAsn: 3.01 ± 0.605
2.803GluPro: 2.803 ± 0.689
3.218GluGln: 3.218 ± 0.765
4.671GluArg: 4.671 ± 0.984
4.775GluSer: 4.775 ± 0.699
2.907GluThr: 2.907 ± 0.697
4.256GluVal: 4.256 ± 0.687
1.453GluTrp: 1.453 ± 0.49
2.076GluTyr: 2.076 ± 0.497
0.0GluXaa: 0.0 ± 0.0
Phe
2.491PheAla: 2.491 ± 0.488
0.727PheCys: 0.727 ± 0.246
1.557PheAsp: 1.557 ± 0.361
2.076PheGlu: 2.076 ± 0.48
1.142PhePhe: 1.142 ± 0.336
1.246PheGly: 1.246 ± 0.376
0.727PheHis: 0.727 ± 0.247
1.661PheIle: 1.661 ± 0.518
2.491PheLys: 2.491 ± 0.566
3.53PheLeu: 3.53 ± 0.687
0.934PheMet: 0.934 ± 0.305
1.765PheAsn: 1.765 ± 0.47
1.142PhePro: 1.142 ± 0.285
1.35PheGln: 1.35 ± 0.309
1.972PheArg: 1.972 ± 0.431
2.284PheSer: 2.284 ± 0.493
2.907PheThr: 2.907 ± 0.558
1.246PheVal: 1.246 ± 0.352
0.727PheTrp: 0.727 ± 0.327
1.453PheTyr: 1.453 ± 0.411
0.0PheXaa: 0.0 ± 0.0
Gly
5.813GlyAla: 5.813 ± 0.931
0.623GlyCys: 0.623 ± 0.347
4.775GlyAsp: 4.775 ± 0.548
4.464GlyGlu: 4.464 ± 0.703
2.595GlyPhe: 2.595 ± 0.54
5.502GlyGly: 5.502 ± 1.01
0.727GlyHis: 0.727 ± 0.224
4.049GlyIle: 4.049 ± 0.723
5.19GlyLys: 5.19 ± 0.64
4.568GlyLeu: 4.568 ± 0.659
2.388GlyMet: 2.388 ± 0.598
2.284GlyAsn: 2.284 ± 0.703
0.519GlyPro: 0.519 ± 0.223
2.491GlyGln: 2.491 ± 0.511
4.464GlyArg: 4.464 ± 0.653
3.633GlySer: 3.633 ± 0.599
4.464GlyThr: 4.464 ± 0.903
6.021GlyVal: 6.021 ± 0.849
1.142GlyTrp: 1.142 ± 0.253
1.972GlyTyr: 1.972 ± 0.455
0.0GlyXaa: 0.0 ± 0.0
His
1.972HisAla: 1.972 ± 0.521
0.415HisCys: 0.415 ± 0.174
0.934HisAsp: 0.934 ± 0.354
1.038HisGlu: 1.038 ± 0.458
0.519HisPhe: 0.519 ± 0.287
1.246HisGly: 1.246 ± 0.484
0.623HisHis: 0.623 ± 0.223
1.557HisIle: 1.557 ± 0.49
0.623HisLys: 0.623 ± 0.301
1.765HisLeu: 1.765 ± 0.373
0.519HisMet: 0.519 ± 0.224
1.038HisAsn: 1.038 ± 0.358
1.142HisPro: 1.142 ± 0.35
0.934HisGln: 0.934 ± 0.317
0.934HisArg: 0.934 ± 0.35
0.415HisSer: 0.415 ± 0.221
0.934HisThr: 0.934 ± 0.383
0.83HisVal: 0.83 ± 0.28
0.311HisTrp: 0.311 ± 0.132
0.415HisTyr: 0.415 ± 0.216
0.0HisXaa: 0.0 ± 0.0
Ile
4.464IleAla: 4.464 ± 0.624
0.519IleCys: 0.519 ± 0.301
3.737IleAsp: 3.737 ± 0.67
3.633IleGlu: 3.633 ± 0.547
1.765IlePhe: 1.765 ± 0.509
3.945IleGly: 3.945 ± 0.687
0.623IleHis: 0.623 ± 0.275
3.114IleIle: 3.114 ± 0.496
2.388IleLys: 2.388 ± 0.634
2.803IleLeu: 2.803 ± 0.434
1.142IleMet: 1.142 ± 0.341
2.907IleAsn: 2.907 ± 0.522
2.076IlePro: 2.076 ± 0.497
2.076IleGln: 2.076 ± 0.513
4.152IleArg: 4.152 ± 0.627
4.256IleSer: 4.256 ± 0.927
4.983IleThr: 4.983 ± 0.555
3.218IleVal: 3.218 ± 0.487
0.83IleTrp: 0.83 ± 0.248
1.765IleTyr: 1.765 ± 0.346
0.0IleXaa: 0.0 ± 0.0
Lys
4.983LysAla: 4.983 ± 0.783
0.208LysCys: 0.208 ± 0.144
1.972LysAsp: 1.972 ± 0.526
3.218LysGlu: 3.218 ± 0.605
1.765LysPhe: 1.765 ± 0.428
3.01LysGly: 3.01 ± 0.437
1.35LysHis: 1.35 ± 0.411
2.388LysIle: 2.388 ± 0.438
4.152LysLys: 4.152 ± 0.813
6.644LysLeu: 6.644 ± 0.972
0.623LysMet: 0.623 ± 0.26
3.945LysAsn: 3.945 ± 0.781
3.322LysPro: 3.322 ± 0.601
1.869LysGln: 1.869 ± 0.397
4.152LysArg: 4.152 ± 0.782
2.491LysSer: 2.491 ± 0.527
3.841LysThr: 3.841 ± 0.552
3.426LysVal: 3.426 ± 0.583
0.83LysTrp: 0.83 ± 0.353
2.595LysTyr: 2.595 ± 0.629
0.0LysXaa: 0.0 ± 0.0
Leu
9.135LeuAla: 9.135 ± 0.861
0.727LeuCys: 0.727 ± 0.273
5.502LeuAsp: 5.502 ± 0.832
6.436LeuGlu: 6.436 ± 0.872
3.945LeuPhe: 3.945 ± 0.844
4.775LeuGly: 4.775 ± 0.868
1.972LeuHis: 1.972 ± 0.474
4.879LeuIle: 4.879 ± 0.715
5.71LeuLys: 5.71 ± 0.881
5.606LeuLeu: 5.606 ± 0.975
3.633LeuMet: 3.633 ± 0.661
4.775LeuAsn: 4.775 ± 0.575
4.464LeuPro: 4.464 ± 0.787
2.595LeuGln: 2.595 ± 0.591
4.568LeuArg: 4.568 ± 0.667
7.267LeuSer: 7.267 ± 0.989
7.37LeuThr: 7.37 ± 0.986
4.256LeuVal: 4.256 ± 0.547
0.934LeuTrp: 0.934 ± 0.347
2.595LeuTyr: 2.595 ± 0.536
0.0LeuXaa: 0.0 ± 0.0
Met
3.218MetAla: 3.218 ± 0.448
0.208MetCys: 0.208 ± 0.138
0.727MetAsp: 0.727 ± 0.343
1.661MetGlu: 1.661 ± 0.331
0.934MetPhe: 0.934 ± 0.321
0.83MetGly: 0.83 ± 0.29
0.727MetHis: 0.727 ± 0.283
1.142MetIle: 1.142 ± 0.305
1.246MetLys: 1.246 ± 0.341
2.803MetLeu: 2.803 ± 0.647
1.038MetMet: 1.038 ± 0.339
1.869MetAsn: 1.869 ± 0.475
1.038MetPro: 1.038 ± 0.485
0.934MetGln: 0.934 ± 0.329
2.076MetArg: 2.076 ± 0.645
1.765MetSer: 1.765 ± 0.487
3.01MetThr: 3.01 ± 0.586
1.35MetVal: 1.35 ± 0.468
0.311MetTrp: 0.311 ± 0.215
0.934MetTyr: 0.934 ± 0.278
0.0MetXaa: 0.0 ± 0.0
Asn
3.53AsnAla: 3.53 ± 0.608
0.415AsnCys: 0.415 ± 0.204
2.595AsnAsp: 2.595 ± 0.561
2.388AsnGlu: 2.388 ± 0.705
1.142AsnPhe: 1.142 ± 0.366
3.737AsnGly: 3.737 ± 0.814
0.727AsnHis: 0.727 ± 0.327
3.322AsnIle: 3.322 ± 0.767
2.388AsnLys: 2.388 ± 0.51
2.803AsnLeu: 2.803 ± 0.533
0.83AsnMet: 0.83 ± 0.256
1.453AsnAsn: 1.453 ± 0.272
2.907AsnPro: 2.907 ± 0.532
1.246AsnGln: 1.246 ± 0.337
3.01AsnArg: 3.01 ± 0.627
2.388AsnSer: 2.388 ± 0.653
1.661AsnThr: 1.661 ± 0.385
2.284AsnVal: 2.284 ± 0.426
0.311AsnTrp: 0.311 ± 0.176
1.038AsnTyr: 1.038 ± 0.374
0.0AsnXaa: 0.0 ± 0.0
Pro
3.945ProAla: 3.945 ± 0.832
0.208ProCys: 0.208 ± 0.15
3.426ProAsp: 3.426 ± 0.605
4.049ProGlu: 4.049 ± 0.648
0.934ProPhe: 0.934 ± 0.396
2.595ProGly: 2.595 ± 0.539
1.142ProHis: 1.142 ± 0.376
1.557ProIle: 1.557 ± 0.424
2.284ProLys: 2.284 ± 0.543
4.152ProLeu: 4.152 ± 0.647
0.934ProMet: 0.934 ± 0.396
0.934ProAsn: 0.934 ± 0.352
1.661ProPro: 1.661 ± 0.448
1.557ProGln: 1.557 ± 0.381
2.595ProArg: 2.595 ± 0.77
2.284ProSer: 2.284 ± 0.479
1.661ProThr: 1.661 ± 0.363
4.775ProVal: 4.775 ± 0.691
0.519ProTrp: 0.519 ± 0.223
1.038ProTyr: 1.038 ± 0.402
0.0ProXaa: 0.0 ± 0.0
Gln
3.737GlnAla: 3.737 ± 1.11
0.208GlnCys: 0.208 ± 0.145
2.076GlnAsp: 2.076 ± 0.472
2.284GlnGlu: 2.284 ± 0.608
0.727GlnPhe: 0.727 ± 0.252
1.35GlnGly: 1.35 ± 0.36
0.83GlnHis: 0.83 ± 0.359
1.972GlnIle: 1.972 ± 0.659
2.491GlnLys: 2.491 ± 0.428
3.945GlnLeu: 3.945 ± 0.627
0.83GlnMet: 0.83 ± 0.326
0.83GlnAsn: 0.83 ± 0.296
1.35GlnPro: 1.35 ± 0.425
2.284GlnGln: 2.284 ± 0.651
4.152GlnArg: 4.152 ± 0.776
2.595GlnSer: 2.595 ± 0.576
2.18GlnThr: 2.18 ± 0.622
2.388GlnVal: 2.388 ± 0.568
0.623GlnTrp: 0.623 ± 0.223
0.519GlnTyr: 0.519 ± 0.227
0.0GlnXaa: 0.0 ± 0.0
Arg
6.021ArgAla: 6.021 ± 0.815
0.934ArgCys: 0.934 ± 0.282
3.322ArgAsp: 3.322 ± 0.46
4.256ArgGlu: 4.256 ± 0.743
1.869ArgPhe: 1.869 ± 0.485
3.53ArgGly: 3.53 ± 0.938
1.453ArgHis: 1.453 ± 0.367
3.322ArgIle: 3.322 ± 0.712
3.426ArgLys: 3.426 ± 0.533
6.332ArgLeu: 6.332 ± 0.82
1.661ArgMet: 1.661 ± 0.463
2.491ArgAsn: 2.491 ± 0.539
2.18ArgPro: 2.18 ± 0.342
3.218ArgGln: 3.218 ± 0.705
4.775ArgArg: 4.775 ± 0.737
3.114ArgSer: 3.114 ± 0.585
2.803ArgThr: 2.803 ± 0.447
5.294ArgVal: 5.294 ± 0.931
0.934ArgTrp: 0.934 ± 0.315
3.218ArgTyr: 3.218 ± 0.542
0.0ArgXaa: 0.0 ± 0.0
Ser
6.332SerAla: 6.332 ± 0.95
0.727SerCys: 0.727 ± 0.268
3.322SerAsp: 3.322 ± 0.614
4.568SerGlu: 4.568 ± 0.73
2.18SerPhe: 2.18 ± 0.499
4.775SerGly: 4.775 ± 1.147
1.038SerHis: 1.038 ± 0.534
3.322SerIle: 3.322 ± 0.672
3.01SerLys: 3.01 ± 0.518
6.644SerLeu: 6.644 ± 1.031
1.972SerMet: 1.972 ± 0.43
2.284SerAsn: 2.284 ± 0.493
2.595SerPro: 2.595 ± 0.703
2.076SerGln: 2.076 ± 0.354
4.256SerArg: 4.256 ± 0.75
2.491SerSer: 2.491 ± 0.515
4.152SerThr: 4.152 ± 0.786
4.879SerVal: 4.879 ± 0.908
0.519SerTrp: 0.519 ± 0.219
1.453SerTyr: 1.453 ± 0.484
0.0SerXaa: 0.0 ± 0.0
Thr
7.37ThrAla: 7.37 ± 1.309
0.727ThrCys: 0.727 ± 0.308
3.633ThrAsp: 3.633 ± 0.578
2.907ThrGlu: 2.907 ± 0.622
2.491ThrPhe: 2.491 ± 0.399
7.163ThrGly: 7.163 ± 0.87
0.83ThrHis: 0.83 ± 0.321
3.01ThrIle: 3.01 ± 0.726
2.803ThrLys: 2.803 ± 0.596
6.54ThrLeu: 6.54 ± 1.143
2.076ThrMet: 2.076 ± 0.355
1.972ThrAsn: 1.972 ± 0.532
3.218ThrPro: 3.218 ± 0.469
1.869ThrGln: 1.869 ± 0.53
4.152ThrArg: 4.152 ± 0.677
4.256ThrSer: 4.256 ± 0.635
3.53ThrThr: 3.53 ± 0.63
4.879ThrVal: 4.879 ± 0.831
0.83ThrTrp: 0.83 ± 0.421
1.038ThrTyr: 1.038 ± 0.339
0.0ThrXaa: 0.0 ± 0.0
Val
6.748ValAla: 6.748 ± 1.04
1.038ValCys: 1.038 ± 0.417
3.426ValAsp: 3.426 ± 0.573
4.568ValGlu: 4.568 ± 0.703
2.284ValPhe: 2.284 ± 0.549
4.983ValGly: 4.983 ± 0.821
0.934ValHis: 0.934 ± 0.337
4.775ValIle: 4.775 ± 0.736
4.256ValLys: 4.256 ± 0.634
5.502ValLeu: 5.502 ± 0.883
2.284ValMet: 2.284 ± 0.435
2.284ValAsn: 2.284 ± 0.54
2.699ValPro: 2.699 ± 0.533
1.869ValGln: 1.869 ± 0.462
2.595ValArg: 2.595 ± 0.459
4.983ValSer: 4.983 ± 0.603
5.917ValThr: 5.917 ± 1.134
4.152ValVal: 4.152 ± 0.687
0.623ValTrp: 0.623 ± 0.267
1.246ValTyr: 1.246 ± 0.314
0.0ValXaa: 0.0 ± 0.0
Trp
1.142TrpAla: 1.142 ± 0.284
0.0TrpCys: 0.0 ± 0.0
0.83TrpAsp: 0.83 ± 0.286
1.142TrpGlu: 1.142 ± 0.265
0.311TrpPhe: 0.311 ± 0.167
0.519TrpGly: 0.519 ± 0.277
0.415TrpHis: 0.415 ± 0.236
0.727TrpIle: 0.727 ± 0.258
0.83TrpLys: 0.83 ± 0.309
1.661TrpLeu: 1.661 ± 0.468
0.415TrpMet: 0.415 ± 0.213
0.623TrpAsn: 0.623 ± 0.355
1.142TrpPro: 1.142 ± 0.306
0.415TrpGln: 0.415 ± 0.165
1.557TrpArg: 1.557 ± 0.49
0.934TrpSer: 0.934 ± 0.431
0.415TrpThr: 0.415 ± 0.176
0.623TrpVal: 0.623 ± 0.248
0.519TrpTrp: 0.519 ± 0.221
0.623TrpTyr: 0.623 ± 0.265
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.01TyrAla: 3.01 ± 0.635
0.104TyrCys: 0.104 ± 0.102
1.038TyrAsp: 1.038 ± 0.331
3.114TyrGlu: 3.114 ± 0.64
1.038TyrPhe: 1.038 ± 0.355
2.388TyrGly: 2.388 ± 0.564
0.727TyrHis: 0.727 ± 0.228
2.699TyrIle: 2.699 ± 0.551
0.83TyrLys: 0.83 ± 0.332
1.765TyrLeu: 1.765 ± 0.453
0.83TyrMet: 0.83 ± 0.332
0.934TyrAsn: 0.934 ± 0.332
1.557TyrPro: 1.557 ± 0.319
1.661TyrGln: 1.661 ± 0.417
1.869TyrArg: 1.869 ± 0.545
1.453TyrSer: 1.453 ± 0.385
1.765TyrThr: 1.765 ± 0.468
1.557TyrVal: 1.557 ± 0.422
0.623TyrTrp: 0.623 ± 0.229
0.623TyrTyr: 0.623 ± 0.263
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 40 proteins (9634 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski