Amino acid dipeptide frequency for Salmonella virus PsP3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred randomly and with equal probability in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
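As an illustration of how such frequencies can be obtained, below is a minimal Python sketch. It is not the code used by the database: the function name, the one-letter sequence encoding, the toy sequences and the choice to normalise by the number of counted pairs are all assumptions made for this example.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille
UNIFORM_EXPECTED = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein
    (never across two proteins) and normalised by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the PsP3 proteome
toy = ["MKALAA", "ALAK"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_EXPECTED else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With only eight pairs in the toy input every observed dipeptide is far above 2.5‰; a comparison with the uniform expectation becomes meaningful only for whole proteomes.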

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
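The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical, and using the uniform 2.5‰ value as the reference for over- and underrepresentation is an assumption based on the description above; the two example values are taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# 2.5 per mille if all 400 dipeptides were equally likely
UNIFORM_PER_MILLE = 1000.0 / 400


def enrichment(value, expected=UNIFORM_PER_MILLE):
    """Observed / expected ratio: above 1 is over-, below 1 underrepresented."""
    return value / expected


ala_ala = 11.558  # AlaAla, from the table
gly_pro = 0.42    # GlyPro, from the table

print(round(per_mille_to_fraction(ala_ala), 6))
print(round(enrichment(ala_ala), 3))
print(round(enrichment(gly_pro), 3))
```

Under this assumption AlaAla occurs roughly 4.6 times more often than the uniform expectation, while GlyPro occurs at less than a fifth of it.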

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.558 ± 2.19
AlaCys: 1.261 ± 0.434
AlaAsp: 6.41 ± 0.925
AlaGlu: 5.989 ± 0.821
AlaPhe: 2.312 ± 0.361
AlaGly: 9.247 ± 1.233
AlaHis: 1.891 ± 0.429
AlaIle: 5.254 ± 0.797
AlaLys: 5.779 ± 0.737
AlaLeu: 9.562 ± 0.884
AlaMet: 2.837 ± 0.484
AlaAsn: 2.837 ± 0.818
AlaPro: 5.569 ± 0.968
AlaGln: 4.518 ± 0.75
AlaArg: 5.359 ± 0.698
AlaSer: 7.145 ± 0.865
AlaThr: 6.305 ± 0.979
AlaVal: 6.515 ± 0.76
AlaTrp: 1.786 ± 0.373
AlaTyr: 2.417 ± 0.44
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.051 ± 0.327
CysCys: 0.105 ± 0.107
CysAsp: 0.42 ± 0.21
CysGlu: 0.525 ± 0.236
CysPhe: 0.105 ± 0.114
CysGly: 0.63 ± 0.259
CysHis: 0.315 ± 0.192
CysIle: 0.42 ± 0.203
CysLys: 0.736 ± 0.234
CysLeu: 0.63 ± 0.23
CysMet: 0.21 ± 0.146
CysAsn: 0.525 ± 0.2
CysPro: 0.525 ± 0.28
CysGln: 0.63 ± 0.295
CysArg: 0.841 ± 0.304
CysSer: 0.21 ± 0.139
CysThr: 0.946 ± 0.265
CysVal: 0.841 ± 0.323
CysTrp: 0.105 ± 0.091
CysTyr: 0.315 ± 0.186
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.199 ± 0.758
AspCys: 0.315 ± 0.176
AspAsp: 4.728 ± 0.808
AspGlu: 3.678 ± 0.519
AspPhe: 3.047 ± 0.605
AspGly: 5.254 ± 0.626
AspHis: 0.42 ± 0.192
AspIle: 4.833 ± 0.723
AspLys: 3.678 ± 0.609
AspLeu: 5.044 ± 0.588
AspMet: 1.366 ± 0.35
AspAsn: 2.627 ± 0.671
AspPro: 1.891 ± 0.46
AspGln: 1.366 ± 0.399
AspArg: 2.522 ± 0.59
AspSer: 3.047 ± 0.615
AspThr: 4.833 ± 0.706
AspVal: 3.888 ± 0.743
AspTrp: 0.946 ± 0.339
AspTyr: 1.786 ± 0.472
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.359 ± 0.877
GluCys: 0.63 ± 0.286
GluAsp: 2.312 ± 0.648
GluGlu: 2.627 ± 0.491
GluPhe: 2.102 ± 0.511
GluGly: 2.942 ± 0.621
GluHis: 1.576 ± 0.481
GluIle: 3.678 ± 0.72
GluLys: 3.467 ± 0.53
GluLeu: 8.826 ± 1.084
GluMet: 2.207 ± 0.466
GluAsn: 2.942 ± 0.786
GluPro: 2.522 ± 0.51
GluGln: 2.207 ± 0.478
GluArg: 3.152 ± 0.564
GluSer: 4.413 ± 0.693
GluThr: 2.942 ± 0.583
GluVal: 2.837 ± 0.513
GluTrp: 0.42 ± 0.233
GluTyr: 2.102 ± 0.542
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.837 ± 0.507
PheCys: 0.525 ± 0.216
PheAsp: 1.576 ± 0.469
PheGlu: 1.891 ± 0.419
PhePhe: 1.051 ± 0.374
PheGly: 1.996 ± 0.438
PheHis: 0.736 ± 0.222
PheIle: 1.261 ± 0.515
PheLys: 2.417 ± 0.576
PheLeu: 1.996 ± 0.496
PheMet: 0.841 ± 0.262
PheAsn: 1.996 ± 0.405
PhePro: 1.156 ± 0.359
PheGln: 1.261 ± 0.39
PheArg: 1.576 ± 0.446
PheSer: 2.942 ± 0.611
PheThr: 2.417 ± 0.593
PheVal: 1.471 ± 0.4
PheTrp: 0.946 ± 0.294
PheTyr: 0.736 ± 0.28
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.515 ± 0.933
GlyCys: 1.471 ± 0.424
GlyAsp: 5.464 ± 0.858
GlyGlu: 4.939 ± 0.801
GlyPhe: 2.732 ± 0.563
GlyGly: 6.199 ± 1.536
GlyHis: 0.946 ± 0.296
GlyIle: 4.518 ± 0.844
GlyLys: 4.833 ± 0.68
GlyLeu: 4.413 ± 0.595
GlyMet: 1.996 ± 0.523
GlyAsn: 2.942 ± 0.598
GlyPro: 0.42 ± 0.202
GlyGln: 1.996 ± 0.462
GlyArg: 3.678 ± 0.739
GlySer: 3.152 ± 0.64
GlyThr: 4.413 ± 0.794
GlyVal: 5.884 ± 1.039
GlyTrp: 1.156 ± 0.345
GlyTyr: 2.207 ± 0.552
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.891 ± 0.661
HisCys: 0.525 ± 0.256
HisAsp: 0.946 ± 0.295
HisGlu: 1.051 ± 0.268
HisPhe: 0.63 ± 0.346
HisGly: 1.261 ± 0.296
HisHis: 0.315 ± 0.158
HisIle: 0.841 ± 0.265
HisLys: 0.946 ± 0.337
HisLeu: 1.366 ± 0.353
HisMet: 0.736 ± 0.226
HisAsn: 1.051 ± 0.338
HisPro: 1.261 ± 0.386
HisGln: 1.156 ± 0.278
HisArg: 1.366 ± 0.381
HisSer: 0.946 ± 0.283
HisThr: 1.156 ± 0.31
HisVal: 0.841 ± 0.279
HisTrp: 0.315 ± 0.246
HisTyr: 0.63 ± 0.268
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.149 ± 0.592
IleCys: 0.42 ± 0.159
IleAsp: 5.464 ± 0.782
IleGlu: 2.522 ± 0.448
IlePhe: 2.207 ± 0.56
IleGly: 3.678 ± 0.548
IleHis: 0.736 ± 0.301
IleIle: 2.207 ± 0.633
IleLys: 2.102 ± 0.544
IleLeu: 2.102 ± 0.463
IleMet: 1.156 ± 0.34
IleAsn: 2.627 ± 0.476
IlePro: 2.312 ± 0.607
IleGln: 1.681 ± 0.431
IleArg: 5.359 ± 0.985
IleSer: 4.518 ± 0.844
IleThr: 4.308 ± 0.654
IleVal: 3.783 ± 0.663
IleTrp: 0.525 ± 0.247
IleTyr: 0.736 ± 0.301
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.41 ± 0.839
LysCys: 0.315 ± 0.161
LysAsp: 2.102 ± 0.393
LysGlu: 3.678 ± 0.643
LysPhe: 1.366 ± 0.338
LysGly: 3.362 ± 0.519
LysHis: 1.471 ± 0.401
LysIle: 1.681 ± 0.43
LysLys: 3.257 ± 0.574
LysLeu: 5.674 ± 0.841
LysMet: 0.946 ± 0.251
LysAsn: 2.942 ± 0.6
LysPro: 3.257 ± 0.696
LysGln: 2.207 ± 0.421
LysArg: 3.678 ± 0.49
LysSer: 2.837 ± 0.41
LysThr: 3.993 ± 0.602
LysVal: 3.467 ± 0.685
LysTrp: 1.366 ± 0.418
LysTyr: 2.102 ± 0.525
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.718 ± 0.859
LeuCys: 1.051 ± 0.358
LeuAsp: 6.305 ± 0.745
LeuGlu: 5.359 ± 0.69
LeuPhe: 2.732 ± 0.669
LeuGly: 5.884 ± 0.939
LeuHis: 1.891 ± 0.357
LeuIle: 4.518 ± 0.546
LeuLys: 4.939 ± 0.627
LeuLeu: 6.305 ± 0.667
LeuMet: 2.417 ± 0.651
LeuAsn: 3.467 ± 0.449
LeuPro: 3.888 ± 0.693
LeuGln: 3.993 ± 0.544
LeuArg: 5.989 ± 0.86
LeuSer: 7.355 ± 0.881
LeuThr: 7.565 ± 1.508
LeuVal: 4.098 ± 0.566
LeuTrp: 1.471 ± 0.388
LeuTyr: 2.522 ± 0.468
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.152 ± 0.547
MetCys: 0.21 ± 0.118
MetAsp: 0.841 ± 0.267
MetGlu: 1.261 ± 0.315
MetPhe: 1.051 ± 0.376
MetGly: 0.525 ± 0.217
MetHis: 0.525 ± 0.25
MetIle: 0.63 ± 0.249
MetLys: 1.261 ± 0.315
MetLeu: 2.732 ± 0.597
MetMet: 0.525 ± 0.252
MetAsn: 1.471 ± 0.358
MetPro: 0.63 ± 0.223
MetGln: 1.681 ± 0.527
MetArg: 1.681 ± 0.475
MetSer: 2.312 ± 0.443
MetThr: 2.312 ± 0.43
MetVal: 1.786 ± 0.4
MetTrp: 0.105 ± 0.091
MetTyr: 0.525 ± 0.225
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.783 ± 0.84
AsnCys: 0.42 ± 0.262
AsnAsp: 1.261 ± 0.311
AsnGlu: 2.312 ± 0.459
AsnPhe: 1.261 ± 0.334
AsnGly: 4.518 ± 0.872
AsnHis: 0.525 ± 0.252
AsnIle: 3.678 ± 0.672
AsnLys: 2.417 ± 0.544
AsnLeu: 2.837 ± 0.587
AsnMet: 1.051 ± 0.343
AsnAsn: 1.996 ± 0.588
AsnPro: 2.102 ± 0.498
AsnGln: 1.051 ± 0.307
AsnArg: 2.522 ± 0.501
AsnSer: 2.312 ± 0.398
AsnThr: 2.417 ± 0.573
AsnVal: 2.837 ± 0.385
AsnTrp: 0.525 ± 0.245
AsnTyr: 1.156 ± 0.28
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.833 ± 0.899
ProCys: 0.21 ± 0.167
ProAsp: 3.783 ± 0.738
ProGlu: 3.783 ± 0.478
ProPhe: 0.946 ± 0.426
ProGly: 2.312 ± 0.626
ProHis: 1.261 ± 0.495
ProIle: 2.102 ± 0.442
ProLys: 1.786 ± 0.512
ProLeu: 4.098 ± 0.659
ProMet: 0.21 ± 0.151
ProAsn: 1.366 ± 0.467
ProPro: 1.891 ± 0.448
ProGln: 1.156 ± 0.425
ProArg: 2.522 ± 0.591
ProSer: 2.207 ± 0.54
ProThr: 1.051 ± 0.31
ProVal: 4.098 ± 0.709
ProTrp: 0.42 ± 0.183
ProTyr: 0.841 ± 0.312
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.518 ± 1.375
GlnCys: 0.105 ± 0.113
GlnAsp: 2.102 ± 0.592
GlnGlu: 1.681 ± 0.503
GlnPhe: 1.681 ± 0.374
GlnGly: 1.786 ± 0.31
GlnHis: 0.63 ± 0.275
GlnIle: 2.417 ± 0.598
GlnLys: 3.362 ± 0.53
GlnLeu: 4.413 ± 0.804
GlnMet: 0.525 ± 0.254
GlnAsn: 1.261 ± 0.465
GlnPro: 1.051 ± 0.278
GlnGln: 2.207 ± 0.788
GlnArg: 3.257 ± 0.482
GlnSer: 3.047 ± 0.591
GlnThr: 2.207 ± 0.506
GlnVal: 1.261 ± 0.304
GlnTrp: 0.736 ± 0.324
GlnTyr: 1.051 ± 0.395
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.359 ± 0.698
ArgCys: 0.525 ± 0.233
ArgAsp: 2.942 ± 0.453
ArgGlu: 4.413 ± 0.727
ArgPhe: 1.576 ± 0.454
ArgGly: 3.678 ± 0.921
ArgHis: 1.996 ± 0.462
ArgIle: 3.362 ± 0.719
ArgLys: 4.098 ± 0.696
ArgLeu: 7.46 ± 0.943
ArgMet: 1.576 ± 0.369
ArgAsn: 2.417 ± 0.453
ArgPro: 1.996 ± 0.497
ArgGln: 2.732 ± 0.692
ArgArg: 5.254 ± 0.783
ArgSer: 2.522 ± 0.395
ArgThr: 3.257 ± 0.546
ArgVal: 4.833 ± 0.734
ArgTrp: 0.946 ± 0.302
ArgTyr: 1.576 ± 0.373
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.25 ± 1.071
SerCys: 0.525 ± 0.183
SerAsp: 3.888 ± 0.55
SerGlu: 4.098 ± 0.56
SerPhe: 1.681 ± 0.404
SerGly: 5.044 ± 0.864
SerHis: 1.156 ± 0.463
SerIle: 2.102 ± 0.447
SerLys: 3.573 ± 0.712
SerLeu: 5.989 ± 0.757
SerMet: 1.576 ± 0.436
SerAsn: 3.047 ± 0.721
SerPro: 2.312 ± 0.527
SerGln: 2.627 ± 0.451
SerArg: 3.888 ± 0.669
SerSer: 1.996 ± 0.562
SerThr: 3.467 ± 0.463
SerVal: 4.939 ± 0.602
SerTrp: 0.841 ± 0.309
SerTyr: 1.051 ± 0.36
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.881 ± 1.352
ThrCys: 0.736 ± 0.276
ThrAsp: 4.623 ± 0.621
ThrGlu: 2.417 ± 0.56
ThrPhe: 1.366 ± 0.458
ThrGly: 5.674 ± 0.681
ThrHis: 0.736 ± 0.286
ThrIle: 3.678 ± 0.432
ThrLys: 2.627 ± 0.566
ThrLeu: 9.142 ± 1.006
ThrMet: 1.891 ± 0.428
ThrAsn: 1.471 ± 0.313
ThrPro: 3.257 ± 0.486
ThrGln: 2.207 ± 0.355
ThrArg: 3.257 ± 0.583
ThrSer: 3.152 ± 0.619
ThrThr: 4.308 ± 0.884
ThrVal: 5.044 ± 0.713
ThrTrp: 1.156 ± 0.337
ThrTyr: 1.051 ± 0.276
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.515 ± 0.65
ValCys: 0.63 ± 0.224
ValAsp: 3.678 ± 0.539
ValGlu: 4.518 ± 0.819
ValPhe: 2.417 ± 0.474
ValGly: 3.783 ± 0.825
ValHis: 0.525 ± 0.224
ValIle: 3.888 ± 0.785
ValLys: 3.678 ± 0.64
ValLeu: 4.728 ± 0.565
ValMet: 2.207 ± 0.489
ValAsn: 3.047 ± 0.549
ValPro: 2.522 ± 0.521
ValGln: 2.522 ± 0.593
ValArg: 3.047 ± 0.539
ValSer: 4.728 ± 0.612
ValThr: 5.779 ± 0.579
ValVal: 3.993 ± 0.747
ValTrp: 1.156 ± 0.315
ValTyr: 1.681 ± 0.401
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.471 ± 0.369
TrpCys: 0.0 ± 0.0
TrpAsp: 0.841 ± 0.319
TrpGlu: 1.051 ± 0.269
TrpPhe: 0.525 ± 0.264
TrpGly: 0.42 ± 0.228
TrpHis: 1.051 ± 0.284
TrpIle: 0.736 ± 0.234
TrpLys: 0.525 ± 0.27
TrpLeu: 3.047 ± 0.662
TrpMet: 0.315 ± 0.195
TrpAsn: 0.315 ± 0.187
TrpPro: 1.051 ± 0.386
TrpGln: 0.525 ± 0.203
TrpArg: 1.576 ± 0.396
TrpSer: 0.63 ± 0.301
TrpThr: 0.315 ± 0.185
TrpVal: 0.42 ± 0.251
TrpTrp: 0.315 ± 0.169
TrpTyr: 0.525 ± 0.201
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.102 ± 0.484
TyrCys: 0.105 ± 0.099
TyrAsp: 1.786 ± 0.395
TyrGlu: 1.576 ± 0.366
TyrPhe: 0.841 ± 0.401
TyrGly: 1.681 ± 0.406
TyrHis: 0.525 ± 0.257
TyrIle: 1.786 ± 0.374
TyrLys: 0.63 ± 0.237
TyrLeu: 2.102 ± 0.381
TyrMet: 0.525 ± 0.225
TyrAsn: 0.63 ± 0.244
TyrPro: 1.261 ± 0.423
TyrGln: 1.576 ± 0.402
TyrArg: 1.996 ± 0.462
TyrSer: 1.576 ± 0.458
TyrThr: 1.681 ± 0.394
TyrVal: 2.207 ± 0.547
TyrTrp: 0.42 ± 0.166
TyrTyr: 0.42 ± 0.196
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 42 proteins (9518 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
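A protein-level bootstrap of this kind could look roughly like the following Python sketch. It is an illustration only, not the database's code: the function names, the fixed random seed, the toy sequences, the normalisation by pair count and the use of the standard deviation of the resampled estimates as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement,
    so residues within one protein are never shuffled or split.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the PsP3 proteome
toy = ["MKALAA", "ALAK", "GGALSA", "AAAA"]
point = pair_per_mille(toy, "AL")
error = bootstrap_error(toy, "AL")
print(f"AL: {point:.3f} ± {error:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps the dependence between neighbouring residues within a protein intact, which is why the spread reflects protein-to-protein variation.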

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski