Amino acid dipeptide frequency for Salmonella virus STSR3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
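As an illustration, the counting step can be sketched in Python. This is a minimal sketch, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the use of one-letter codes, the overlapping-window counting, and the normalization by the total number of dipeptides are all assumptions, since the page does not state how the values were normalized.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping pairs of adjacent residues,
    and frequencies are normalized by the total number of such pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Pairs are taken within one protein only; they never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the STSR3 proteome).
toy = ["MKLLA", "ALLK"]
freqs = dipeptide_per_mille(toy)
```

In the toy input, "LL" occurs twice among seven dipeptides, and no "AA" pair is counted even though the first sequence ends and the second begins with "A".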

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
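The uniform expectation and the over/under labelling can be sketched as follows. The threshold rule here is an assumption: the page does not say whether the colouring compares each value against the uniform 2.5‰ baseline alone or also takes the error estimate into account, and the name `classify` is hypothetical.

```python
ALPHABET_SIZE = 20  # standard amino acids; use 21 to include Xaa

# Uniform expectation: 1 / 400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / ALPHABET_SIZE ** 2


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


# A few values copied from the table on this page.
examples = {"LeuLeu": 10.289, "AlaAla": 6.396, "TrpTrp": 0.139, "GlyPro": 0.278}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this rule LeuLeu and AlaAla come out as overrepresented, while TrpTrp and GlyPro come out as underrepresented.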

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
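The unit conversion is simple arithmetic; a short sketch is given below. The helper names are hypothetical and only illustrate the scaling.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# LeuLeu is listed as 10.289 per mille in the table.
leu_leu_fraction = per_mille_to_fraction(10.289)
leu_leu_percent = per_mille_to_percent(10.289)
```

For LeuLeu this gives a fraction of about 0.0103, or roughly 1.03%, compared with the 0.25% expected under a uniform distribution.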

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.396 ± 1.028
AlaCys: 0.695 ± 0.232
AlaAsp: 3.059 ± 0.382
AlaGlu: 4.171 ± 0.556
AlaPhe: 3.615 ± 0.543
AlaGly: 4.588 ± 0.75
AlaHis: 0.834 ± 0.192
AlaIle: 5.492 ± 0.58
AlaLys: 4.936 ± 0.568
AlaLeu: 9.455 ± 0.787
AlaMet: 2.225 ± 0.433
AlaAsn: 3.198 ± 0.478
AlaPro: 1.738 ± 0.301
AlaGln: 2.642 ± 0.457
AlaArg: 5.909 ± 0.785
AlaSer: 5.145 ± 0.616
AlaThr: 4.032 ± 0.608
AlaVal: 5.075 ± 0.684
AlaTrp: 0.834 ± 0.288
AlaTyr: 2.781 ± 0.413
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.877 ± 0.305
CysCys: 0.765 ± 0.31
CysAsp: 0.904 ± 0.287
CysGlu: 0.695 ± 0.2
CysPhe: 0.904 ± 0.258
CysGly: 1.46 ± 0.314
CysHis: 0.348 ± 0.158
CysIle: 1.043 ± 0.244
CysLys: 1.043 ± 0.274
CysLeu: 2.016 ± 0.342
CysMet: 0.417 ± 0.169
CysAsn: 1.46 ± 0.338
CysPro: 0.626 ± 0.21
CysGln: 0.278 ± 0.125
CysArg: 1.043 ± 0.286
CysSer: 1.947 ± 0.412
CysThr: 1.043 ± 0.261
CysVal: 1.182 ± 0.318
CysTrp: 0.765 ± 0.199
CysTyr: 1.182 ± 0.265
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.92 ± 0.438
AspCys: 0.834 ± 0.252
AspAsp: 3.337 ± 0.53
AspGlu: 3.337 ± 0.581
AspPhe: 2.086 ± 0.457
AspGly: 4.588 ± 0.689
AspHis: 0.626 ± 0.18
AspIle: 2.642 ± 0.45
AspLys: 3.128 ± 0.474
AspLeu: 2.433 ± 0.448
AspMet: 1.251 ± 0.277
AspAsn: 1.599 ± 0.317
AspPro: 1.46 ± 0.304
AspGln: 1.182 ± 0.303
AspArg: 3.337 ± 0.505
AspSer: 2.642 ± 0.415
AspThr: 2.642 ± 0.428
AspVal: 2.016 ± 0.343
AspTrp: 1.112 ± 0.28
AspTyr: 1.738 ± 0.364
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.102 ± 0.551
GluCys: 1.529 ± 0.281
GluAsp: 2.155 ± 0.391
GluGlu: 2.92 ± 0.424
GluPhe: 2.225 ± 0.412
GluGly: 2.781 ± 0.385
GluHis: 1.39 ± 0.424
GluIle: 3.963 ± 0.484
GluLys: 3.476 ± 0.552
GluLeu: 4.727 ± 0.614
GluMet: 2.503 ± 0.546
GluAsn: 2.642 ± 0.336
GluPro: 1.043 ± 0.279
GluGln: 2.711 ± 0.401
GluArg: 2.711 ± 0.484
GluSer: 3.337 ± 0.452
GluThr: 2.989 ± 0.523
GluVal: 3.476 ± 0.512
GluTrp: 0.765 ± 0.244
GluTyr: 2.016 ± 0.361
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.642 ± 0.392
PheCys: 0.834 ± 0.213
PheAsp: 1.947 ± 0.407
PheGlu: 1.877 ± 0.384
PhePhe: 1.321 ± 0.245
PheGly: 2.572 ± 0.41
PheHis: 0.695 ± 0.182
PheIle: 2.294 ± 0.397
PheLys: 2.989 ± 0.476
PheLeu: 3.268 ± 0.499
PheMet: 1.321 ± 0.322
PheAsn: 1.39 ± 0.342
PhePro: 1.39 ± 0.276
PheGln: 1.112 ± 0.273
PheArg: 4.31 ± 0.553
PheSer: 2.503 ± 0.381
PheThr: 2.642 ± 0.495
PheVal: 1.947 ± 0.372
PheTrp: 1.043 ± 0.304
PheTyr: 0.973 ± 0.28
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.797 ± 0.758
GlyCys: 1.529 ± 0.308
GlyAsp: 3.754 ± 0.558
GlyGlu: 4.032 ± 0.467
GlyPhe: 2.781 ± 0.409
GlyGly: 3.685 ± 0.659
GlyHis: 1.39 ± 0.362
GlyIle: 3.824 ± 0.501
GlyLys: 4.449 ± 0.636
GlyLeu: 4.727 ± 0.626
GlyMet: 1.877 ± 0.442
GlyAsn: 3.128 ± 0.49
GlyPro: 0.278 ± 0.127
GlyGln: 1.39 ± 0.45
GlyArg: 4.032 ± 0.695
GlySer: 3.476 ± 0.506
GlyThr: 2.572 ± 0.449
GlyVal: 4.519 ± 0.541
GlyTrp: 1.251 ± 0.322
GlyTyr: 2.086 ± 0.403
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.599 ± 0.353
HisCys: 0.626 ± 0.199
HisAsp: 0.904 ± 0.215
HisGlu: 1.738 ± 0.413
HisPhe: 1.112 ± 0.325
HisGly: 1.529 ± 0.377
HisHis: 1.182 ± 0.352
HisIle: 0.765 ± 0.293
HisLys: 1.529 ± 0.326
HisLeu: 1.808 ± 0.373
HisMet: 0.348 ± 0.135
HisAsn: 1.39 ± 0.317
HisPro: 0.626 ± 0.271
HisGln: 0.973 ± 0.215
HisArg: 1.599 ± 0.256
HisSer: 1.529 ± 0.389
HisThr: 1.529 ± 0.311
HisVal: 0.904 ± 0.268
HisTrp: 0.695 ± 0.265
HisTyr: 0.765 ± 0.202
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.813 ± 0.759
IleCys: 1.669 ± 0.352
IleAsp: 3.476 ± 0.49
IleGlu: 3.337 ± 0.489
IlePhe: 2.086 ± 0.313
IleGly: 3.337 ± 0.507
IleHis: 1.182 ± 0.34
IleIle: 2.92 ± 0.538
IleLys: 4.171 ± 0.559
IleLeu: 4.449 ± 0.596
IleMet: 1.599 ± 0.351
IleAsn: 2.642 ± 0.458
IlePro: 2.433 ± 0.398
IleGln: 2.364 ± 0.353
IleArg: 5.006 ± 0.582
IleSer: 5.353 ± 0.614
IleThr: 4.658 ± 0.577
IleVal: 3.337 ± 0.573
IleTrp: 1.112 ± 0.267
IleTyr: 2.016 ± 0.351
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.423 ± 0.58
LysCys: 0.695 ± 0.254
LysAsp: 3.198 ± 0.617
LysGlu: 3.476 ± 0.571
LysPhe: 2.503 ± 0.387
LysGly: 3.337 ± 0.513
LysHis: 2.016 ± 0.416
LysIle: 4.936 ± 0.764
LysLys: 2.989 ± 0.479
LysLeu: 7.161 ± 0.673
LysMet: 2.572 ± 0.406
LysAsn: 2.572 ± 0.339
LysPro: 2.85 ± 0.464
LysGln: 3.059 ± 0.484
LysArg: 5.84 ± 0.827
LysSer: 5.145 ± 0.525
LysThr: 4.449 ± 0.641
LysVal: 3.754 ± 0.462
LysTrp: 1.321 ± 0.27
LysTyr: 2.016 ± 0.404
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.065 ± 0.804
LeuCys: 1.39 ± 0.312
LeuAsp: 3.407 ± 0.551
LeuGlu: 3.685 ± 0.543
LeuPhe: 2.781 ± 0.422
LeuGly: 3.546 ± 0.453
LeuHis: 1.877 ± 0.338
LeuIle: 5.423 ± 0.613
LeuLys: 7.578 ± 0.861
LeuLeu: 10.289 ± 0.887
LeuMet: 2.989 ± 0.411
LeuAsn: 3.963 ± 0.552
LeuPro: 3.893 ± 0.607
LeuGln: 3.407 ± 0.68
LeuArg: 7.369 ± 0.911
LeuSer: 5.84 ± 0.665
LeuThr: 7.23 ± 0.781
LeuVal: 4.936 ± 0.773
LeuTrp: 0.834 ± 0.213
LeuTyr: 2.781 ± 0.451
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.294 ± 0.401
MetCys: 0.487 ± 0.19
MetAsp: 1.182 ± 0.253
MetGlu: 1.669 ± 0.368
MetPhe: 1.251 ± 0.288
MetGly: 1.321 ± 0.295
MetHis: 0.695 ± 0.221
MetIle: 2.433 ± 0.418
MetLys: 2.364 ± 0.434
MetLeu: 2.086 ± 0.456
MetMet: 0.834 ± 0.217
MetAsn: 1.251 ± 0.304
MetPro: 0.487 ± 0.202
MetGln: 1.112 ± 0.241
MetArg: 2.572 ± 0.387
MetSer: 2.016 ± 0.347
MetThr: 1.947 ± 0.399
MetVal: 1.738 ± 0.324
MetTrp: 0.278 ± 0.148
MetTyr: 0.556 ± 0.154
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.781 ± 0.487
AsnCys: 1.043 ± 0.292
AsnAsp: 1.808 ± 0.351
AsnGlu: 2.364 ± 0.426
AsnPhe: 1.46 ± 0.315
AsnGly: 3.198 ± 0.444
AsnHis: 0.973 ± 0.233
AsnIle: 2.016 ± 0.432
AsnLys: 3.059 ± 0.493
AsnLeu: 3.407 ± 0.441
AsnMet: 1.182 ± 0.261
AsnAsn: 1.669 ± 0.35
AsnPro: 1.669 ± 0.385
AsnGln: 2.225 ± 0.345
AsnArg: 3.546 ± 0.572
AsnSer: 2.989 ± 0.546
AsnThr: 2.155 ± 0.322
AsnVal: 3.407 ± 0.465
AsnTrp: 0.556 ± 0.186
AsnTyr: 1.599 ± 0.305
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.503 ± 0.445
ProCys: 1.182 ± 0.352
ProAsp: 1.599 ± 0.471
ProGlu: 2.85 ± 0.518
ProPhe: 1.112 ± 0.257
ProGly: 1.321 ± 0.34
ProHis: 0.904 ± 0.248
ProIle: 2.433 ± 0.433
ProLys: 1.182 ± 0.233
ProLeu: 2.92 ± 0.503
ProMet: 0.626 ± 0.199
ProAsn: 1.112 ± 0.31
ProPro: 1.251 ± 0.263
ProGln: 0.765 ± 0.231
ProArg: 1.947 ± 0.415
ProSer: 1.529 ± 0.325
ProThr: 1.46 ± 0.36
ProVal: 2.433 ± 0.497
ProTrp: 0.487 ± 0.168
ProTyr: 1.043 ± 0.258
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.503 ± 0.396
GlnCys: 0.834 ± 0.191
GlnAsp: 1.39 ± 0.265
GlnGlu: 2.294 ± 0.421
GlnPhe: 1.182 ± 0.341
GlnGly: 1.738 ± 0.314
GlnHis: 0.904 ± 0.238
GlnIle: 2.086 ± 0.347
GlnLys: 2.781 ± 0.461
GlnLeu: 4.102 ± 0.682
GlnMet: 0.904 ± 0.235
GlnAsn: 1.39 ± 0.258
GlnPro: 1.529 ± 0.342
GlnGln: 2.364 ± 0.441
GlnArg: 3.407 ± 0.672
GlnSer: 2.433 ± 0.437
GlnThr: 2.016 ± 0.453
GlnVal: 1.738 ± 0.283
GlnTrp: 0.278 ± 0.152
GlnTyr: 1.669 ± 0.302
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.449 ± 0.512
ArgCys: 1.182 ± 0.275
ArgAsp: 1.947 ± 0.32
ArgGlu: 3.893 ± 0.43
ArgPhe: 3.407 ± 0.446
ArgGly: 2.989 ± 0.484
ArgHis: 1.599 ± 0.394
ArgIle: 4.519 ± 0.487
ArgLys: 7.23 ± 1.125
ArgLeu: 9.316 ± 1.033
ArgMet: 2.572 ± 0.406
ArgAsn: 3.268 ± 0.509
ArgPro: 1.599 ± 0.307
ArgGln: 2.711 ± 0.438
ArgArg: 9.177 ± 1.214
ArgSer: 6.813 ± 0.872
ArgThr: 2.781 ± 0.644
ArgVal: 5.353 ± 0.629
ArgTrp: 1.599 ± 0.329
ArgTyr: 2.781 ± 0.436
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.284 ± 0.541
SerCys: 1.321 ± 0.327
SerAsp: 3.546 ± 0.56
SerGlu: 3.893 ± 0.462
SerPhe: 3.198 ± 0.523
SerGly: 6.326 ± 0.643
SerHis: 2.016 ± 0.394
SerIle: 4.38 ± 0.683
SerLys: 3.893 ± 0.624
SerLeu: 6.605 ± 1.134
SerMet: 1.112 ± 0.227
SerAsn: 2.294 ± 0.379
SerPro: 2.294 ± 0.374
SerGln: 2.642 ± 0.521
SerArg: 5.075 ± 0.642
SerSer: 3.476 ± 0.462
SerThr: 3.754 ± 0.542
SerVal: 4.727 ± 0.641
SerTrp: 0.904 ± 0.235
SerTyr: 2.225 ± 0.554
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.588 ± 0.547
ThrCys: 1.321 ± 0.285
ThrAsp: 2.155 ± 0.353
ThrGlu: 2.711 ± 0.376
ThrPhe: 2.572 ± 0.389
ThrGly: 4.936 ± 0.648
ThrHis: 1.46 ± 0.293
ThrIle: 3.824 ± 0.552
ThrLys: 4.31 ± 0.467
ThrLeu: 5.423 ± 0.663
ThrMet: 1.669 ± 0.295
ThrAsn: 2.92 ± 0.407
ThrPro: 2.016 ± 0.274
ThrGln: 2.294 ± 0.467
ThrArg: 3.754 ± 0.504
ThrSer: 3.128 ± 0.511
ThrThr: 3.615 ± 0.646
ThrVal: 3.685 ± 0.41
ThrTrp: 0.626 ± 0.173
ThrTyr: 1.738 ± 0.338
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.893 ± 0.512
ValCys: 2.155 ± 0.388
ValAsp: 2.989 ± 0.446
ValGlu: 2.711 ± 0.506
ValPhe: 1.669 ± 0.335
ValGly: 3.337 ± 0.449
ValHis: 1.182 ± 0.301
ValIle: 4.867 ± 0.542
ValLys: 5.492 ± 0.804
ValLeu: 4.032 ± 0.569
ValMet: 1.669 ± 0.301
ValAsn: 3.128 ± 0.354
ValPro: 1.738 ± 0.35
ValGln: 2.016 ± 0.347
ValArg: 3.754 ± 0.544
ValSer: 5.214 ± 0.653
ValThr: 4.449 ± 0.484
ValVal: 3.059 ± 0.498
ValTrp: 0.904 ± 0.225
ValTyr: 2.503 ± 0.397
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.973 ± 0.252
TrpCys: 0.139 ± 0.093
TrpAsp: 0.626 ± 0.191
TrpGlu: 0.278 ± 0.135
TrpPhe: 0.973 ± 0.23
TrpGly: 0.626 ± 0.15
TrpHis: 0.417 ± 0.162
TrpIle: 1.669 ± 0.356
TrpLys: 0.904 ± 0.295
TrpLeu: 1.529 ± 0.317
TrpMet: 0.348 ± 0.166
TrpAsn: 0.556 ± 0.277
TrpPro: 0.556 ± 0.141
TrpGln: 0.556 ± 0.209
TrpArg: 2.433 ± 0.38
TrpSer: 1.251 ± 0.352
TrpThr: 0.695 ± 0.234
TrpVal: 1.112 ± 0.28
TrpTrp: 0.139 ± 0.086
TrpTyr: 0.417 ± 0.136
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.92 ± 0.524
TyrCys: 0.765 ± 0.155
TyrAsp: 1.529 ± 0.327
TyrGlu: 1.599 ± 0.333
TyrPhe: 0.834 ± 0.227
TyrGly: 2.364 ± 0.387
TyrHis: 1.46 ± 0.342
TyrIle: 2.433 ± 0.554
TyrLys: 1.808 ± 0.355
TyrLeu: 1.529 ± 0.324
TyrMet: 0.487 ± 0.177
TyrAsn: 1.529 ± 0.305
TyrPro: 1.182 ± 0.32
TyrGln: 1.738 ± 0.427
TyrArg: 2.364 ± 0.385
TyrSer: 3.198 ± 0.469
TyrThr: 2.086 ± 0.355
TyrVal: 2.364 ± 0.316
TyrTrp: 0.695 ± 0.197
TyrTyr: 0.834 ± 0.224
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 85 proteins (14385 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
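A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is only a sketch under stated assumptions. The function names are hypothetical, and the page does not say which spread measure is shown after the ± sign; the sample standard deviation is assumed here.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, pair):
    """Per mille frequency of one dipeptide (one-letter codes assumed)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
toy = ["MKLLA", "ALLK", "MGGSW", "MLLLLR"]
err = bootstrap_error(toy, "LL")
```

Resampling whole proteins rather than individual dipeptides keeps residues from the same protein together, so the error reflects variation between proteins.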

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski