Amino acid dipepetide frequency for Salmonella virus Chi

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.919AlaAla: 11.919 ± 1.361
0.738AlaCys: 0.738 ± 0.219
6.434AlaAsp: 6.434 ± 0.511
6.645AlaGlu: 6.645 ± 0.606
3.164AlaPhe: 3.164 ± 0.326
7.805AlaGly: 7.805 ± 0.73
1.477AlaHis: 1.477 ± 0.305
5.274AlaIle: 5.274 ± 0.65
5.59AlaLys: 5.59 ± 0.735
7.753AlaLeu: 7.753 ± 0.908
3.903AlaMet: 3.903 ± 0.53
3.692AlaAsn: 3.692 ± 0.386
3.481AlaPro: 3.481 ± 0.39
3.639AlaGln: 3.639 ± 0.653
5.801AlaArg: 5.801 ± 0.71
5.063AlaSer: 5.063 ± 0.605
5.749AlaThr: 5.749 ± 0.759
6.54AlaVal: 6.54 ± 0.751
1.16AlaTrp: 1.16 ± 0.221
3.006AlaTyr: 3.006 ± 0.378
0.0AlaXaa: 0.0 ± 0.0
Cys
0.633CysAla: 0.633 ± 0.19
0.105CysCys: 0.105 ± 0.09
0.791CysAsp: 0.791 ± 0.211
0.686CysGlu: 0.686 ± 0.201
0.369CysPhe: 0.369 ± 0.151
0.844CysGly: 0.844 ± 0.205
0.264CysHis: 0.264 ± 0.129
0.422CysIle: 0.422 ± 0.121
0.369CysLys: 0.369 ± 0.169
0.686CysLeu: 0.686 ± 0.172
0.105CysMet: 0.105 ± 0.068
0.316CysAsn: 0.316 ± 0.135
0.633CysPro: 0.633 ± 0.223
0.211CysGln: 0.211 ± 0.095
0.738CysArg: 0.738 ± 0.232
0.527CysSer: 0.527 ± 0.186
0.686CysThr: 0.686 ± 0.181
0.475CysVal: 0.475 ± 0.166
0.105CysTrp: 0.105 ± 0.093
0.316CysTyr: 0.316 ± 0.144
0.0CysXaa: 0.0 ± 0.0
Asp
6.698AspAla: 6.698 ± 0.485
0.316AspCys: 0.316 ± 0.16
4.852AspAsp: 4.852 ± 0.55
5.59AspGlu: 5.59 ± 0.642
2.584AspPhe: 2.584 ± 0.356
5.696AspGly: 5.696 ± 0.647
1.16AspHis: 1.16 ± 0.289
3.112AspIle: 3.112 ± 0.351
3.164AspLys: 3.164 ± 0.501
5.01AspLeu: 5.01 ± 0.501
2.373AspMet: 2.373 ± 0.28
2.268AspAsn: 2.268 ± 0.397
3.375AspPro: 3.375 ± 0.453
2.215AspGln: 2.215 ± 0.361
3.534AspArg: 3.534 ± 0.422
2.162AspSer: 2.162 ± 0.33
3.692AspThr: 3.692 ± 0.5
5.221AspVal: 5.221 ± 0.479
1.951AspTrp: 1.951 ± 0.395
1.951AspTyr: 1.951 ± 0.3
0.0AspXaa: 0.0 ± 0.0
Glu
5.696GluAla: 5.696 ± 0.629
0.527GluCys: 0.527 ± 0.187
3.797GluAsp: 3.797 ± 0.45
4.061GluGlu: 4.061 ± 0.591
1.951GluPhe: 1.951 ± 0.39
4.43GluGly: 4.43 ± 0.483
1.266GluHis: 1.266 ± 0.299
4.061GluIle: 4.061 ± 0.519
2.901GluLys: 2.901 ± 0.46
4.008GluLeu: 4.008 ± 0.556
2.004GluMet: 2.004 ± 0.34
2.637GluAsn: 2.637 ± 0.465
2.795GluPro: 2.795 ± 0.637
2.795GluGln: 2.795 ± 0.44
3.481GluArg: 3.481 ± 0.471
3.27GluSer: 3.27 ± 0.493
3.217GluThr: 3.217 ± 0.384
5.116GluVal: 5.116 ± 0.547
1.529GluTrp: 1.529 ± 0.223
2.004GluTyr: 2.004 ± 0.343
0.0GluXaa: 0.0 ± 0.0
Phe
3.639PheAla: 3.639 ± 0.433
0.475PheCys: 0.475 ± 0.154
3.112PheAsp: 3.112 ± 0.476
2.057PheGlu: 2.057 ± 0.375
1.16PhePhe: 1.16 ± 0.25
3.164PheGly: 3.164 ± 0.488
0.527PheHis: 0.527 ± 0.178
2.268PheIle: 2.268 ± 0.323
1.951PheLys: 1.951 ± 0.447
1.951PheLeu: 1.951 ± 0.362
0.949PheMet: 0.949 ± 0.262
1.899PheAsn: 1.899 ± 0.321
1.371PhePro: 1.371 ± 0.246
1.213PheGln: 1.213 ± 0.233
2.584PheArg: 2.584 ± 0.37
1.846PheSer: 1.846 ± 0.22
2.215PheThr: 2.215 ± 0.32
1.74PheVal: 1.74 ± 0.293
0.158PheTrp: 0.158 ± 0.11
0.791PheTyr: 0.791 ± 0.215
0.0PheXaa: 0.0 ± 0.0
Gly
6.962GlyAla: 6.962 ± 0.694
1.055GlyCys: 1.055 ± 0.237
4.219GlyAsp: 4.219 ± 0.529
5.01GlyGlu: 5.01 ± 0.53
2.479GlyPhe: 2.479 ± 0.394
6.382GlyGly: 6.382 ± 0.751
1.055GlyHis: 1.055 ± 0.32
3.955GlyIle: 3.955 ± 0.373
3.217GlyLys: 3.217 ± 0.459
5.854GlyLeu: 5.854 ± 0.697
1.529GlyMet: 1.529 ± 0.295
2.637GlyAsn: 2.637 ± 0.43
1.899GlyPro: 1.899 ± 0.421
2.795GlyGln: 2.795 ± 0.364
4.694GlyArg: 4.694 ± 0.506
3.692GlySer: 3.692 ± 0.464
5.169GlyThr: 5.169 ± 0.714
4.747GlyVal: 4.747 ± 0.366
1.688GlyTrp: 1.688 ± 0.29
3.375GlyTyr: 3.375 ± 0.549
0.0GlyXaa: 0.0 ± 0.0
His
1.424HisAla: 1.424 ± 0.308
0.316HisCys: 0.316 ± 0.154
1.582HisAsp: 1.582 ± 0.318
0.844HisGlu: 0.844 ± 0.216
0.738HisPhe: 0.738 ± 0.19
1.266HisGly: 1.266 ± 0.195
0.211HisHis: 0.211 ± 0.124
1.108HisIle: 1.108 ± 0.251
0.738HisLys: 0.738 ± 0.238
1.424HisLeu: 1.424 ± 0.3
0.316HisMet: 0.316 ± 0.122
0.844HisAsn: 0.844 ± 0.176
0.949HisPro: 0.949 ± 0.207
0.422HisGln: 0.422 ± 0.146
1.318HisArg: 1.318 ± 0.245
0.527HisSer: 0.527 ± 0.172
0.527HisThr: 0.527 ± 0.168
1.213HisVal: 1.213 ± 0.294
0.422HisTrp: 0.422 ± 0.175
0.897HisTyr: 0.897 ± 0.217
0.0HisXaa: 0.0 ± 0.0
Ile
5.643IleAla: 5.643 ± 0.6
0.58IleCys: 0.58 ± 0.177
3.534IleAsp: 3.534 ± 0.455
4.008IleGlu: 4.008 ± 0.424
1.688IlePhe: 1.688 ± 0.284
4.43IleGly: 4.43 ± 0.472
1.266IleHis: 1.266 ± 0.329
3.006IleIle: 3.006 ± 0.376
3.428IleLys: 3.428 ± 0.58
3.164IleLeu: 3.164 ± 0.412
1.477IleMet: 1.477 ± 0.281
2.584IleAsn: 2.584 ± 0.387
2.479IlePro: 2.479 ± 0.313
1.793IleGln: 1.793 ± 0.332
3.534IleArg: 3.534 ± 0.439
2.795IleSer: 2.795 ± 0.451
3.27IleThr: 3.27 ± 0.41
3.85IleVal: 3.85 ± 0.432
0.686IleTrp: 0.686 ± 0.237
1.529IleTyr: 1.529 ± 0.347
0.0IleXaa: 0.0 ± 0.0
Lys
4.958LysAla: 4.958 ± 0.535
0.475LysCys: 0.475 ± 0.143
3.217LysAsp: 3.217 ± 0.533
3.27LysGlu: 3.27 ± 0.39
1.582LysPhe: 1.582 ± 0.26
3.534LysGly: 3.534 ± 0.445
1.16LysHis: 1.16 ± 0.312
2.637LysIle: 2.637 ± 0.345
3.112LysLys: 3.112 ± 0.476
3.639LysLeu: 3.639 ± 0.36
1.899LysMet: 1.899 ± 0.333
1.846LysAsn: 1.846 ± 0.349
2.637LysPro: 2.637 ± 0.389
1.582LysGln: 1.582 ± 0.329
3.586LysArg: 3.586 ± 0.438
2.426LysSer: 2.426 ± 0.348
3.481LysThr: 3.481 ± 0.463
3.586LysVal: 3.586 ± 0.377
1.108LysTrp: 1.108 ± 0.308
2.057LysTyr: 2.057 ± 0.31
0.0LysXaa: 0.0 ± 0.0
Leu
6.645LeuAla: 6.645 ± 0.865
0.791LeuCys: 0.791 ± 0.213
5.221LeuAsp: 5.221 ± 0.551
4.483LeuGlu: 4.483 ± 0.501
2.321LeuPhe: 2.321 ± 0.357
3.85LeuGly: 3.85 ± 0.553
1.213LeuHis: 1.213 ± 0.271
3.955LeuIle: 3.955 ± 0.429
4.166LeuLys: 4.166 ± 0.367
4.588LeuLeu: 4.588 ± 0.493
1.793LeuMet: 1.793 ± 0.407
3.955LeuAsn: 3.955 ± 0.5
4.219LeuPro: 4.219 ± 0.605
2.479LeuGln: 2.479 ± 0.495
4.483LeuArg: 4.483 ± 0.516
5.643LeuSer: 5.643 ± 0.633
5.169LeuThr: 5.169 ± 0.458
4.536LeuVal: 4.536 ± 0.557
1.16LeuTrp: 1.16 ± 0.221
2.479LeuTyr: 2.479 ± 0.295
0.0LeuXaa: 0.0 ± 0.0
Met
3.534MetAla: 3.534 ± 0.408
0.422MetCys: 0.422 ± 0.144
1.688MetAsp: 1.688 ± 0.299
1.371MetGlu: 1.371 ± 0.286
1.108MetPhe: 1.108 ± 0.223
1.74MetGly: 1.74 ± 0.324
0.422MetHis: 0.422 ± 0.163
1.213MetIle: 1.213 ± 0.263
1.846MetLys: 1.846 ± 0.348
2.11MetLeu: 2.11 ± 0.287
0.686MetMet: 0.686 ± 0.162
1.74MetAsn: 1.74 ± 0.322
1.108MetPro: 1.108 ± 0.247
1.213MetGln: 1.213 ± 0.3
2.215MetArg: 2.215 ± 0.401
2.162MetSer: 2.162 ± 0.427
2.321MetThr: 2.321 ± 0.299
2.057MetVal: 2.057 ± 0.323
0.422MetTrp: 0.422 ± 0.146
1.266MetTyr: 1.266 ± 0.23
0.0MetXaa: 0.0 ± 0.0
Asn
4.799AsnAla: 4.799 ± 0.612
0.316AsnCys: 0.316 ± 0.113
2.321AsnAsp: 2.321 ± 0.369
1.529AsnGlu: 1.529 ± 0.279
1.371AsnPhe: 1.371 ± 0.25
3.534AsnGly: 3.534 ± 0.455
0.738AsnHis: 0.738 ± 0.21
2.215AsnIle: 2.215 ± 0.398
1.793AsnLys: 1.793 ± 0.41
3.006AsnLeu: 3.006 ± 0.438
1.108AsnMet: 1.108 ± 0.23
1.846AsnAsn: 1.846 ± 0.455
3.27AsnPro: 3.27 ± 0.477
1.793AsnGln: 1.793 ± 0.302
2.004AsnArg: 2.004 ± 0.262
2.742AsnSer: 2.742 ± 0.508
2.426AsnThr: 2.426 ± 0.402
3.164AsnVal: 3.164 ± 0.321
0.475AsnTrp: 0.475 ± 0.167
1.793AsnTyr: 1.793 ± 0.299
0.0AsnXaa: 0.0 ± 0.0
Pro
4.377ProAla: 4.377 ± 0.504
0.58ProCys: 0.58 ± 0.191
3.85ProAsp: 3.85 ± 0.584
3.375ProGlu: 3.375 ± 0.446
1.529ProPhe: 1.529 ± 0.296
3.375ProGly: 3.375 ± 0.494
1.055ProHis: 1.055 ± 0.225
2.532ProIle: 2.532 ± 0.345
2.69ProLys: 2.69 ± 0.537
3.639ProLeu: 3.639 ± 0.51
1.424ProMet: 1.424 ± 0.34
2.004ProAsn: 2.004 ± 0.255
2.426ProPro: 2.426 ± 0.733
1.424ProGln: 1.424 ± 0.26
1.688ProArg: 1.688 ± 0.337
2.479ProSer: 2.479 ± 0.399
3.006ProThr: 3.006 ± 0.419
3.164ProVal: 3.164 ± 0.513
0.527ProTrp: 0.527 ± 0.209
1.899ProTyr: 1.899 ± 0.455
0.0ProXaa: 0.0 ± 0.0
Gln
3.428GlnAla: 3.428 ± 0.874
0.211GlnCys: 0.211 ± 0.108
2.321GlnAsp: 2.321 ± 0.338
1.582GlnGlu: 1.582 ± 0.3
2.268GlnPhe: 2.268 ± 0.295
1.688GlnGly: 1.688 ± 0.319
0.897GlnHis: 0.897 ± 0.226
2.69GlnIle: 2.69 ± 0.334
2.215GlnLys: 2.215 ± 0.394
2.479GlnLeu: 2.479 ± 0.429
1.74GlnMet: 1.74 ± 0.316
1.582GlnAsn: 1.582 ± 0.318
1.793GlnPro: 1.793 ± 0.427
1.74GlnGln: 1.74 ± 0.559
2.321GlnArg: 2.321 ± 0.357
2.479GlnSer: 2.479 ± 0.441
2.321GlnThr: 2.321 ± 0.482
2.532GlnVal: 2.532 ± 0.568
0.738GlnTrp: 0.738 ± 0.164
1.529GlnTyr: 1.529 ± 0.303
0.0GlnXaa: 0.0 ± 0.0
Arg
5.907ArgAla: 5.907 ± 0.66
0.527ArgCys: 0.527 ± 0.189
3.903ArgAsp: 3.903 ± 0.478
2.953ArgGlu: 2.953 ± 0.463
2.637ArgPhe: 2.637 ± 0.379
3.586ArgGly: 3.586 ± 0.385
1.16ArgHis: 1.16 ± 0.286
3.534ArgIle: 3.534 ± 0.375
3.164ArgLys: 3.164 ± 0.42
5.379ArgLeu: 5.379 ± 0.637
2.11ArgMet: 2.11 ± 0.401
2.11ArgAsn: 2.11 ± 0.35
2.637ArgPro: 2.637 ± 0.32
2.426ArgGln: 2.426 ± 0.363
3.797ArgArg: 3.797 ± 0.599
2.637ArgSer: 2.637 ± 0.302
3.428ArgThr: 3.428 ± 0.396
4.958ArgVal: 4.958 ± 0.469
0.897ArgTrp: 0.897 ± 0.228
1.74ArgTyr: 1.74 ± 0.365
0.0ArgXaa: 0.0 ± 0.0
Ser
5.01SerAla: 5.01 ± 0.531
0.475SerCys: 0.475 ± 0.144
4.008SerAsp: 4.008 ± 0.433
2.373SerGlu: 2.373 ± 0.337
2.321SerPhe: 2.321 ± 0.421
4.641SerGly: 4.641 ± 0.467
0.475SerHis: 0.475 ± 0.128
3.006SerIle: 3.006 ± 0.339
3.006SerLys: 3.006 ± 0.472
3.955SerLeu: 3.955 ± 0.461
1.477SerMet: 1.477 ± 0.254
2.637SerAsn: 2.637 ± 0.388
2.584SerPro: 2.584 ± 0.327
2.69SerGln: 2.69 ± 0.519
2.795SerArg: 2.795 ± 0.434
2.69SerSer: 2.69 ± 0.404
3.375SerThr: 3.375 ± 0.515
4.325SerVal: 4.325 ± 0.5
0.897SerTrp: 0.897 ± 0.225
1.635SerTyr: 1.635 ± 0.35
0.0SerXaa: 0.0 ± 0.0
Thr
6.223ThrAla: 6.223 ± 0.608
0.475ThrCys: 0.475 ± 0.202
3.955ThrAsp: 3.955 ± 0.454
3.112ThrGlu: 3.112 ± 0.418
1.846ThrPhe: 1.846 ± 0.33
4.114ThrGly: 4.114 ± 0.467
0.897ThrHis: 0.897 ± 0.242
3.428ThrIle: 3.428 ± 0.449
2.953ThrLys: 2.953 ± 0.39
5.907ThrLeu: 5.907 ± 0.517
1.582ThrMet: 1.582 ± 0.313
2.426ThrAsn: 2.426 ± 0.399
3.692ThrPro: 3.692 ± 0.5
2.69ThrGln: 2.69 ± 0.348
3.164ThrArg: 3.164 ± 0.374
3.481ThrSer: 3.481 ± 0.483
4.641ThrThr: 4.641 ± 0.793
5.169ThrVal: 5.169 ± 0.576
1.213ThrTrp: 1.213 ± 0.239
2.321ThrTyr: 2.321 ± 0.36
0.0ThrXaa: 0.0 ± 0.0
Val
6.698ValAla: 6.698 ± 0.601
0.475ValCys: 0.475 ± 0.147
3.903ValAsp: 3.903 ± 0.436
5.379ValGlu: 5.379 ± 0.455
2.373ValPhe: 2.373 ± 0.397
4.641ValGly: 4.641 ± 0.463
0.897ValHis: 0.897 ± 0.197
3.745ValIle: 3.745 ± 0.487
3.481ValLys: 3.481 ± 0.476
5.169ValLeu: 5.169 ± 0.641
2.426ValMet: 2.426 ± 0.315
3.006ValAsn: 3.006 ± 0.405
3.006ValPro: 3.006 ± 0.421
3.059ValGln: 3.059 ± 0.469
4.588ValArg: 4.588 ± 0.552
4.377ValSer: 4.377 ± 0.447
5.01ValThr: 5.01 ± 0.548
5.643ValVal: 5.643 ± 0.573
1.213ValTrp: 1.213 ± 0.23
2.426ValTyr: 2.426 ± 0.385
0.0ValXaa: 0.0 ± 0.0
Trp
1.635TrpAla: 1.635 ± 0.302
0.105TrpCys: 0.105 ± 0.064
1.846TrpAsp: 1.846 ± 0.339
1.108TrpGlu: 1.108 ± 0.222
0.686TrpPhe: 0.686 ± 0.221
1.213TrpGly: 1.213 ± 0.242
0.369TrpHis: 0.369 ± 0.136
0.738TrpIle: 0.738 ± 0.185
0.686TrpLys: 0.686 ± 0.152
1.477TrpLeu: 1.477 ± 0.249
0.633TrpMet: 0.633 ± 0.169
0.897TrpAsn: 0.897 ± 0.254
0.58TrpPro: 0.58 ± 0.199
0.844TrpGln: 0.844 ± 0.2
0.897TrpArg: 0.897 ± 0.254
1.002TrpSer: 1.002 ± 0.235
0.897TrpThr: 0.897 ± 0.255
0.791TrpVal: 0.791 ± 0.229
0.369TrpTrp: 0.369 ± 0.145
0.58TrpTyr: 0.58 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.006TyrAla: 3.006 ± 0.415
0.316TyrCys: 0.316 ± 0.159
2.373TyrAsp: 2.373 ± 0.354
2.162TyrGlu: 2.162 ± 0.392
0.897TyrPhe: 0.897 ± 0.228
2.479TyrGly: 2.479 ± 0.393
0.58TyrHis: 0.58 ± 0.166
1.846TyrIle: 1.846 ± 0.3
1.318TyrLys: 1.318 ± 0.221
1.951TyrLeu: 1.951 ± 0.443
1.055TyrMet: 1.055 ± 0.191
1.318TyrAsn: 1.318 ± 0.265
2.11TyrPro: 2.11 ± 0.38
1.529TyrGln: 1.529 ± 0.346
2.268TyrArg: 2.268 ± 0.362
2.373TyrSer: 2.373 ± 0.409
2.69TyrThr: 2.69 ± 0.426
2.69TyrVal: 2.69 ± 0.305
0.633TyrTrp: 0.633 ± 0.198
1.055TyrTyr: 1.055 ± 0.254
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (18962 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski