Amino acid dipepetide frequency for Porcine torovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.811AlaAla: 2.811 ± 1.423
1.036AlaCys: 1.036 ± 0.145
2.515AlaAsp: 2.515 ± 0.492
1.48AlaGlu: 1.48 ± 0.371
2.219AlaPhe: 2.219 ± 0.512
1.48AlaGly: 1.48 ± 0.366
1.258AlaHis: 1.258 ± 0.338
2.959AlaIle: 2.959 ± 1.077
2.811AlaLys: 2.811 ± 0.465
4.587AlaLeu: 4.587 ± 0.723
1.184AlaMet: 1.184 ± 0.799
2.515AlaAsn: 2.515 ± 0.464
1.628AlaPro: 1.628 ± 0.385
1.48AlaGln: 1.48 ± 0.44
1.628AlaArg: 1.628 ± 0.349
2.441AlaSer: 2.441 ± 0.477
3.477AlaThr: 3.477 ± 0.694
3.847AlaVal: 3.847 ± 0.42
0.518AlaTrp: 0.518 ± 0.309
2.737AlaTyr: 2.737 ± 0.622
0.0AlaXaa: 0.0 ± 0.0
Cys
1.406CysAla: 1.406 ± 0.35
0.444CysCys: 0.444 ± 0.207
3.181CysAsp: 3.181 ± 0.645
1.85CysGlu: 1.85 ± 0.489
2.071CysPhe: 2.071 ± 0.214
1.776CysGly: 1.776 ± 0.29
0.444CysHis: 0.444 ± 0.402
1.11CysIle: 1.11 ± 0.644
1.406CysLys: 1.406 ± 0.221
3.403CysLeu: 3.403 ± 0.715
0.222CysMet: 0.222 ± 0.217
1.48CysAsn: 1.48 ± 0.323
1.997CysPro: 1.997 ± 0.294
1.036CysGln: 1.036 ± 0.201
1.11CysArg: 1.11 ± 0.239
2.367CysSer: 2.367 ± 0.57
2.071CysThr: 2.071 ± 0.708
2.293CysVal: 2.293 ± 0.188
0.444CysTrp: 0.444 ± 0.152
1.11CysTyr: 1.11 ± 0.407
0.0CysXaa: 0.0 ± 0.0
Asp
2.367AspAla: 2.367 ± 0.286
2.219AspCys: 2.219 ± 0.602
4.587AspAsp: 4.587 ± 1.349
2.885AspGlu: 2.885 ± 0.51
5.771AspPhe: 5.771 ± 1.132
2.589AspGly: 2.589 ± 0.264
0.518AspHis: 0.518 ± 0.1
2.959AspIle: 2.959 ± 0.486
2.959AspLys: 2.959 ± 0.742
6.362AspLeu: 6.362 ± 1.355
0.74AspMet: 0.74 ± 0.149
2.663AspAsn: 2.663 ± 0.226
1.776AspPro: 1.776 ± 0.374
2.515AspGln: 2.515 ± 0.355
1.258AspArg: 1.258 ± 0.333
2.589AspSer: 2.589 ± 0.281
3.033AspThr: 3.033 ± 0.271
4.439AspVal: 4.439 ± 0.386
0.666AspTrp: 0.666 ± 0.151
2.811AspTyr: 2.811 ± 0.494
0.0AspXaa: 0.0 ± 0.0
Glu
1.776GluAla: 1.776 ± 0.375
1.776GluCys: 1.776 ± 0.584
1.997GluAsp: 1.997 ± 0.65
2.145GluGlu: 2.145 ± 0.437
2.293GluPhe: 2.293 ± 0.676
2.589GluGly: 2.589 ± 0.691
0.74GluHis: 0.74 ± 0.149
1.48GluIle: 1.48 ± 0.533
3.033GluLys: 3.033 ± 0.737
3.847GluLeu: 3.847 ± 0.665
1.036GluMet: 1.036 ± 0.548
3.255GluAsn: 3.255 ± 0.702
1.85GluPro: 1.85 ± 0.401
3.033GluGln: 3.033 ± 0.369
1.036GluArg: 1.036 ± 0.187
2.515GluSer: 2.515 ± 0.455
1.776GluThr: 1.776 ± 0.204
4.143GluVal: 4.143 ± 0.574
0.518GluTrp: 0.518 ± 0.1
1.332GluTyr: 1.332 ± 0.262
0.0GluXaa: 0.0 ± 0.0
Phe
2.663PheAla: 2.663 ± 0.695
1.628PheCys: 1.628 ± 0.351
3.921PheAsp: 3.921 ± 0.395
3.477PheGlu: 3.477 ± 0.416
2.441PhePhe: 2.441 ± 0.449
3.847PheGly: 3.847 ± 0.353
0.962PheHis: 0.962 ± 0.186
2.811PheIle: 2.811 ± 0.797
5.401PheLys: 5.401 ± 0.414
4.587PheLeu: 4.587 ± 0.288
0.666PheMet: 0.666 ± 0.229
2.663PheAsn: 2.663 ± 0.248
1.258PhePro: 1.258 ± 0.572
2.589PheGln: 2.589 ± 0.643
2.811PheArg: 2.811 ± 0.455
6.88PheSer: 6.88 ± 0.417
3.477PheThr: 3.477 ± 0.203
6.362PheVal: 6.362 ± 1.557
1.258PheTrp: 1.258 ± 0.458
4.439PheTyr: 4.439 ± 0.95
0.0PheXaa: 0.0 ± 0.0
Gly
1.924GlyAla: 1.924 ± 0.324
1.48GlyCys: 1.48 ± 0.142
3.033GlyAsp: 3.033 ± 0.334
1.85GlyGlu: 1.85 ± 0.498
4.513GlyPhe: 4.513 ± 0.578
2.515GlyGly: 2.515 ± 0.356
1.036GlyHis: 1.036 ± 0.269
2.737GlyIle: 2.737 ± 0.165
3.773GlyLys: 3.773 ± 0.717
4.809GlyLeu: 4.809 ± 0.475
1.036GlyMet: 1.036 ± 0.352
1.85GlyAsn: 1.85 ± 0.345
1.702GlyPro: 1.702 ± 0.432
2.145GlyGln: 2.145 ± 0.369
1.11GlyArg: 1.11 ± 0.117
2.811GlySer: 2.811 ± 1.145
3.403GlyThr: 3.403 ± 0.719
4.735GlyVal: 4.735 ± 0.351
0.37GlyTrp: 0.37 ± 0.421
2.737GlyTyr: 2.737 ± 0.282
0.0GlyXaa: 0.0 ± 0.0
His
1.184HisAla: 1.184 ± 0.221
0.444HisCys: 0.444 ± 0.265
1.11HisAsp: 1.11 ± 0.362
0.592HisGlu: 0.592 ± 0.166
1.554HisPhe: 1.554 ± 0.237
1.11HisGly: 1.11 ± 0.275
0.666HisHis: 0.666 ± 0.127
1.036HisIle: 1.036 ± 0.295
1.11HisLys: 1.11 ± 0.273
2.071HisLeu: 2.071 ± 0.301
0.74HisMet: 0.74 ± 0.15
1.258HisAsn: 1.258 ± 0.115
1.11HisPro: 1.11 ± 0.271
1.332HisGln: 1.332 ± 0.257
0.37HisArg: 0.37 ± 0.21
1.406HisSer: 1.406 ± 0.35
0.518HisThr: 0.518 ± 0.1
1.776HisVal: 1.776 ± 0.352
0.592HisTrp: 0.592 ± 0.172
2.145HisTyr: 2.145 ± 0.437
0.0HisXaa: 0.0 ± 0.0
Ile
2.145IleAla: 2.145 ± 1.214
1.554IleCys: 1.554 ± 0.34
1.997IleAsp: 1.997 ± 0.188
1.702IleGlu: 1.702 ± 0.557
3.329IlePhe: 3.329 ± 1.135
2.737IleGly: 2.737 ± 0.334
1.184IleHis: 1.184 ± 0.278
2.589IleIle: 2.589 ± 0.383
3.181IleLys: 3.181 ± 0.549
5.549IleLeu: 5.549 ± 0.815
1.258IleMet: 1.258 ± 0.226
1.184IleAsn: 1.184 ± 0.537
1.997IlePro: 1.997 ± 0.487
1.332IleGln: 1.332 ± 0.5
0.814IleArg: 0.814 ± 0.247
4.365IleSer: 4.365 ± 1.414
3.403IleThr: 3.403 ± 0.681
5.327IleVal: 5.327 ± 0.75
1.036IleTrp: 1.036 ± 0.329
1.997IleTyr: 1.997 ± 0.28
0.0IleXaa: 0.0 ± 0.0
Lys
2.367LysAla: 2.367 ± 0.314
2.219LysCys: 2.219 ± 0.724
2.663LysAsp: 2.663 ± 0.711
1.776LysGlu: 1.776 ± 0.284
3.625LysPhe: 3.625 ± 0.645
1.628LysGly: 1.628 ± 0.534
1.184LysHis: 1.184 ± 0.397
3.329LysIle: 3.329 ± 0.573
1.776LysLys: 1.776 ± 0.468
5.992LysLeu: 5.992 ± 0.803
1.11LysMet: 1.11 ± 0.367
2.885LysAsn: 2.885 ± 0.56
4.587LysPro: 4.587 ± 0.559
3.699LysGln: 3.699 ± 0.822
1.48LysArg: 1.48 ± 0.292
4.217LysSer: 4.217 ± 0.392
3.329LysThr: 3.329 ± 0.611
4.587LysVal: 4.587 ± 0.413
0.74LysTrp: 0.74 ± 0.253
2.293LysTyr: 2.293 ± 0.323
0.0LysXaa: 0.0 ± 0.0
Leu
5.697LeuAla: 5.697 ± 0.488
2.589LeuCys: 2.589 ± 0.287
5.475LeuAsp: 5.475 ± 0.661
4.957LeuGlu: 4.957 ± 0.713
6.066LeuPhe: 6.066 ± 0.668
5.475LeuGly: 5.475 ± 0.886
2.071LeuHis: 2.071 ± 0.48
4.587LeuIle: 4.587 ± 0.755
5.179LeuLys: 5.179 ± 1.142
8.952LeuLeu: 8.952 ± 0.758
1.628LeuMet: 1.628 ± 0.316
4.217LeuAsn: 4.217 ± 0.477
6.88LeuPro: 6.88 ± 1.334
4.883LeuGln: 4.883 ± 1.078
2.959LeuArg: 2.959 ± 0.385
10.283LeuSer: 10.283 ± 1.234
6.88LeuThr: 6.88 ± 0.641
6.88LeuVal: 6.88 ± 1.114
1.036LeuTrp: 1.036 ± 0.354
4.439LeuTyr: 4.439 ± 0.58
0.0LeuXaa: 0.0 ± 0.0
Met
0.592MetAla: 0.592 ± 0.436
1.036MetCys: 1.036 ± 0.264
0.74MetAsp: 0.74 ± 0.309
0.444MetGlu: 0.444 ± 0.083
1.776MetPhe: 1.776 ± 0.952
0.444MetGly: 0.444 ± 0.225
0.37MetHis: 0.37 ± 0.121
1.036MetIle: 1.036 ± 0.145
0.814MetLys: 0.814 ± 0.149
2.293MetLeu: 2.293 ± 0.293
1.11MetMet: 1.11 ± 0.671
0.888MetAsn: 0.888 ± 0.282
1.258MetPro: 1.258 ± 0.637
0.888MetGln: 0.888 ± 0.279
0.962MetArg: 0.962 ± 0.344
1.48MetSer: 1.48 ± 0.325
1.554MetThr: 1.554 ± 0.304
1.48MetVal: 1.48 ± 0.292
0.888MetTrp: 0.888 ± 0.383
1.036MetTyr: 1.036 ± 0.221
0.0MetXaa: 0.0 ± 0.0
Asn
1.776AsnAla: 1.776 ± 0.383
2.145AsnCys: 2.145 ± 0.255
2.145AsnAsp: 2.145 ± 0.223
2.145AsnGlu: 2.145 ± 0.331
3.699AsnPhe: 3.699 ± 0.816
3.033AsnGly: 3.033 ± 0.413
1.11AsnHis: 1.11 ± 0.202
2.219AsnIle: 2.219 ± 0.507
1.85AsnLys: 1.85 ± 0.701
4.143AsnLeu: 4.143 ± 0.646
0.888AsnMet: 0.888 ± 0.271
1.85AsnAsn: 1.85 ± 0.451
2.293AsnPro: 2.293 ± 0.653
1.48AsnGln: 1.48 ± 0.929
1.776AsnArg: 1.776 ± 0.685
2.737AsnSer: 2.737 ± 0.372
2.145AsnThr: 2.145 ± 0.567
4.735AsnVal: 4.735 ± 0.336
0.962AsnTrp: 0.962 ± 0.21
2.293AsnTyr: 2.293 ± 0.921
0.0AsnXaa: 0.0 ± 0.0
Pro
2.293ProAla: 2.293 ± 0.278
0.592ProCys: 0.592 ± 0.204
2.441ProAsp: 2.441 ± 0.414
2.293ProGlu: 2.293 ± 0.333
3.699ProPhe: 3.699 ± 0.951
2.071ProGly: 2.071 ± 0.696
1.258ProHis: 1.258 ± 0.225
2.959ProIle: 2.959 ± 0.618
2.589ProLys: 2.589 ± 0.361
4.735ProLeu: 4.735 ± 0.439
1.406ProMet: 1.406 ± 0.421
1.997ProAsn: 1.997 ± 0.661
2.367ProPro: 2.367 ± 0.304
2.293ProGln: 2.293 ± 1.105
1.997ProArg: 1.997 ± 0.359
4.513ProSer: 4.513 ± 0.43
3.551ProThr: 3.551 ± 0.251
4.439ProVal: 4.439 ± 0.521
0.37ProTrp: 0.37 ± 0.121
1.924ProTyr: 1.924 ± 0.771
0.0ProXaa: 0.0 ± 0.0
Gln
2.071GlnAla: 2.071 ± 0.291
1.48GlnCys: 1.48 ± 0.66
3.699GlnAsp: 3.699 ± 0.581
1.11GlnGlu: 1.11 ± 0.201
2.071GlnPhe: 2.071 ± 0.449
2.367GlnGly: 2.367 ± 0.585
1.406GlnHis: 1.406 ± 0.264
2.145GlnIle: 2.145 ± 0.486
1.258GlnLys: 1.258 ± 0.338
5.623GlnLeu: 5.623 ± 0.466
1.11GlnMet: 1.11 ± 0.332
2.071GlnAsn: 2.071 ± 0.824
3.551GlnPro: 3.551 ± 0.763
4.217GlnGln: 4.217 ± 1.107
1.924GlnArg: 1.924 ± 0.742
3.847GlnSer: 3.847 ± 0.684
1.332GlnThr: 1.332 ± 0.214
4.217GlnVal: 4.217 ± 1.408
0.518GlnTrp: 0.518 ± 0.241
1.332GlnTyr: 1.332 ± 0.238
0.0GlnXaa: 0.0 ± 0.0
Arg
1.48ArgAla: 1.48 ± 0.425
0.518ArgCys: 0.518 ± 0.198
1.776ArgAsp: 1.776 ± 0.386
0.444ArgGlu: 0.444 ± 0.083
2.441ArgPhe: 2.441 ± 0.373
1.702ArgGly: 1.702 ± 0.723
1.11ArgHis: 1.11 ± 0.169
0.666ArgIle: 0.666 ± 0.224
1.406ArgLys: 1.406 ± 0.271
3.995ArgLeu: 3.995 ± 0.512
0.888ArgMet: 0.888 ± 0.162
1.036ArgAsn: 1.036 ± 0.436
1.406ArgPro: 1.406 ± 0.253
1.332ArgGln: 1.332 ± 0.725
2.663ArgArg: 2.663 ± 1.557
2.589ArgSer: 2.589 ± 0.884
1.702ArgThr: 1.702 ± 0.296
2.885ArgVal: 2.885 ± 0.317
0.222ArgTrp: 0.222 ± 0.076
2.663ArgTyr: 2.663 ± 0.295
0.0ArgXaa: 0.0 ± 0.0
Ser
2.219SerAla: 2.219 ± 0.507
2.293SerCys: 2.293 ± 0.244
4.735SerAsp: 4.735 ± 0.677
2.737SerGlu: 2.737 ± 0.161
4.587SerPhe: 4.587 ± 0.766
3.847SerGly: 3.847 ± 1.176
1.628SerHis: 1.628 ± 0.423
3.847SerIle: 3.847 ± 1.384
4.883SerLys: 4.883 ± 1.107
9.174SerLeu: 9.174 ± 1.727
1.702SerMet: 1.702 ± 0.799
3.477SerAsn: 3.477 ± 0.695
3.477SerPro: 3.477 ± 0.938
3.033SerGln: 3.033 ± 0.503
2.589SerArg: 2.589 ± 1.147
7.62SerSer: 7.62 ± 2.79
4.809SerThr: 4.809 ± 1.05
7.028SerVal: 7.028 ± 0.797
1.628SerTrp: 1.628 ± 0.402
3.477SerTyr: 3.477 ± 0.908
0.0SerXaa: 0.0 ± 0.0
Thr
3.033ThrAla: 3.033 ± 0.348
2.367ThrCys: 2.367 ± 0.232
2.367ThrAsp: 2.367 ± 0.896
1.997ThrGlu: 1.997 ± 0.252
2.515ThrPhe: 2.515 ± 0.811
3.921ThrGly: 3.921 ± 0.412
1.48ThrHis: 1.48 ± 0.292
3.625ThrIle: 3.625 ± 0.457
3.107ThrLys: 3.107 ± 0.645
6.436ThrLeu: 6.436 ± 1.035
0.888ThrMet: 0.888 ± 0.348
2.589ThrAsn: 2.589 ± 0.343
3.995ThrPro: 3.995 ± 0.972
2.737ThrGln: 2.737 ± 0.875
1.48ThrArg: 1.48 ± 0.191
5.105ThrSer: 5.105 ± 0.708
4.661ThrThr: 4.661 ± 0.488
3.847ThrVal: 3.847 ± 0.506
1.11ThrTrp: 1.11 ± 0.159
2.219ThrTyr: 2.219 ± 0.282
0.0ThrXaa: 0.0 ± 0.0
Val
4.513ValAla: 4.513 ± 0.827
2.515ValCys: 2.515 ± 0.466
4.143ValAsp: 4.143 ± 0.519
5.918ValGlu: 5.918 ± 1.487
4.883ValPhe: 4.883 ± 0.401
4.513ValGly: 4.513 ± 0.59
1.702ValHis: 1.702 ± 0.455
2.959ValIle: 2.959 ± 0.366
5.253ValLys: 5.253 ± 0.797
8.508ValLeu: 8.508 ± 1.107
1.924ValMet: 1.924 ± 0.531
4.957ValAsn: 4.957 ± 0.611
4.365ValPro: 4.365 ± 0.391
4.291ValGln: 4.291 ± 0.44
2.589ValArg: 2.589 ± 0.291
6.066ValSer: 6.066 ± 0.388
4.809ValThr: 4.809 ± 1.271
8.73ValVal: 8.73 ± 1.602
0.814ValTrp: 0.814 ± 0.196
4.217ValTyr: 4.217 ± 0.443
0.0ValXaa: 0.0 ± 0.0
Trp
0.296TrpAla: 0.296 ± 0.285
0.666TrpCys: 0.666 ± 0.208
0.518TrpAsp: 0.518 ± 0.44
0.222TrpGlu: 0.222 ± 0.076
1.628TrpPhe: 1.628 ± 0.356
0.074TrpGly: 0.074 ± 0.124
0.444TrpHis: 0.444 ± 0.153
0.296TrpIle: 0.296 ± 0.215
0.888TrpLys: 0.888 ± 0.289
2.293TrpLeu: 2.293 ± 0.41
0.37TrpMet: 0.37 ± 0.121
0.444TrpAsn: 0.444 ± 0.193
1.036TrpPro: 1.036 ± 0.317
0.444TrpGln: 0.444 ± 0.152
0.518TrpArg: 0.518 ± 0.241
1.332TrpSer: 1.332 ± 0.293
0.74TrpThr: 0.74 ± 0.181
1.036TrpVal: 1.036 ± 0.343
0.444TrpTrp: 0.444 ± 0.257
0.814TrpTyr: 0.814 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.997TyrAla: 1.997 ± 0.37
2.293TyrCys: 2.293 ± 0.589
2.663TyrAsp: 2.663 ± 0.706
2.145TyrGlu: 2.145 ± 0.249
2.959TyrPhe: 2.959 ± 0.376
1.85TyrGly: 1.85 ± 0.358
1.554TyrHis: 1.554 ± 0.456
3.329TyrIle: 3.329 ± 0.331
2.811TyrLys: 2.811 ± 0.475
3.773TyrLeu: 3.773 ± 0.328
1.11TyrMet: 1.11 ± 0.628
2.145TyrAsn: 2.145 ± 0.474
1.258TyrPro: 1.258 ± 0.365
2.589TyrGln: 2.589 ± 0.277
1.85TyrArg: 1.85 ± 0.232
3.625TyrSer: 3.625 ± 1.329
2.811TyrThr: 2.811 ± 1.423
4.883TyrVal: 4.883 ± 0.783
0.37TyrTrp: 0.37 ± 0.578
2.145TyrTyr: 2.145 ± 0.444
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (13518 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski