Amino acid dipeptide frequency for Rat coronavirus Parker

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
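
As a rough illustration of what such a calculation involves, the following minimal sketch counts overlapping adjacent pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the choice to normalise by the total number of dipeptides are all assumptions made for the example; the page does not describe the database's exact procedure.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping, are counted within each protein only
    (never across protein boundaries), and are normalised by the total
    number of dipeptides observed.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Hypothetical toy input in one-letter code (the table uses three-letter codes).
toy = ["MAAV", "AVV"]
freqs = dipeptide_frequencies(toy)
```

Here "MAAV" contributes MA, AA and AV, and "AVV" contributes AV and VV, so AV accounts for two of the five dipeptides.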

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
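
The arithmetic behind the expected value can be sketched as follows. The helper `mark` and its simple above/below rule are assumptions for illustration only; the page does not state whether the colouring uses a plain comparison or some threshold. The two sample values are taken from the table (ValVal and TrpTrp).

```python
# Uniform-expectation baseline for dipeptide frequencies, in per mille.
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # plus Xaa for non-standard or unknown residues

expected_permille = 1000.0 / N_STANDARD ** 2       # 1000 / 400 = 2.5
expected_permille_xaa = 1000.0 / N_WITH_XAA ** 2   # 1000 / 441, about 2.27

def mark(observed_permille, expected=expected_permille):
    """Hypothetical colouring rule: compare a value with the uniform baseline."""
    if observed_permille > expected:
        return "red"    # overrepresented
    if observed_permille < expected:
        return "blue"   # underrepresented
    return "none"

valval = mark(12.087)  # ValVal from the table
trptrp = mark(0.068)   # TrpTrp from the table
```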

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error in ‰.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 4.997 ± 0.34 | 2.904 ± 0.594 | 4.524 ± 0.514 | 2.026 ± 0.279 | 3.849 ± 0.467 | 3.781 ± 0.848 | 1.283 ± 0.356 | 4.187 ± 0.476 | 4.592 ± 0.342 | 4.997 ± 0.487 | 1.621 ± 0.341 | 4.187 ± 0.646 | 2.093 ± 0.623 | 2.026 ± 0.568 | 2.026 ± 0.236 | 6.077 ± 1.133 | 3.511 ± 0.469 | 6.618 ± 0.612 | 0.945 ± 0.198 | 2.296 ± 0.273 | 0.0 ± 0.0 |
| Cys | 2.026 ± 0.493 | 1.756 ± 0.266 | 2.634 ± 0.474 | 1.013 ± 0.266 | 2.228 ± 0.325 | 2.701 ± 0.411 | 0.473 ± 0.117 | 2.093 ± 0.556 | 2.431 ± 0.274 | 3.511 ± 0.408 | 0.473 ± 0.61 | 2.161 ± 0.352 | 1.148 ± 0.136 | 0.81 ± 0.314 | 1.351 ± 0.196 | 3.579 ± 0.71 | 1.823 ± 0.427 | 2.701 ± 0.425 | 0.608 ± 0.194 | 2.161 ± 0.43 | 0.0 ± 0.0 |
| Asp | 4.389 ± 1.046 | 2.026 ± 0.181 | 3.579 ± 0.462 | 2.701 ± 0.39 | 2.769 ± 0.575 | 4.389 ± 0.575 | 0.608 ± 0.229 | 1.891 ± 0.601 | 3.174 ± 0.3 | 5.267 ± 0.442 | 1.756 ± 0.407 | 2.093 ± 0.37 | 1.756 ± 0.352 | 1.756 ± 0.366 | 1.756 ± 0.387 | 4.322 ± 0.517 | 2.228 ± 0.419 | 7.36 ± 1.013 | 0.405 ± 0.323 | 2.701 ± 0.256 | 0.0 ± 0.0 |
| Glu | 4.389 ± 0.452 | 1.013 ± 0.251 | 2.566 ± 0.449 | 2.566 ± 0.667 | 2.431 ± 0.254 | 2.431 ± 0.566 | 0.473 ± 0.117 | 1.891 ± 0.628 | 1.891 ± 0.287 | 4.187 ± 0.661 | 0.81 ± 0.27 | 1.351 ± 0.227 | 1.958 ± 0.253 | 0.878 ± 0.235 | 1.486 ± 0.219 | 2.228 ± 0.501 | 2.093 ± 0.332 | 4.254 ± 0.831 | 0.54 ± 0.252 | 1.553 ± 0.353 | 0.0 ± 0.0 |
| Phe | 2.701 ± 0.308 | 1.891 ± 0.492 | 3.309 ± 0.478 | 2.093 ± 0.225 | 1.958 ± 0.207 | 3.241 ± 0.426 | 0.608 ± 0.206 | 2.228 ± 0.899 | 3.781 ± 0.731 | 3.376 ± 0.632 | 1.215 ± 0.351 | 4.119 ± 0.505 | 1.486 ± 0.164 | 1.283 ± 0.318 | 1.891 ± 0.783 | 3.309 ± 0.351 | 2.971 ± 0.411 | 6.483 ± 1.242 | 0.608 ± 0.209 | 3.579 ± 0.676 | 0.0 ± 0.0 |
| Gly | 2.971 ± 0.339 | 3.309 ± 0.321 | 3.241 ± 0.695 | 1.486 ± 0.154 | 3.917 ± 1.097 | 3.376 ± 0.566 | 1.486 ± 0.648 | 2.634 ± 0.916 | 3.781 ± 0.398 | 5.267 ± 0.63 | 1.215 ± 0.328 | 3.309 ± 0.562 | 1.08 ± 0.502 | 1.756 ± 0.56 | 2.026 ± 0.346 | 5.335 ± 0.627 | 3.984 ± 0.446 | 7.428 ± 0.806 | 0.743 ± 0.184 | 3.039 ± 0.418 | 0.0 ± 0.0 |
| His | 1.553 ± 0.454 | 0.405 ± 0.325 | 1.08 ± 0.176 | 0.743 ± 0.153 | 1.283 ± 0.163 | 0.675 ± 0.292 | 0.135 ± 0.177 | 0.675 ± 0.174 | 1.08 ± 0.242 | 1.823 ± 0.703 | 0.54 ± 0.256 | 1.013 ± 0.265 | 0.54 ± 0.152 | 0.675 ± 0.172 | 0.608 ± 0.167 | 0.743 ± 0.108 | 0.945 ± 0.133 | 2.228 ± 0.388 | 0.203 ± 0.065 | 0.54 ± 0.284 | 0.0 ± 0.0 |
| Ile | 2.769 ± 0.325 | 1.891 ± 0.219 | 2.093 ± 0.396 | 1.688 ± 0.238 | 1.688 ± 0.507 | 3.174 ± 0.659 | 0.675 ± 0.237 | 2.701 ± 1.015 | 3.444 ± 0.596 | 4.524 ± 0.744 | 0.945 ± 0.321 | 2.431 ± 0.595 | 1.621 ± 0.631 | 1.688 ± 0.435 | 1.283 ± 0.511 | 1.891 ± 0.712 | 2.836 ± 0.176 | 4.524 ± 1.426 | 0.338 ± 0.142 | 1.013 ± 0.299 | 0.0 ± 0.0 |
| Lys | 4.187 ± 0.446 | 2.498 ± 0.439 | 1.891 ± 0.374 | 3.039 ± 0.224 | 3.376 ± 0.746 | 3.984 ± 0.388 | 1.351 ± 0.249 | 2.701 ± 0.402 | 1.958 ± 0.256 | 6.55 ± 0.682 | 1.013 ± 0.559 | 2.161 ± 0.474 | 3.106 ± 0.699 | 2.363 ± 0.662 | 2.228 ± 0.315 | 3.039 ± 0.332 | 1.958 ± 0.201 | 5.672 ± 0.854 | 1.283 ± 0.174 | 2.836 ± 0.448 | 0.0 ± 0.0 |
| Leu | 6.347 ± 0.899 | 3.984 ± 0.543 | 5.74 ± 0.877 | 3.917 ± 0.309 | 5.132 ± 0.54 | 5.537 ± 1.067 | 1.283 ± 0.292 | 3.781 ± 0.358 | 3.917 ± 0.484 | 8.306 ± 1.963 | 1.283 ± 0.406 | 4.997 ± 0.696 | 4.322 ± 0.682 | 4.254 ± 0.501 | 3.511 ± 0.449 | 7.766 ± 0.647 | 5.74 ± 0.539 | 7.495 ± 0.723 | 1.283 ± 0.282 | 5.402 ± 0.587 | 0.0 ± 0.0 |
| Met | 2.566 ± 0.86 | 1.013 ± 0.357 | 1.08 ± 0.223 | 0.473 ± 0.347 | 1.148 ± 0.151 | 0.945 ± 0.844 | 0.878 ± 0.285 | 0.405 ± 0.126 | 0.405 ± 0.317 | 3.309 ± 0.342 | 0.54 ± 0.167 | 0.878 ± 0.256 | 1.486 ± 0.284 | 1.283 ± 0.173 | 0.743 ± 0.206 | 1.351 ± 0.195 | 1.351 ± 0.223 | 1.351 ± 0.337 | 0.473 ± 0.257 | 1.351 ± 0.151 | 0.0 ± 0.0 |
| Asn | 3.241 ± 0.508 | 1.756 ± 0.346 | 1.688 ± 0.31 | 1.958 ± 0.225 | 2.701 ± 0.406 | 3.781 ± 0.499 | 0.81 ± 0.16 | 1.688 ± 0.388 | 2.701 ± 0.461 | 3.511 ± 0.871 | 1.418 ± 0.224 | 2.634 ± 0.807 | 2.228 ± 0.402 | 1.756 ± 0.495 | 2.228 ± 0.518 | 3.511 ± 0.731 | 2.971 ± 0.62 | 5.672 ± 0.735 | 0.675 ± 0.174 | 1.823 ± 0.74 | 0.0 ± 0.0 |
| Pro | 2.566 ± 0.323 | 0.945 ± 0.242 | 2.093 ± 0.396 | 2.026 ± 0.31 | 1.486 ± 0.226 | 2.161 ± 0.364 | 1.08 ± 0.683 | 1.486 ± 0.612 | 2.363 ± 0.712 | 3.106 ± 0.611 | 0.405 ± 0.345 | 1.553 ± 0.517 | 1.148 ± 0.398 | 1.486 ± 0.326 | 1.756 ± 0.227 | 2.769 ± 0.604 | 3.714 ± 0.368 | 3.714 ± 0.73 | 0.54 ± 0.171 | 1.486 ± 0.25 | 0.0 ± 0.0 |
| Gln | 1.351 ± 0.491 | 1.148 ± 0.31 | 1.688 ± 0.23 | 1.891 ± 0.301 | 1.688 ± 0.644 | 2.161 ± 0.254 | 0.945 ± 0.296 | 1.891 ± 0.413 | 1.756 ± 0.632 | 4.052 ± 0.479 | 0.27 ± 0.099 | 1.418 ± 0.372 | 1.08 ± 0.542 | 1.148 ± 0.238 | 0.945 ± 0.338 | 3.039 ± 0.326 | 1.958 ± 0.325 | 2.971 ± 0.479 | 1.148 ± 0.245 | 1.215 ± 0.368 | 0.0 ± 0.0 |
| Arg | 3.106 ± 0.55 | 1.013 ± 0.219 | 2.026 ± 0.392 | 1.756 ± 0.464 | 1.891 ± 0.26 | 2.228 ± 0.631 | 0.945 ± 0.198 | 0.878 ± 0.422 | 2.431 ± 0.319 | 3.781 ± 0.434 | 0.945 ± 0.211 | 1.283 ± 0.387 | 1.215 ± 0.548 | 1.148 ± 0.422 | 1.283 ± 0.575 | 3.849 ± 0.732 | 1.891 ± 0.309 | 3.376 ± 0.456 | 0.135 ± 0.224 | 1.351 ± 0.184 | 0.0 ± 0.0 |
| Ser | 5.672 ± 0.859 | 2.498 ± 0.536 | 4.187 ± 0.488 | 2.971 ± 0.197 | 3.241 ± 0.312 | 4.862 ± 1.05 | 1.418 ± 0.408 | 4.524 ± 0.744 | 4.119 ± 0.304 | 7.09 ± 0.845 | 2.161 ± 0.315 | 2.566 ± 0.571 | 2.498 ± 0.64 | 2.026 ± 0.25 | 2.769 ± 1.317 | 5.064 ± 1.115 | 3.174 ± 0.295 | 7.428 ± 1.283 | 0.878 ± 0.15 | 3.309 ± 0.51 | 0.0 ± 0.0 |
| Thr | 3.444 ± 0.634 | 1.553 ± 0.289 | 3.579 ± 0.463 | 2.566 ± 0.599 | 3.714 ± 0.559 | 4.592 ± 0.484 | 1.013 ± 0.21 | 1.958 ± 0.642 | 2.904 ± 0.627 | 5.807 ± 0.747 | 2.228 ± 0.358 | 2.296 ± 0.316 | 2.431 ± 0.485 | 1.621 ± 0.309 | 2.026 ± 0.428 | 3.714 ± 0.7 | 3.579 ± 0.399 | 4.119 ± 0.435 | 0.405 ± 0.122 | 3.106 ± 0.227 | 0.0 ± 0.0 |
| Val | 6.618 ± 0.652 | 3.579 ± 0.445 | 7.225 ± 0.804 | 3.984 ± 0.45 | 3.714 ± 0.24 | 4.187 ± 0.494 | 1.08 ± 0.176 | 3.714 ± 0.746 | 7.495 ± 1.104 | 9.859 ± 1.07 | 2.701 ± 0.479 | 4.929 ± 0.876 | 4.794 ± 0.662 | 3.714 ± 0.635 | 3.714 ± 0.529 | 6.753 ± 0.526 | 4.997 ± 0.802 | 12.087 ± 2.746 | 1.283 ± 0.442 | 4.524 ± 0.522 | 0.0 ± 0.0 |
| Trp | 0.81 ± 0.158 | 0.27 ± 0.177 | 0.338 ± 0.212 | 0.27 ± 0.08 | 1.351 ± 0.189 | 0.338 ± 0.104 | 0.338 ± 0.097 | 0.405 ± 0.304 | 0.27 ± 0.08 | 2.363 ± 0.369 | 0.203 ± 0.158 | 0.878 ± 0.248 | 0.608 ± 0.233 | 0.473 ± 0.13 | 0.81 ± 0.252 | 1.148 ± 0.183 | 0.608 ± 0.122 | 0.743 ± 0.166 | 0.068 ± 0.161 | 0.675 ± 0.272 | 0.0 ± 0.0 |
| Tyr | 2.836 ± 0.289 | 2.228 ± 0.292 | 2.363 ± 0.533 | 1.823 ± 0.142 | 2.566 ± 0.522 | 2.836 ± 0.386 | 0.608 ± 0.287 | 1.486 ± 0.187 | 2.701 ± 0.744 | 3.376 ± 0.465 | 1.215 ± 0.219 | 2.498 ± 0.621 | 1.283 ± 0.346 | 1.621 ± 0.225 | 2.093 ± 0.212 | 2.971 ± 0.411 | 4.322 ± 0.521 | 4.727 ± 0.645 | 0.405 ± 0.132 | 3.511 ± 0.534 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 10 proteins (14810 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
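
A minimal sketch of what protein-level bootstrapping could look like is given below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the fixed seed, the one-letter toy sequences, and the use of the standard deviation as the error measure are assumptions for illustration; the page only states that bootstrapping with 100 resamples at the protein level was used.

```python
import random
import statistics

def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is taken as the sample standard deviation of the
    resampled estimates; the source does not say which statistic is used.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)

# Hypothetical toy proteome in one-letter code.
toy = ["AAAA", "VVVV", "AVAV"]
err = bootstrap_error(toy, "AV")
```

Resampling whole proteins rather than individual residues keeps each protein's composition intact, so a dipeptide concentrated in a single protein gets a correspondingly large error.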

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski