Amino acid dipepetide frequency for Bovine nidovirus TCH5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.93AlaAla: 4.93 ± 0.676
2.465AlaCys: 2.465 ± 0.333
2.876AlaAsp: 2.876 ± 0.463
2.67AlaGlu: 2.67 ± 0.633
2.465AlaPhe: 2.465 ± 0.354
3.697AlaGly: 3.697 ± 0.693
0.719AlaHis: 0.719 ± 0.226
2.67AlaIle: 2.67 ± 0.654
4.93AlaLys: 4.93 ± 0.526
6.162AlaLeu: 6.162 ± 0.472
1.951AlaMet: 1.951 ± 0.329
2.465AlaAsn: 2.465 ± 0.428
1.541AlaPro: 1.541 ± 0.354
2.67AlaGln: 2.67 ± 0.281
2.773AlaArg: 2.773 ± 0.534
4.313AlaSer: 4.313 ± 0.61
1.438AlaThr: 1.438 ± 0.457
7.292AlaVal: 7.292 ± 1.369
0.514AlaTrp: 0.514 ± 0.16
2.259AlaTyr: 2.259 ± 0.42
0.0AlaXaa: 0.0 ± 0.0
Cys
2.362CysAla: 2.362 ± 0.569
1.232CysCys: 1.232 ± 0.335
3.184CysAsp: 3.184 ± 0.27
1.849CysGlu: 1.849 ± 0.285
1.541CysPhe: 1.541 ± 0.236
1.849CysGly: 1.849 ± 0.663
0.308CysHis: 0.308 ± 0.1
1.746CysIle: 1.746 ± 0.384
3.492CysLys: 3.492 ± 0.595
1.541CysLeu: 1.541 ± 0.565
0.616CysMet: 0.616 ± 0.269
2.054CysAsn: 2.054 ± 0.437
0.616CysPro: 0.616 ± 0.298
1.335CysGln: 1.335 ± 0.314
0.411CysArg: 0.411 ± 0.136
1.849CysSer: 1.849 ± 0.396
1.335CysThr: 1.335 ± 0.561
2.876CysVal: 2.876 ± 0.593
0.103CysTrp: 0.103 ± 0.068
2.259CysTyr: 2.259 ± 0.475
0.0CysXaa: 0.0 ± 0.0
Asp
3.492AspAla: 3.492 ± 0.391
1.643AspCys: 1.643 ± 0.177
2.362AspAsp: 2.362 ± 0.366
3.492AspGlu: 3.492 ± 0.343
2.978AspPhe: 2.978 ± 0.403
4.519AspGly: 4.519 ± 0.707
0.822AspHis: 0.822 ± 0.411
2.259AspIle: 2.259 ± 0.321
4.211AspLys: 4.211 ± 1.239
4.416AspLeu: 4.416 ± 0.603
1.13AspMet: 1.13 ± 0.213
1.643AspAsn: 1.643 ± 0.327
1.541AspPro: 1.541 ± 0.246
0.616AspGln: 0.616 ± 0.248
1.438AspArg: 1.438 ± 0.978
4.005AspSer: 4.005 ± 0.379
1.027AspThr: 1.027 ± 0.615
4.108AspVal: 4.108 ± 0.475
0.719AspTrp: 0.719 ± 0.177
3.184AspTyr: 3.184 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
4.313GluAla: 4.313 ± 0.604
3.595GluCys: 3.595 ± 0.738
3.286GluAsp: 3.286 ± 0.592
5.34GluGlu: 5.34 ± 1.49
4.519GluPhe: 4.519 ± 0.848
3.389GluGly: 3.389 ± 0.371
1.13GluHis: 1.13 ± 0.215
3.8GluIle: 3.8 ± 0.249
4.416GluLys: 4.416 ± 0.318
4.211GluLeu: 4.211 ± 0.405
1.335GluMet: 1.335 ± 0.417
2.465GluAsn: 2.465 ± 0.207
1.951GluPro: 1.951 ± 0.401
2.568GluGln: 2.568 ± 0.519
1.438GluArg: 1.438 ± 0.307
2.259GluSer: 2.259 ± 0.27
1.13GluThr: 1.13 ± 0.241
6.573GluVal: 6.573 ± 0.695
0.0GluTrp: 0.0 ± 0.0
3.595GluTyr: 3.595 ± 0.491
0.0GluXaa: 0.0 ± 0.0
Phe
4.211PheAla: 4.211 ± 0.534
1.951PheCys: 1.951 ± 0.24
3.184PheAsp: 3.184 ± 0.96
4.622PheGlu: 4.622 ± 0.595
2.054PhePhe: 2.054 ± 0.299
3.595PheGly: 3.595 ± 0.474
0.411PheHis: 0.411 ± 0.148
3.081PheIle: 3.081 ± 0.388
5.34PheLys: 5.34 ± 0.257
2.67PheLeu: 2.67 ± 0.487
1.438PheMet: 1.438 ± 0.35
4.519PheAsn: 4.519 ± 0.6
1.232PhePro: 1.232 ± 0.237
2.876PheGln: 2.876 ± 0.375
1.746PheArg: 1.746 ± 0.246
4.416PheSer: 4.416 ± 0.316
2.157PheThr: 2.157 ± 0.488
6.881PheVal: 6.881 ± 0.419
0.924PheTrp: 0.924 ± 0.21
3.286PheTyr: 3.286 ± 0.679
0.0PheXaa: 0.0 ± 0.0
Gly
2.568GlyAla: 2.568 ± 0.503
2.054GlyCys: 2.054 ± 0.304
3.697GlyAsp: 3.697 ± 0.337
2.67GlyGlu: 2.67 ± 0.224
4.108GlyPhe: 4.108 ± 0.749
2.054GlyGly: 2.054 ± 0.483
2.157GlyHis: 2.157 ± 0.284
2.054GlyIle: 2.054 ± 0.281
5.34GlyLys: 5.34 ± 0.279
5.032GlyLeu: 5.032 ± 0.626
1.232GlyMet: 1.232 ± 0.405
3.081GlyAsn: 3.081 ± 0.253
1.027GlyPro: 1.027 ± 0.329
2.876GlyGln: 2.876 ± 0.583
3.697GlyArg: 3.697 ± 1.117
5.135GlySer: 5.135 ± 0.509
1.951GlyThr: 1.951 ± 0.733
7.086GlyVal: 7.086 ± 0.44
0.719GlyTrp: 0.719 ± 0.297
4.93GlyTyr: 4.93 ± 0.602
0.0GlyXaa: 0.0 ± 0.0
His
0.103HisAla: 0.103 ± 0.068
0.822HisCys: 0.822 ± 0.157
0.616HisAsp: 0.616 ± 0.504
1.335HisGlu: 1.335 ± 0.453
1.027HisPhe: 1.027 ± 0.438
1.438HisGly: 1.438 ± 0.289
0.308HisHis: 0.308 ± 0.172
0.719HisIle: 0.719 ± 0.314
1.232HisLys: 1.232 ± 0.402
1.027HisLeu: 1.027 ± 0.354
0.205HisMet: 0.205 ± 0.071
0.822HisAsn: 0.822 ± 0.343
0.616HisPro: 0.616 ± 0.201
0.719HisGln: 0.719 ± 0.177
0.205HisArg: 0.205 ± 0.071
0.924HisSer: 0.924 ± 0.121
0.411HisThr: 0.411 ± 0.163
1.541HisVal: 1.541 ± 0.245
0.308HisTrp: 0.308 ± 0.172
1.438HisTyr: 1.438 ± 0.801
0.0HisXaa: 0.0 ± 0.0
Ile
2.876IleAla: 2.876 ± 0.795
0.924IleCys: 0.924 ± 0.121
2.978IleAsp: 2.978 ± 0.375
2.876IleGlu: 2.876 ± 0.913
2.465IlePhe: 2.465 ± 0.844
3.081IleGly: 3.081 ± 0.238
0.411IleHis: 0.411 ± 0.136
2.157IleIle: 2.157 ± 0.526
2.773IleLys: 2.773 ± 0.24
2.773IleLeu: 2.773 ± 0.636
1.643IleMet: 1.643 ± 0.706
2.157IleAsn: 2.157 ± 0.992
2.054IlePro: 2.054 ± 0.467
1.541IleGln: 1.541 ± 0.305
2.568IleArg: 2.568 ± 0.594
2.978IleSer: 2.978 ± 0.695
1.232IleThr: 1.232 ± 0.222
5.032IleVal: 5.032 ± 0.583
0.719IleTrp: 0.719 ± 0.107
2.157IleTyr: 2.157 ± 0.625
0.0IleXaa: 0.0 ± 0.0
Lys
4.313LysAla: 4.313 ± 1.144
2.362LysCys: 2.362 ± 0.569
3.903LysAsp: 3.903 ± 0.32
4.93LysGlu: 4.93 ± 0.554
6.265LysPhe: 6.265 ± 0.524
3.697LysGly: 3.697 ± 0.555
1.643LysHis: 1.643 ± 0.469
2.362LysIle: 2.362 ± 0.206
3.697LysLys: 3.697 ± 0.698
5.957LysLeu: 5.957 ± 0.607
2.157LysMet: 2.157 ± 0.361
3.492LysAsn: 3.492 ± 0.545
2.259LysPro: 2.259 ± 0.417
3.8LysGln: 3.8 ± 0.409
2.465LysArg: 2.465 ± 0.42
3.286LysSer: 3.286 ± 0.343
2.465LysThr: 2.465 ± 0.421
7.805LysVal: 7.805 ± 1.514
1.232LysTrp: 1.232 ± 0.27
4.005LysTyr: 4.005 ± 0.338
0.0LysXaa: 0.0 ± 0.0
Leu
5.751LeuAla: 5.751 ± 0.49
2.157LeuCys: 2.157 ± 0.394
3.286LeuAsp: 3.286 ± 0.669
4.416LeuGlu: 4.416 ± 0.853
4.622LeuPhe: 4.622 ± 1.269
5.649LeuGly: 5.649 ± 0.536
1.335LeuHis: 1.335 ± 0.266
4.005LeuIle: 4.005 ± 0.624
5.546LeuLys: 5.546 ± 0.564
8.011LeuLeu: 8.011 ± 0.799
3.389LeuMet: 3.389 ± 1.071
4.724LeuAsn: 4.724 ± 0.556
3.8LeuPro: 3.8 ± 0.371
3.081LeuGln: 3.081 ± 0.778
2.876LeuArg: 2.876 ± 0.569
5.032LeuSer: 5.032 ± 0.68
3.492LeuThr: 3.492 ± 0.682
8.113LeuVal: 8.113 ± 0.905
0.514LeuTrp: 0.514 ± 0.136
2.773LeuTyr: 2.773 ± 0.46
0.0LeuXaa: 0.0 ± 0.0
Met
1.951MetAla: 1.951 ± 0.231
0.924MetCys: 0.924 ± 0.91
1.13MetAsp: 1.13 ± 0.245
2.157MetGlu: 2.157 ± 0.554
1.541MetPhe: 1.541 ± 0.266
0.616MetGly: 0.616 ± 0.248
0.719MetHis: 0.719 ± 0.15
1.13MetIle: 1.13 ± 0.263
1.13MetLys: 1.13 ± 0.21
3.184MetLeu: 3.184 ± 0.609
0.822MetMet: 0.822 ± 0.221
0.514MetAsn: 0.514 ± 0.159
0.308MetPro: 0.308 ± 0.191
1.438MetGln: 1.438 ± 0.255
1.13MetArg: 1.13 ± 0.461
1.643MetSer: 1.643 ± 0.266
1.232MetThr: 1.232 ± 0.402
3.595MetVal: 3.595 ± 0.66
0.411MetTrp: 0.411 ± 0.141
1.232MetTyr: 1.232 ± 0.269
0.0MetXaa: 0.0 ± 0.0
Asn
2.568AsnAla: 2.568 ± 0.38
1.643AsnCys: 1.643 ± 0.313
2.259AsnAsp: 2.259 ± 0.31
2.465AsnGlu: 2.465 ± 0.364
2.773AsnPhe: 2.773 ± 0.824
5.34AsnGly: 5.34 ± 1.125
0.411AsnHis: 0.411 ± 0.141
2.465AsnIle: 2.465 ± 0.195
4.519AsnLys: 4.519 ± 0.263
2.876AsnLeu: 2.876 ± 0.48
1.541AsnMet: 1.541 ± 0.709
3.492AsnAsn: 3.492 ± 0.509
2.465AsnPro: 2.465 ± 0.483
1.541AsnGln: 1.541 ± 0.362
1.849AsnArg: 1.849 ± 0.932
2.054AsnSer: 2.054 ± 1.028
0.514AsnThr: 0.514 ± 0.639
5.649AsnVal: 5.649 ± 0.809
0.205AsnTrp: 0.205 ± 0.071
2.054AsnTyr: 2.054 ± 0.388
0.0AsnXaa: 0.0 ± 0.0
Pro
2.157ProAla: 2.157 ± 0.527
1.027ProCys: 1.027 ± 0.234
1.027ProAsp: 1.027 ± 0.286
1.951ProGlu: 1.951 ± 0.357
1.643ProPhe: 1.643 ± 0.391
1.541ProGly: 1.541 ± 0.566
0.924ProHis: 0.924 ± 0.165
1.746ProIle: 1.746 ± 0.447
1.849ProLys: 1.849 ± 0.582
2.362ProLeu: 2.362 ± 0.267
1.027ProMet: 1.027 ± 0.139
1.232ProAsn: 1.232 ± 0.405
1.643ProPro: 1.643 ± 0.52
1.643ProGln: 1.643 ± 0.251
1.541ProArg: 1.541 ± 0.236
3.286ProSer: 3.286 ± 0.312
1.13ProThr: 1.13 ± 0.21
4.622ProVal: 4.622 ± 0.806
0.411ProTrp: 0.411 ± 0.31
1.746ProTyr: 1.746 ± 0.2
0.0ProXaa: 0.0 ± 0.0
Gln
2.054GlnAla: 2.054 ± 0.283
0.924GlnCys: 0.924 ± 0.218
1.746GlnAsp: 1.746 ± 0.243
2.054GlnGlu: 2.054 ± 0.397
2.157GlnPhe: 2.157 ± 0.681
2.054GlnGly: 2.054 ± 0.55
0.616GlnHis: 0.616 ± 0.331
1.438GlnIle: 1.438 ± 0.285
2.465GlnLys: 2.465 ± 0.195
4.313GlnLeu: 4.313 ± 0.521
0.616GlnMet: 0.616 ± 0.199
1.438GlnAsn: 1.438 ± 0.415
0.924GlnPro: 0.924 ± 0.429
1.849GlnGln: 1.849 ± 1.941
3.184GlnArg: 3.184 ± 1.468
2.876GlnSer: 2.876 ± 0.737
1.746GlnThr: 1.746 ± 0.393
4.211GlnVal: 4.211 ± 0.436
1.027GlnTrp: 1.027 ± 0.353
1.027GlnTyr: 1.027 ± 0.677
0.0GlnXaa: 0.0 ± 0.0
Arg
1.746ArgAla: 1.746 ± 0.509
1.13ArgCys: 1.13 ± 0.167
1.13ArgAsp: 1.13 ± 0.238
2.465ArgGlu: 2.465 ± 0.206
2.568ArgPhe: 2.568 ± 1.344
2.157ArgGly: 2.157 ± 0.519
0.616ArgHis: 0.616 ± 0.106
1.232ArgIle: 1.232 ± 0.222
2.362ArgLys: 2.362 ± 0.511
3.286ArgLeu: 3.286 ± 0.569
0.205ArgMet: 0.205 ± 0.188
2.362ArgAsn: 2.362 ± 1.253
1.849ArgPro: 1.849 ± 0.176
1.438ArgGln: 1.438 ± 0.892
2.054ArgArg: 2.054 ± 1.793
3.903ArgSer: 3.903 ± 1.502
1.643ArgThr: 1.643 ± 0.239
4.108ArgVal: 4.108 ± 0.582
0.205ArgTrp: 0.205 ± 0.071
2.157ArgTyr: 2.157 ± 0.537
0.0ArgXaa: 0.0 ± 0.0
Ser
3.184SerAla: 3.184 ± 0.568
2.157SerCys: 2.157 ± 0.464
3.184SerAsp: 3.184 ± 0.28
2.876SerGlu: 2.876 ± 1.506
3.903SerPhe: 3.903 ± 0.38
4.622SerGly: 4.622 ± 0.351
0.822SerHis: 0.822 ± 0.313
2.259SerIle: 2.259 ± 0.322
3.697SerLys: 3.697 ± 0.773
6.367SerLeu: 6.367 ± 0.763
2.054SerMet: 2.054 ± 0.252
1.746SerAsn: 1.746 ± 0.804
2.568SerPro: 2.568 ± 0.369
2.054SerGln: 2.054 ± 0.5
2.773SerArg: 2.773 ± 1.118
3.903SerSer: 3.903 ± 0.665
2.773SerThr: 2.773 ± 0.51
8.627SerVal: 8.627 ± 0.999
1.027SerTrp: 1.027 ± 0.192
3.286SerTyr: 3.286 ± 0.893
0.0SerXaa: 0.0 ± 0.0
Thr
2.67ThrAla: 2.67 ± 0.531
0.719ThrCys: 0.719 ± 0.107
1.335ThrAsp: 1.335 ± 0.299
1.027ThrGlu: 1.027 ± 0.285
1.849ThrPhe: 1.849 ± 0.399
2.259ThrGly: 2.259 ± 0.532
0.205ThrHis: 0.205 ± 0.188
2.259ThrIle: 2.259 ± 0.92
1.541ThrLys: 1.541 ± 0.381
3.184ThrLeu: 3.184 ± 0.327
0.719ThrMet: 0.719 ± 0.493
1.438ThrAsn: 1.438 ± 0.329
2.259ThrPro: 2.259 ± 0.388
1.13ThrGln: 1.13 ± 0.446
1.335ThrArg: 1.335 ± 0.328
1.951ThrSer: 1.951 ± 0.225
2.157ThrThr: 2.157 ± 0.48
2.978ThrVal: 2.978 ± 0.381
0.616ThrTrp: 0.616 ± 0.212
1.643ThrTyr: 1.643 ± 0.468
0.0ThrXaa: 0.0 ± 0.0
Val
6.676ValAla: 6.676 ± 0.932
2.259ValCys: 2.259 ± 0.651
5.032ValAsp: 5.032 ± 0.352
7.189ValGlu: 7.189 ± 1.171
8.524ValPhe: 8.524 ± 2.002
6.984ValGly: 6.984 ± 0.771
1.438ValHis: 1.438 ± 0.295
4.93ValIle: 4.93 ± 1.411
7.703ValLys: 7.703 ± 0.599
9.243ValLeu: 9.243 ± 0.786
2.362ValMet: 2.362 ± 0.353
5.34ValAsn: 5.34 ± 0.45
5.238ValPro: 5.238 ± 0.652
2.978ValGln: 2.978 ± 0.67
2.465ValArg: 2.465 ± 0.527
6.47ValSer: 6.47 ± 1.027
3.081ValThr: 3.081 ± 0.916
9.654ValVal: 9.654 ± 0.862
1.335ValTrp: 1.335 ± 0.298
6.573ValTyr: 6.573 ± 0.846
0.103ValXaa: 0.103 ± 0.068
Trp
0.0TrpAla: 0.0 ± 0.0
0.924TrpCys: 0.924 ± 0.165
0.103TrpAsp: 0.103 ± 0.214
0.308TrpGlu: 0.308 ± 0.1
0.514TrpPhe: 0.514 ± 0.637
0.514TrpGly: 0.514 ± 0.16
0.514TrpHis: 0.514 ± 0.219
1.027TrpIle: 1.027 ± 0.32
0.616TrpLys: 0.616 ± 0.201
2.259TrpLeu: 2.259 ± 0.323
0.308TrpMet: 0.308 ± 0.1
0.103TrpAsn: 0.103 ± 0.068
0.205TrpPro: 0.205 ± 0.188
0.411TrpGln: 0.411 ± 0.377
0.719TrpArg: 0.719 ± 0.219
0.719TrpSer: 0.719 ± 0.226
0.514TrpThr: 0.514 ± 0.159
0.308TrpVal: 0.308 ± 0.172
0.616TrpTrp: 0.616 ± 0.212
1.335TrpTyr: 1.335 ± 0.228
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.876TyrAla: 2.876 ± 0.515
1.746TyrCys: 1.746 ± 0.251
3.286TyrAsp: 3.286 ± 0.338
4.519TyrGlu: 4.519 ± 0.832
3.081TyrPhe: 3.081 ± 0.347
4.416TyrGly: 4.416 ± 0.675
0.308TyrHis: 0.308 ± 0.692
2.054TyrIle: 2.054 ± 0.435
5.238TyrLys: 5.238 ± 1.057
3.697TyrLeu: 3.697 ± 0.453
1.951TyrMet: 1.951 ± 0.383
3.595TyrAsn: 3.595 ± 0.582
0.411TyrPro: 0.411 ± 0.31
2.157TyrGln: 2.157 ± 0.743
1.951TyrArg: 1.951 ± 0.884
2.876TyrSer: 2.876 ± 0.802
1.849TyrThr: 1.849 ± 0.681
4.519TyrVal: 4.519 ± 0.722
0.411TyrTrp: 0.411 ± 0.148
2.67TyrTyr: 2.67 ± 0.432
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.103XaaGlu: 0.103 ± 0.068
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (9738 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski