Amino acid dipeptide frequency for Avian infectious bronchitis virus (strain Beaudette) (IBV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
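The expectation above can be illustrated with a minimal Python sketch. It counts overlapping residue pairs within each protein, normalises by the total number of pairs, and compares each value with the uniform expectation. The function names, the one-letter alphabet, the mapping of unknown residues to `X`, and the simple above/below rule are assumptions made for illustration; the database's exact counting, normalisation and colouring conventions may differ.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); 'X' stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted within each protein only (no pair spans
    two proteins) and normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the standard alphabet to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def expected_uniform_per_mille(alphabet_size=20):
    """Frequency of each dipeptide if all residues were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq, expected):
    """Hypothetical rule: 'over' above the expectation, 'under' below it."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    expected = expected_uniform_per_mille()       # 20 x 20 = 400 dipeptides
    print(expected)                               # 2.5
    print(classify(11.136, expected))             # ValVal from the table
    print(classify(0.078, expected))              # HisTyr from the table
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰, instead of 2.5‰.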

All values are presented in per mille (‰); to obtain fractions, multiply them by 10^-3.

For more information, see the sequence space article on Wikipedia.
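As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input value is the ValVal entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    val_val = 11.136  # ValVal frequency in per mille, taken from the table
    print(per_mille_to_fraction(val_val))
    print(per_mille_to_percent(val_val))
```

In percent, the ValVal value is about 1.11%, which can be compared directly with the 0.25% expected for a uniformly random dipeptide.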

Dipeptide (first residue, second residue): frequency in ‰ ± error, grouped by first residue
Ala
AlaAla: 5.333 ± 0.856
AlaCys: 2.588 ± 0.21
AlaAsp: 3.059 ± 0.354
AlaGlu: 2.353 ± 0.225
AlaPhe: 2.666 ± 0.186
AlaGly: 3.764 ± 0.467
AlaHis: 0.941 ± 0.065
AlaIle: 5.019 ± 0.469
AlaLys: 5.96 ± 0.624
AlaLeu: 5.647 ± 0.386
AlaMet: 1.49 ± 0.124
AlaAsn: 3.608 ± 0.46
AlaPro: 2.039 ± 0.312
AlaGln: 2.196 ± 0.128
AlaArg: 2.353 ± 0.24
AlaSer: 4.392 ± 0.319
AlaThr: 4.549 ± 0.226
AlaVal: 5.254 ± 0.503
AlaTrp: 1.098 ± 0.088
AlaTyr: 3.451 ± 0.419
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.804 ± 0.259
CysCys: 1.255 ± 0.285
CysAsp: 2.117 ± 0.204
CysGlu: 1.647 ± 0.255
CysPhe: 1.804 ± 0.202
CysGly: 2.902 ± 0.434
CysHis: 0.627 ± 0.111
CysIle: 1.647 ± 0.225
CysLys: 2.353 ± 0.215
CysLeu: 2.039 ± 0.204
CysMet: 0.235 ± 0.046
CysAsn: 1.49 ± 0.422
CysPro: 0.863 ± 0.154
CysGln: 1.176 ± 0.149
CysArg: 1.333 ± 0.127
CysSer: 1.569 ± 0.254
CysThr: 1.725 ± 0.448
CysVal: 3.059 ± 0.257
CysTrp: 0.627 ± 0.111
CysTyr: 2.588 ± 0.223
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.706 ± 0.434
AspCys: 1.255 ± 0.392
AspAsp: 3.059 ± 0.576
AspGlu: 3.372 ± 0.582
AspPhe: 3.608 ± 0.298
AspGly: 4.392 ± 0.468
AspHis: 0.471 ± 0.205
AspIle: 2.196 ± 0.446
AspLys: 3.137 ± 0.23
AspLeu: 3.764 ± 0.302
AspMet: 0.471 ± 0.201
AspAsn: 2.902 ± 0.259
AspPro: 2.274 ± 0.387
AspGln: 2.196 ± 0.509
AspArg: 1.804 ± 0.117
AspSer: 4.235 ± 0.4
AspThr: 2.98 ± 0.16
AspVal: 5.254 ± 0.635
AspTrp: 0.392 ± 0.068
AspTyr: 2.902 ± 0.444
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.372 ± 0.489
GluCys: 1.255 ± 0.164
GluAsp: 3.608 ± 0.327
GluGlu: 3.294 ± 0.477
GluPhe: 2.745 ± 0.459
GluGly: 2.51 ± 0.441
GluHis: 0.863 ± 0.167
GluIle: 2.196 ± 0.179
GluLys: 3.137 ± 0.333
GluLeu: 4.47 ± 0.271
GluMet: 0.392 ± 0.307
GluAsn: 2.353 ± 0.184
GluPro: 2.51 ± 0.276
GluGln: 2.274 ± 0.259
GluArg: 1.569 ± 0.22
GluSer: 1.882 ± 0.612
GluThr: 2.431 ± 0.32
GluVal: 4.0 ± 0.521
GluTrp: 0.157 ± 0.037
GluTyr: 2.039 ± 0.226
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.431 ± 0.311
PheCys: 2.431 ± 0.309
PheAsp: 3.686 ± 0.438
PheGlu: 3.764 ± 0.373
PhePhe: 1.961 ± 0.238
PheGly: 2.588 ± 0.201
PheHis: 0.157 ± 0.093
PheIle: 3.608 ± 0.421
PheLys: 4.549 ± 0.379
PheLeu: 4.784 ± 0.61
PheMet: 0.784 ± 0.138
PheAsn: 3.294 ± 0.694
PhePro: 0.235 ± 0.117
PheGln: 1.725 ± 0.139
PheArg: 1.333 ± 0.387
PheSer: 3.686 ± 0.368
PheThr: 3.137 ± 0.339
PheVal: 7.215 ± 0.486
PheTrp: 1.49 ± 0.423
PheTyr: 2.745 ± 0.198
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.372 ± 0.236
GlyCys: 1.569 ± 0.472
GlyAsp: 4.47 ± 0.537
GlyGlu: 2.666 ± 0.278
GlyPhe: 4.235 ± 0.489
GlyGly: 4.313 ± 0.606
GlyHis: 1.804 ± 0.253
GlyIle: 2.823 ± 0.389
GlyLys: 3.843 ± 0.328
GlyLeu: 4.078 ± 0.405
GlyMet: 1.255 ± 0.167
GlyAsn: 3.608 ± 0.531
GlyPro: 1.569 ± 0.472
GlyGln: 1.098 ± 0.377
GlyArg: 1.804 ± 0.781
GlySer: 5.411 ± 0.47
GlyThr: 3.215 ± 0.231
GlyVal: 7.764 ± 0.756
GlyTrp: 0.784 ± 0.162
GlyTyr: 1.882 ± 0.273
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.098 ± 0.268
HisCys: 1.176 ± 0.149
HisAsp: 0.471 ± 0.092
HisGlu: 1.02 ± 0.234
HisPhe: 1.176 ± 0.228
HisGly: 1.412 ± 0.167
HisHis: 0.392 ± 0.068
HisIle: 1.098 ± 0.277
HisLys: 1.098 ± 0.26
HisLeu: 1.49 ± 0.133
HisMet: 0.549 ± 0.105
HisAsn: 1.49 ± 0.214
HisPro: 0.941 ± 0.119
HisGln: 0.314 ± 0.086
HisArg: 0.235 ± 0.112
HisSer: 0.549 ± 0.144
HisThr: 1.176 ± 0.231
HisVal: 1.804 ± 0.253
HisTrp: 0.157 ± 0.037
HisTyr: 0.078 ± 0.048
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.0 ± 0.112
IleCys: 0.706 ± 0.153
IleAsp: 2.823 ± 0.378
IleGlu: 1.725 ± 0.92
IlePhe: 3.137 ± 0.229
IleGly: 2.353 ± 0.381
IleHis: 0.941 ± 0.175
IleIle: 2.274 ± 1.066
IleLys: 2.666 ± 0.399
IleLeu: 6.431 ± 0.343
IleMet: 0.549 ± 0.082
IleAsn: 2.117 ± 0.268
IlePro: 2.823 ± 0.118
IleGln: 2.353 ± 0.905
IleArg: 1.49 ± 0.169
IleSer: 2.823 ± 0.505
IleThr: 4.313 ± 0.817
IleVal: 5.49 ± 0.387
IleTrp: 0.549 ± 0.1
IleTyr: 3.451 ± 0.366
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.568 ± 0.599
LysCys: 2.431 ± 0.246
LysAsp: 3.843 ± 0.39
LysGlu: 2.902 ± 0.176
LysPhe: 4.157 ± 0.34
LysGly: 3.764 ± 0.334
LysHis: 1.333 ± 0.269
LysIle: 2.51 ± 0.28
LysLys: 3.294 ± 0.544
LysLeu: 5.411 ± 0.63
LysMet: 1.49 ± 0.218
LysAsn: 3.059 ± 0.231
LysPro: 3.215 ± 0.354
LysGln: 2.666 ± 0.287
LysArg: 1.882 ± 0.313
LysSer: 4.627 ± 0.317
LysThr: 2.745 ± 0.226
LysVal: 5.333 ± 0.444
LysTrp: 1.098 ± 0.205
LysTyr: 2.98 ± 0.333
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.137 ± 0.619
LeuCys: 2.666 ± 0.333
LeuAsp: 4.235 ± 0.519
LeuGlu: 3.059 ± 0.447
LeuPhe: 5.176 ± 0.602
LeuGly: 4.706 ± 0.393
LeuHis: 2.353 ± 0.297
LeuIle: 5.254 ± 0.498
LeuLys: 6.196 ± 0.737
LeuLeu: 7.372 ± 0.767
LeuMet: 1.961 ± 0.225
LeuAsn: 3.921 ± 0.973
LeuPro: 3.686 ± 0.335
LeuGln: 3.764 ± 0.758
LeuArg: 2.902 ± 0.482
LeuSer: 4.941 ± 0.382
LeuThr: 5.098 ± 0.723
LeuVal: 7.294 ± 0.532
LeuTrp: 1.02 ± 0.412
LeuTyr: 4.784 ± 0.454
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.725 ± 0.249
MetCys: 0.314 ± 0.073
MetAsp: 0.941 ± 0.119
MetGlu: 0.392 ± 0.094
MetPhe: 1.569 ± 0.292
MetGly: 1.02 ± 0.179
MetHis: 0.627 ± 0.111
MetIle: 0.235 ± 0.303
MetLys: 0.314 ± 0.141
MetLeu: 1.804 ± 0.257
MetMet: 0.235 ± 0.14
MetAsn: 0.706 ± 0.22
MetPro: 0.863 ± 0.268
MetGln: 0.627 ± 0.147
MetArg: 0.863 ± 0.103
MetSer: 1.255 ± 0.191
MetThr: 1.098 ± 0.164
MetVal: 1.49 ± 0.274
MetTrp: 0.0 ± 0.0
MetTyr: 1.255 ± 0.23
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.529 ± 0.168
AsnCys: 2.353 ± 0.184
AsnAsp: 1.882 ± 0.44
AsnGlu: 2.039 ± 0.423
AsnPhe: 3.294 ± 0.663
AsnGly: 4.0 ± 0.604
AsnHis: 0.627 ± 0.115
AsnIle: 3.764 ± 0.289
AsnLys: 2.902 ± 0.298
AsnLeu: 5.803 ± 0.71
AsnMet: 0.627 ± 0.316
AsnAsn: 3.215 ± 0.412
AsnPro: 2.039 ± 0.299
AsnGln: 1.49 ± 0.24
AsnArg: 1.02 ± 0.325
AsnSer: 3.686 ± 0.204
AsnThr: 2.98 ± 0.261
AsnVal: 3.843 ± 0.728
AsnTrp: 1.098 ± 0.207
AsnTyr: 2.353 ± 0.306
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.961 ± 0.404
ProCys: 0.549 ± 0.148
ProAsp: 2.039 ± 0.567
ProGlu: 2.274 ± 0.28
ProPhe: 1.49 ± 0.213
ProGly: 2.196 ± 0.297
ProHis: 0.549 ± 0.13
ProIle: 1.961 ± 0.123
ProLys: 2.98 ± 0.976
ProLeu: 3.451 ± 0.214
ProMet: 0.392 ± 0.158
ProAsn: 2.274 ± 0.224
ProPro: 2.274 ± 0.435
ProGln: 1.961 ± 0.208
ProArg: 1.49 ± 0.135
ProSer: 1.961 ± 0.419
ProThr: 2.823 ± 0.282
ProVal: 3.059 ± 0.308
ProTrp: 0.627 ± 0.083
ProTyr: 1.255 ± 0.191
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.274 ± 0.498
GlnCys: 0.941 ± 0.198
GlnAsp: 2.039 ± 0.223
GlnGlu: 2.353 ± 0.283
GlnPhe: 1.725 ± 0.329
GlnGly: 1.804 ± 0.433
GlnHis: 0.941 ± 0.33
GlnIle: 1.804 ± 0.31
GlnLys: 2.431 ± 0.306
GlnLeu: 3.294 ± 0.286
GlnMet: 0.392 ± 0.132
GlnAsn: 1.098 ± 0.362
GlnPro: 0.784 ± 0.183
GlnGln: 1.961 ± 0.575
GlnArg: 1.333 ± 0.166
GlnSer: 2.666 ± 0.861
GlnThr: 2.274 ± 0.373
GlnVal: 2.51 ± 0.311
GlnTrp: 1.255 ± 0.166
GlnTyr: 1.725 ± 0.325
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.823 ± 0.202
ArgCys: 0.941 ± 0.195
ArgAsp: 1.882 ± 0.178
ArgGlu: 1.882 ± 0.216
ArgPhe: 1.569 ± 0.344
ArgGly: 2.431 ± 0.428
ArgHis: 0.784 ± 0.108
ArgIle: 1.569 ± 0.233
ArgLys: 2.039 ± 0.152
ArgLeu: 2.431 ± 0.241
ArgMet: 0.314 ± 0.18
ArgAsn: 2.745 ± 0.449
ArgPro: 1.02 ± 0.287
ArgGln: 0.941 ± 0.237
ArgArg: 1.412 ± 0.416
ArgSer: 2.431 ± 0.71
ArgThr: 1.098 ± 0.266
ArgVal: 3.215 ± 0.37
ArgTrp: 0.078 ± 0.117
ArgTyr: 1.333 ± 0.327
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.529 ± 0.365
SerCys: 1.49 ± 0.238
SerAsp: 2.98 ± 0.457
SerGlu: 2.274 ± 0.414
SerPhe: 5.098 ± 0.419
SerGly: 5.333 ± 0.536
SerHis: 0.863 ± 0.147
SerIle: 4.627 ± 0.308
SerLys: 4.313 ± 0.406
SerLeu: 5.882 ± 0.705
SerMet: 0.784 ± 0.122
SerAsn: 3.137 ± 0.291
SerPro: 1.804 ± 0.273
SerGln: 2.196 ± 0.32
SerArg: 2.196 ± 0.88
SerSer: 5.725 ± 1.205
SerThr: 3.294 ± 0.309
SerVal: 6.98 ± 0.704
SerTrp: 0.784 ± 0.248
SerTyr: 2.745 ± 0.584
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.549 ± 0.633
ThrCys: 1.882 ± 0.274
ThrAsp: 2.666 ± 0.278
ThrGlu: 2.98 ± 0.229
ThrPhe: 2.588 ± 0.187
ThrGly: 3.608 ± 0.889
ThrHis: 0.392 ± 0.094
ThrIle: 2.431 ± 0.224
ThrLys: 2.666 ± 0.183
ThrLeu: 4.627 ± 0.279
ThrMet: 1.804 ± 0.199
ThrAsn: 2.98 ± 0.302
ThrPro: 3.529 ± 0.343
ThrGln: 2.745 ± 0.297
ThrArg: 2.117 ± 0.331
ThrSer: 4.784 ± 0.559
ThrThr: 3.451 ± 0.39
ThrVal: 6.666 ± 0.584
ThrTrp: 0.706 ± 0.118
ThrTyr: 1.569 ± 0.187
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.019 ± 0.317
ValCys: 3.921 ± 0.45
ValAsp: 6.509 ± 0.719
ValGlu: 4.627 ± 0.737
ValPhe: 4.47 ± 0.354
ValGly: 4.862 ± 0.292
ValHis: 1.882 ± 0.334
ValIle: 6.039 ± 0.682
ValLys: 5.96 ± 0.712
ValLeu: 9.097 ± 0.866
ValMet: 1.804 ± 0.339
ValAsn: 4.706 ± 0.882
ValPro: 3.372 ± 0.388
ValGln: 3.059 ± 0.231
ValArg: 2.823 ± 0.501
ValSer: 6.352 ± 0.791
ValThr: 5.803 ± 0.631
ValVal: 11.136 ± 1.386
ValTrp: 1.098 ± 0.185
ValTyr: 3.686 ± 0.456
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.314 ± 0.073
TrpCys: 0.549 ± 0.278
TrpAsp: 0.471 ± 0.227
TrpGlu: 0.706 ± 0.26
TrpPhe: 1.098 ± 0.088
TrpGly: 0.706 ± 0.206
TrpHis: 0.471 ± 0.149
TrpIle: 0.392 ± 0.158
TrpLys: 0.784 ± 0.137
TrpLeu: 2.196 ± 0.356
TrpMet: 0.314 ± 0.073
TrpAsn: 1.02 ± 0.147
TrpPro: 0.314 ± 0.172
TrpGln: 0.078 ± 0.107
TrpArg: 0.549 ± 0.098
TrpSer: 0.941 ± 0.096
TrpThr: 0.157 ± 0.095
TrpVal: 1.412 ± 0.119
TrpTrp: 0.392 ± 0.154
TrpTyr: 0.941 ± 0.152
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.215 ± 0.181
TyrCys: 2.588 ± 0.585
TyrAsp: 2.588 ± 0.759
TyrGlu: 1.725 ± 0.358
TyrPhe: 2.039 ± 0.214
TyrGly: 2.902 ± 0.4
TyrHis: 0.706 ± 0.261
TyrIle: 1.569 ± 0.377
TyrLys: 3.686 ± 0.377
TyrLeu: 3.451 ± 0.398
TyrMet: 1.412 ± 0.24
TyrAsn: 2.823 ± 0.591
TyrPro: 1.412 ± 0.279
TyrGln: 0.784 ± 0.296
TyrArg: 2.431 ± 0.336
TyrSer: 2.196 ± 0.263
TyrThr: 4.392 ± 0.492
TyrVal: 3.451 ± 0.376
TyrTrp: 0.392 ± 0.238
TyrTyr: 2.666 ± 0.318
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (12752 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
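Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values serves as the error. This is a minimal illustration, not the database's actual code; the function names, the fixed seed, the normalisation by the number of overlapping pairs, and the use of the sample standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (same number of proteins).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAVVLK", "VVAAG", "MKVLVV", "GAVLA"]  # made-up sequences
    print(per_mille(toy_proteome, "VV"))
    print(bootstrap_error(toy_proteome, "VV"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the statistic depends on which proteins happen to be in the proteome. With only 10 proteins, as here, a single large protein can dominate the spread.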

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski