Amino acid dipepetide frequency for Alpaca respiratory coronavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.664AlaAla: 4.664 ± 0.866
1.71AlaCys: 1.71 ± 0.387
2.487AlaAsp: 2.487 ± 0.995
3.265AlaGlu: 3.265 ± 0.562
4.819AlaPhe: 4.819 ± 1.148
4.042AlaGly: 4.042 ± 0.813
0.933AlaHis: 0.933 ± 0.357
4.975AlaIle: 4.975 ± 0.883
4.12AlaLys: 4.12 ± 1.167
5.13AlaLeu: 5.13 ± 1.097
2.487AlaMet: 2.487 ± 0.419
4.042AlaAsn: 4.042 ± 0.32
1.943AlaPro: 1.943 ± 0.285
1.943AlaGln: 1.943 ± 0.2
1.71AlaArg: 1.71 ± 0.33
4.197AlaSer: 4.197 ± 0.543
3.731AlaThr: 3.731 ± 0.946
8.084AlaVal: 8.084 ± 1.484
0.933AlaTrp: 0.933 ± 0.22
3.964AlaTyr: 3.964 ± 0.815
0.0AlaXaa: 0.0 ± 0.0
Cys
2.954CysAla: 2.954 ± 0.662
1.088CysCys: 1.088 ± 0.538
1.866CysAsp: 1.866 ± 0.554
1.088CysGlu: 1.088 ± 0.254
1.788CysPhe: 1.788 ± 0.291
2.099CysGly: 2.099 ± 0.485
0.078CysHis: 0.078 ± 0.323
1.632CysIle: 1.632 ± 0.288
1.866CysLys: 1.866 ± 0.442
1.943CysLeu: 1.943 ± 0.437
0.311CysMet: 0.311 ± 0.129
2.487CysAsn: 2.487 ± 0.511
0.7CysPro: 0.7 ± 0.135
0.389CysGln: 0.389 ± 0.126
1.477CysArg: 1.477 ± 0.413
1.866CysSer: 1.866 ± 0.406
3.42CysThr: 3.42 ± 0.666
3.576CysVal: 3.576 ± 0.477
0.7CysTrp: 0.7 ± 0.273
1.788CysTyr: 1.788 ± 0.569
0.0CysXaa: 0.0 ± 0.0
Asp
4.353AspAla: 4.353 ± 0.49
1.788AspCys: 1.788 ± 0.438
2.798AspAsp: 2.798 ± 0.726
2.332AspGlu: 2.332 ± 0.402
3.809AspPhe: 3.809 ± 0.867
4.742AspGly: 4.742 ± 0.392
1.632AspHis: 1.632 ± 0.433
3.809AspIle: 3.809 ± 0.752
2.954AspLys: 2.954 ± 0.756
3.653AspLeu: 3.653 ± 0.685
0.933AspMet: 0.933 ± 0.281
2.798AspAsn: 2.798 ± 0.736
1.399AspPro: 1.399 ± 0.611
1.166AspGln: 1.166 ± 0.52
1.321AspArg: 1.321 ± 0.256
2.332AspSer: 2.332 ± 1.121
2.487AspThr: 2.487 ± 0.518
5.286AspVal: 5.286 ± 0.991
1.166AspTrp: 1.166 ± 0.443
3.342AspTyr: 3.342 ± 0.97
0.0AspXaa: 0.0 ± 0.0
Glu
1.866GluAla: 1.866 ± 0.129
1.01GluCys: 1.01 ± 0.233
2.099GluAsp: 2.099 ± 0.485
2.643GluGlu: 2.643 ± 0.602
2.487GluPhe: 2.487 ± 0.528
3.031GluGly: 3.031 ± 0.276
1.477GluHis: 1.477 ± 0.456
1.477GluIle: 1.477 ± 0.405
2.721GluLys: 2.721 ± 0.297
3.809GluLeu: 3.809 ± 0.315
0.933GluMet: 0.933 ± 0.17
3.109GluAsn: 3.109 ± 0.491
1.477GluPro: 1.477 ± 1.185
1.71GluGln: 1.71 ± 0.509
1.555GluArg: 1.555 ± 0.211
2.876GluSer: 2.876 ± 0.318
2.332GluThr: 2.332 ± 0.596
3.731GluVal: 3.731 ± 0.907
0.7GluTrp: 0.7 ± 0.273
1.01GluTyr: 1.01 ± 0.313
0.0GluXaa: 0.0 ± 0.0
Phe
3.109PheAla: 3.109 ± 0.341
2.41PheCys: 2.41 ± 0.402
4.275PheAsp: 4.275 ± 0.914
3.265PheGlu: 3.265 ± 0.531
2.021PhePhe: 2.021 ± 0.38
5.519PheGly: 5.519 ± 0.532
0.155PheHis: 0.155 ± 0.102
3.031PheIle: 3.031 ± 0.64
2.721PheLys: 2.721 ± 0.396
3.653PheLeu: 3.653 ± 0.577
1.321PheMet: 1.321 ± 0.222
3.265PheAsn: 3.265 ± 0.496
0.622PhePro: 0.622 ± 0.254
0.622PheGln: 0.622 ± 0.507
1.321PheArg: 1.321 ± 0.638
4.197PheSer: 4.197 ± 0.776
3.031PheThr: 3.031 ± 0.612
9.017PheVal: 9.017 ± 1.181
0.7PheTrp: 0.7 ± 0.265
2.643PheTyr: 2.643 ± 0.606
0.0PheXaa: 0.0 ± 0.0
Gly
3.576GlyAla: 3.576 ± 0.52
2.099GlyCys: 2.099 ± 0.547
4.586GlyAsp: 4.586 ± 0.682
1.866GlyGlu: 1.866 ± 0.369
4.508GlyPhe: 4.508 ± 0.674
4.12GlyGly: 4.12 ± 0.94
0.933GlyHis: 0.933 ± 0.404
3.031GlyIle: 3.031 ± 0.574
4.197GlyLys: 4.197 ± 1.078
5.519GlyLeu: 5.519 ± 0.704
1.01GlyMet: 1.01 ± 0.459
3.653GlyAsn: 3.653 ± 0.916
2.099GlyPro: 2.099 ± 0.493
0.933GlyGln: 0.933 ± 0.318
1.866GlyArg: 1.866 ± 0.84
5.208GlySer: 5.208 ± 0.533
3.498GlyThr: 3.498 ± 0.713
8.006GlyVal: 8.006 ± 0.518
0.7GlyTrp: 0.7 ± 0.265
3.109GlyTyr: 3.109 ± 0.445
0.0GlyXaa: 0.0 ± 0.0
His
1.244HisAla: 1.244 ± 0.455
0.389HisCys: 0.389 ± 0.177
1.166HisAsp: 1.166 ± 0.303
1.166HisGlu: 1.166 ± 0.328
1.166HisPhe: 1.166 ± 0.192
0.933HisGly: 0.933 ± 0.301
0.078HisHis: 0.078 ± 0.051
0.933HisIle: 0.933 ± 0.948
1.088HisLys: 1.088 ± 0.221
1.321HisLeu: 1.321 ± 0.437
0.155HisMet: 0.155 ± 0.501
0.933HisAsn: 0.933 ± 0.297
0.544HisPro: 0.544 ± 0.483
0.233HisGln: 0.233 ± 0.09
0.544HisArg: 0.544 ± 0.219
1.01HisSer: 1.01 ± 0.233
1.088HisThr: 1.088 ± 0.224
1.632HisVal: 1.632 ± 0.627
0.155HisTrp: 0.155 ± 0.065
1.088HisTyr: 1.088 ± 0.35
0.0HisXaa: 0.0 ± 0.0
Ile
3.187IleAla: 3.187 ± 0.475
0.933IleCys: 0.933 ± 0.357
3.42IleAsp: 3.42 ± 0.556
1.866IleGlu: 1.866 ± 0.666
2.876IlePhe: 2.876 ± 0.609
2.565IleGly: 2.565 ± 0.442
0.389IleHis: 0.389 ± 0.202
2.876IleIle: 2.876 ± 1.427
3.42IleLys: 3.42 ± 0.97
4.664IleLeu: 4.664 ± 0.517
1.399IleMet: 1.399 ± 0.613
2.954IleAsn: 2.954 ± 0.768
2.332IlePro: 2.332 ± 0.93
2.487IleGln: 2.487 ± 1.03
1.244IleArg: 1.244 ± 0.249
3.887IleSer: 3.887 ± 0.774
3.031IleThr: 3.031 ± 1.251
5.363IleVal: 5.363 ± 0.675
0.544IleTrp: 0.544 ± 0.511
1.555IleTyr: 1.555 ± 0.613
0.0IleXaa: 0.0 ± 0.0
Lys
4.975LysAla: 4.975 ± 1.174
2.021LysCys: 2.021 ± 0.493
3.342LysAsp: 3.342 ± 0.474
2.721LysGlu: 2.721 ± 0.802
3.187LysPhe: 3.187 ± 0.736
2.332LysGly: 2.332 ± 0.525
2.487LysHis: 2.487 ± 0.407
2.565LysIle: 2.565 ± 0.468
1.632LysLys: 1.632 ± 0.364
5.674LysLeu: 5.674 ± 1.091
1.166LysMet: 1.166 ± 0.36
2.254LysAsn: 2.254 ± 0.555
4.275LysPro: 4.275 ± 0.859
1.632LysGln: 1.632 ± 0.452
2.099LysArg: 2.099 ± 0.691
4.508LysSer: 4.508 ± 0.899
3.342LysThr: 3.342 ± 0.246
5.441LysVal: 5.441 ± 1.224
1.01LysTrp: 1.01 ± 0.231
3.42LysTyr: 3.42 ± 0.66
0.0LysXaa: 0.0 ± 0.0
Leu
5.441LeuAla: 5.441 ± 0.962
3.731LeuCys: 3.731 ± 0.562
3.964LeuAsp: 3.964 ± 1.165
2.876LeuGlu: 2.876 ± 0.554
4.508LeuPhe: 4.508 ± 0.968
4.508LeuGly: 4.508 ± 0.499
2.099LeuHis: 2.099 ± 0.147
2.643LeuIle: 2.643 ± 1.001
7.151LeuLys: 7.151 ± 1.285
9.872LeuLeu: 9.872 ± 0.911
1.321LeuMet: 1.321 ± 0.491
5.441LeuAsn: 5.441 ± 0.934
3.498LeuPro: 3.498 ± 1.09
3.653LeuGln: 3.653 ± 0.677
2.643LeuArg: 2.643 ± 0.514
7.618LeuSer: 7.618 ± 0.912
5.208LeuThr: 5.208 ± 0.602
5.674LeuVal: 5.674 ± 1.778
1.244LeuTrp: 1.244 ± 0.56
3.42LeuTyr: 3.42 ± 0.25
0.0LeuXaa: 0.0 ± 0.0
Met
1.866MetAla: 1.866 ± 0.696
0.933MetCys: 0.933 ± 0.359
1.244MetAsp: 1.244 ± 0.485
0.311MetGlu: 0.311 ± 0.131
1.632MetPhe: 1.632 ± 0.455
1.321MetGly: 1.321 ± 0.503
0.777MetHis: 0.777 ± 0.433
1.166MetIle: 1.166 ± 0.446
0.855MetLys: 0.855 ± 0.239
2.643MetLeu: 2.643 ± 0.15
0.389MetMet: 0.389 ± 0.177
0.544MetAsn: 0.544 ± 0.134
1.166MetPro: 1.166 ± 0.283
0.7MetGln: 0.7 ± 0.461
1.088MetArg: 1.088 ± 0.414
0.777MetSer: 0.777 ± 0.388
1.244MetThr: 1.244 ± 0.637
1.555MetVal: 1.555 ± 0.291
0.078MetTrp: 0.078 ± 0.244
1.244MetTyr: 1.244 ± 0.23
0.0MetXaa: 0.0 ± 0.0
Asn
4.12AsnAla: 4.12 ± 0.677
1.943AsnCys: 1.943 ± 0.441
2.099AsnAsp: 2.099 ± 0.293
2.954AsnGlu: 2.954 ± 0.517
2.487AsnPhe: 2.487 ± 1.205
6.529AsnGly: 6.529 ± 0.868
0.466AsnHis: 0.466 ± 0.146
3.342AsnIle: 3.342 ± 0.685
2.798AsnLys: 2.798 ± 0.515
4.275AsnLeu: 4.275 ± 0.427
1.321AsnMet: 1.321 ± 0.382
2.954AsnAsn: 2.954 ± 0.444
1.632AsnPro: 1.632 ± 0.395
1.399AsnGln: 1.399 ± 0.545
1.866AsnArg: 1.866 ± 0.375
4.042AsnSer: 4.042 ± 0.653
3.187AsnThr: 3.187 ± 0.652
6.918AsnVal: 6.918 ± 1.03
0.933AsnTrp: 0.933 ± 0.849
1.477AsnTyr: 1.477 ± 0.496
0.0AsnXaa: 0.0 ± 0.0
Pro
2.176ProAla: 2.176 ± 0.616
0.933ProCys: 0.933 ± 0.203
1.477ProAsp: 1.477 ± 0.274
2.099ProGlu: 2.099 ± 0.327
1.788ProPhe: 1.788 ± 0.28
2.176ProGly: 2.176 ± 0.308
0.622ProHis: 0.622 ± 0.48
2.021ProIle: 2.021 ± 0.335
1.632ProLys: 1.632 ± 0.615
2.954ProLeu: 2.954 ± 0.351
0.155ProMet: 0.155 ± 0.102
1.166ProAsn: 1.166 ± 0.57
1.166ProPro: 1.166 ± 0.316
1.166ProGln: 1.166 ± 0.959
1.632ProArg: 1.632 ± 1.132
3.031ProSer: 3.031 ± 1.346
2.254ProThr: 2.254 ± 0.505
3.265ProVal: 3.265 ± 0.359
0.777ProTrp: 0.777 ± 0.173
1.477ProTyr: 1.477 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
2.254GlnAla: 2.254 ± 0.689
0.544GlnCys: 0.544 ± 0.22
0.933GlnAsp: 0.933 ± 0.489
1.555GlnGlu: 1.555 ± 0.204
1.01GlnPhe: 1.01 ± 0.297
2.332GlnGly: 2.332 ± 0.471
0.155GlnHis: 0.155 ± 0.261
1.632GlnIle: 1.632 ± 0.463
1.321GlnLys: 1.321 ± 0.696
2.798GlnLeu: 2.798 ± 0.389
1.01GlnMet: 1.01 ± 0.242
1.088GlnAsn: 1.088 ± 0.164
1.555GlnPro: 1.555 ± 0.553
1.399GlnGln: 1.399 ± 0.581
1.244GlnArg: 1.244 ± 0.177
2.41GlnSer: 2.41 ± 1.839
2.176GlnThr: 2.176 ± 1.467
2.41GlnVal: 2.41 ± 0.443
0.078GlnTrp: 0.078 ± 0.051
1.088GlnTyr: 1.088 ± 0.53
0.0GlnXaa: 0.0 ± 0.0
Arg
2.643ArgAla: 2.643 ± 0.754
1.632ArgCys: 1.632 ± 0.334
1.088ArgAsp: 1.088 ± 0.306
0.622ArgGlu: 0.622 ± 0.469
2.643ArgPhe: 2.643 ± 0.387
2.176ArgGly: 2.176 ± 0.549
0.544ArgHis: 0.544 ± 0.21
1.555ArgIle: 1.555 ± 0.735
1.943ArgLys: 1.943 ± 0.556
3.498ArgLeu: 3.498 ± 0.684
1.088ArgMet: 1.088 ± 0.254
2.099ArgAsn: 2.099 ± 0.641
0.855ArgPro: 0.855 ± 0.4
1.555ArgGln: 1.555 ± 0.96
1.166ArgArg: 1.166 ± 0.476
1.632ArgSer: 1.632 ± 0.548
1.71ArgThr: 1.71 ± 0.262
2.643ArgVal: 2.643 ± 0.406
0.311ArgTrp: 0.311 ± 0.311
1.166ArgTyr: 1.166 ± 0.342
0.0ArgXaa: 0.0 ± 0.0
Ser
5.363SerAla: 5.363 ± 1.009
1.399SerCys: 1.399 ± 0.402
3.809SerAsp: 3.809 ± 0.755
2.332SerGlu: 2.332 ± 0.605
5.597SerPhe: 5.597 ± 0.968
5.363SerGly: 5.363 ± 0.689
1.166SerHis: 1.166 ± 0.245
3.498SerIle: 3.498 ± 0.963
4.197SerLys: 4.197 ± 1.314
5.286SerLeu: 5.286 ± 0.79
1.477SerMet: 1.477 ± 0.468
4.586SerAsn: 4.586 ± 0.456
1.788SerPro: 1.788 ± 1.212
2.099SerGln: 2.099 ± 1.8
2.41SerArg: 2.41 ± 2.537
5.208SerSer: 5.208 ± 1.322
4.586SerThr: 4.586 ± 0.987
8.006SerVal: 8.006 ± 1.529
0.7SerTrp: 0.7 ± 0.409
3.187SerTyr: 3.187 ± 0.419
0.0SerXaa: 0.0 ± 0.0
Thr
3.498ThrAla: 3.498 ± 0.617
2.021ThrCys: 2.021 ± 0.415
2.643ThrAsp: 2.643 ± 0.467
2.099ThrGlu: 2.099 ± 0.402
2.721ThrPhe: 2.721 ± 0.569
3.265ThrGly: 3.265 ± 1.199
1.01ThrHis: 1.01 ± 0.189
4.508ThrIle: 4.508 ± 0.767
4.12ThrLys: 4.12 ± 0.389
4.975ThrLeu: 4.975 ± 1.2
1.321ThrMet: 1.321 ± 0.476
2.798ThrAsn: 2.798 ± 0.537
2.176ThrPro: 2.176 ± 0.346
1.71ThrGln: 1.71 ± 0.241
1.632ThrArg: 1.632 ± 0.642
5.286ThrSer: 5.286 ± 2.346
3.653ThrThr: 3.653 ± 0.455
7.073ThrVal: 7.073 ± 0.925
0.855ThrTrp: 0.855 ± 0.27
2.021ThrTyr: 2.021 ± 0.618
0.0ThrXaa: 0.0 ± 0.0
Val
8.084ValAla: 8.084 ± 0.381
4.353ValCys: 4.353 ± 0.767
6.296ValAsp: 6.296 ± 0.457
5.597ValGlu: 5.597 ± 1.087
4.275ValPhe: 4.275 ± 1.033
4.819ValGly: 4.819 ± 0.818
0.933ValHis: 0.933 ± 0.29
3.809ValIle: 3.809 ± 0.472
8.317ValLys: 8.317 ± 2.306
9.483ValLeu: 9.483 ± 1.268
2.643ValMet: 2.643 ± 0.757
6.452ValAsn: 6.452 ± 1.03
2.954ValPro: 2.954 ± 0.49
3.031ValGln: 3.031 ± 0.468
3.653ValArg: 3.653 ± 0.326
7.695ValSer: 7.695 ± 0.984
6.141ValThr: 6.141 ± 0.709
10.105ValVal: 10.105 ± 0.851
1.166ValTrp: 1.166 ± 0.206
3.731ValTyr: 3.731 ± 0.573
0.0ValXaa: 0.0 ± 0.0
Trp
0.855TrpAla: 0.855 ± 0.738
0.466TrpCys: 0.466 ± 0.258
0.777TrpAsp: 0.777 ± 0.307
0.233TrpGlu: 0.233 ± 0.09
1.01TrpPhe: 1.01 ± 0.233
0.233TrpGly: 0.233 ± 0.237
0.389TrpHis: 0.389 ± 0.148
0.389TrpIle: 0.389 ± 0.148
0.389TrpLys: 0.389 ± 0.504
2.176TrpLeu: 2.176 ± 0.332
0.0TrpMet: 0.0 ± 0.0
0.933TrpAsn: 0.933 ± 0.924
0.7TrpPro: 0.7 ± 0.197
0.233TrpGln: 0.233 ± 0.09
0.7TrpArg: 0.7 ± 0.221
1.088TrpSer: 1.088 ± 0.538
0.7TrpThr: 0.7 ± 0.135
1.399TrpVal: 1.399 ± 0.418
0.544TrpTrp: 0.544 ± 0.134
0.389TrpTyr: 0.389 ± 0.148
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.954TyrAla: 2.954 ± 0.658
1.71TyrCys: 1.71 ± 0.337
3.731TyrAsp: 3.731 ± 0.67
1.166TyrGlu: 1.166 ± 0.531
2.565TyrPhe: 2.565 ± 0.48
2.41TyrGly: 2.41 ± 0.31
0.544TyrHis: 0.544 ± 0.51
2.099TyrIle: 2.099 ± 0.59
2.643TyrLys: 2.643 ± 0.415
3.187TyrLeu: 3.187 ± 0.662
1.01TyrMet: 1.01 ± 0.3
3.265TyrAsn: 3.265 ± 0.791
1.088TyrPro: 1.088 ± 0.365
0.933TyrGln: 0.933 ± 0.384
1.399TyrArg: 1.399 ± 0.296
3.187TyrSer: 3.187 ± 0.334
2.565TyrThr: 2.565 ± 0.895
4.353TyrVal: 4.353 ± 0.547
0.311TyrTrp: 0.311 ± 0.233
2.176TyrTyr: 2.176 ± 0.23
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (12866 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski