Amino acid dipepetide frequency for Bat mastadenovirus WIV11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.237AlaAla: 9.237 ± 1.524
1.421AlaCys: 1.421 ± 0.411
3.079AlaAsp: 3.079 ± 0.594
4.5AlaGlu: 4.5 ± 0.705
2.447AlaPhe: 2.447 ± 0.424
4.895AlaGly: 4.895 ± 0.566
1.737AlaHis: 1.737 ± 0.325
2.526AlaIle: 2.526 ± 0.494
2.921AlaLys: 2.921 ± 0.503
7.184AlaLeu: 7.184 ± 0.768
1.658AlaMet: 1.658 ± 0.338
3.553AlaAsn: 3.553 ± 0.582
5.447AlaPro: 5.447 ± 0.975
2.684AlaGln: 2.684 ± 0.407
5.21AlaArg: 5.21 ± 1.125
7.026AlaSer: 7.026 ± 0.718
4.105AlaThr: 4.105 ± 0.505
4.816AlaVal: 4.816 ± 0.642
1.342AlaTrp: 1.342 ± 0.323
2.368AlaTyr: 2.368 ± 0.573
0.0AlaXaa: 0.0 ± 0.0
Cys
1.105CysAla: 1.105 ± 0.29
0.789CysCys: 0.789 ± 0.258
0.474CysAsp: 0.474 ± 0.202
0.789CysGlu: 0.789 ± 0.237
0.868CysPhe: 0.868 ± 0.218
1.263CysGly: 1.263 ± 0.313
0.474CysHis: 0.474 ± 0.177
0.395CysIle: 0.395 ± 0.157
0.947CysLys: 0.947 ± 0.27
2.289CysLeu: 2.289 ± 0.483
0.553CysMet: 0.553 ± 0.143
1.105CysAsn: 1.105 ± 0.247
0.553CysPro: 0.553 ± 0.238
0.868CysGln: 0.868 ± 0.264
0.947CysArg: 0.947 ± 0.253
1.658CysSer: 1.658 ± 0.445
1.342CysThr: 1.342 ± 0.569
1.658CysVal: 1.658 ± 0.362
0.237CysTrp: 0.237 ± 0.119
0.947CysTyr: 0.947 ± 0.203
0.0CysXaa: 0.0 ± 0.0
Asp
3.237AspAla: 3.237 ± 0.666
0.711AspCys: 0.711 ± 0.197
2.289AspAsp: 2.289 ± 0.577
3.553AspGlu: 3.553 ± 0.561
2.21AspPhe: 2.21 ± 0.484
2.526AspGly: 2.526 ± 0.458
1.421AspHis: 1.421 ± 0.317
2.289AspIle: 2.289 ± 0.463
1.658AspLys: 1.658 ± 0.389
5.447AspLeu: 5.447 ± 0.697
0.711AspMet: 0.711 ± 0.225
1.816AspAsn: 1.816 ± 0.285
3.158AspPro: 3.158 ± 0.517
1.342AspGln: 1.342 ± 0.251
2.684AspArg: 2.684 ± 0.552
3.158AspSer: 3.158 ± 0.443
2.053AspThr: 2.053 ± 0.51
4.105AspVal: 4.105 ± 0.524
0.474AspTrp: 0.474 ± 0.164
2.368AspTyr: 2.368 ± 0.249
0.0AspXaa: 0.0 ± 0.0
Glu
5.052GluAla: 5.052 ± 0.944
1.184GluCys: 1.184 ± 0.335
3.789GluAsp: 3.789 ± 0.44
8.842GluGlu: 8.842 ± 1.967
1.658GluPhe: 1.658 ± 0.447
4.184GluGly: 4.184 ± 0.848
0.947GluHis: 0.947 ± 0.268
2.053GluIle: 2.053 ± 0.411
1.816GluLys: 1.816 ± 0.361
5.21GluLeu: 5.21 ± 0.678
1.263GluMet: 1.263 ± 0.383
3.0GluAsn: 3.0 ± 0.497
3.553GluPro: 3.553 ± 0.533
2.21GluGln: 2.21 ± 0.469
4.026GluArg: 4.026 ± 1.011
3.789GluSer: 3.789 ± 0.647
3.631GluThr: 3.631 ± 0.568
3.71GluVal: 3.71 ± 0.504
0.632GluTrp: 0.632 ± 0.251
1.737GluTyr: 1.737 ± 0.336
0.0GluXaa: 0.0 ± 0.0
Phe
2.368PheAla: 2.368 ± 0.401
1.026PheCys: 1.026 ± 0.291
2.289PheAsp: 2.289 ± 0.45
2.21PheGlu: 2.21 ± 0.449
2.289PhePhe: 2.289 ± 0.451
1.5PheGly: 1.5 ± 0.433
0.632PheHis: 0.632 ± 0.17
1.5PheIle: 1.5 ± 0.305
1.579PheLys: 1.579 ± 0.313
3.079PheLeu: 3.079 ± 0.491
1.184PheMet: 1.184 ± 0.234
2.289PheAsn: 2.289 ± 0.407
1.5PhePro: 1.5 ± 0.398
1.5PheGln: 1.5 ± 0.27
1.737PheArg: 1.737 ± 0.419
3.789PheSer: 3.789 ± 0.509
2.921PheThr: 2.921 ± 0.484
3.158PheVal: 3.158 ± 0.452
0.316PheTrp: 0.316 ± 0.138
2.132PheTyr: 2.132 ± 0.395
0.0PheXaa: 0.0 ± 0.0
Gly
4.658GlyAla: 4.658 ± 0.67
0.789GlyCys: 0.789 ± 0.378
3.237GlyAsp: 3.237 ± 0.575
3.316GlyGlu: 3.316 ± 0.605
2.605GlyPhe: 2.605 ± 0.537
7.105GlyGly: 7.105 ± 1.619
1.026GlyHis: 1.026 ± 0.334
2.053GlyIle: 2.053 ± 0.526
1.895GlyLys: 1.895 ± 0.427
5.131GlyLeu: 5.131 ± 0.573
0.868GlyMet: 0.868 ± 0.261
2.684GlyAsn: 2.684 ± 0.668
3.947GlyPro: 3.947 ± 0.52
2.526GlyGln: 2.526 ± 0.547
5.526GlyArg: 5.526 ± 0.916
5.526GlySer: 5.526 ± 0.806
3.474GlyThr: 3.474 ± 0.412
3.868GlyVal: 3.868 ± 0.781
0.632GlyTrp: 0.632 ± 0.186
1.658GlyTyr: 1.658 ± 0.418
0.0GlyXaa: 0.0 ± 0.0
His
1.5HisAla: 1.5 ± 0.323
0.553HisCys: 0.553 ± 0.225
0.789HisAsp: 0.789 ± 0.292
1.026HisGlu: 1.026 ± 0.31
0.632HisPhe: 0.632 ± 0.19
1.579HisGly: 1.579 ± 0.402
0.789HisHis: 0.789 ± 0.251
0.947HisIle: 0.947 ± 0.24
1.105HisLys: 1.105 ± 0.276
2.684HisLeu: 2.684 ± 0.548
0.316HisMet: 0.316 ± 0.183
1.184HisAsn: 1.184 ± 0.269
2.447HisPro: 2.447 ± 0.586
1.105HisGln: 1.105 ± 0.355
1.658HisArg: 1.658 ± 0.432
1.184HisSer: 1.184 ± 0.304
1.026HisThr: 1.026 ± 0.413
0.711HisVal: 0.711 ± 0.221
0.079HisTrp: 0.079 ± 0.085
1.184HisTyr: 1.184 ± 0.275
0.0HisXaa: 0.0 ± 0.0
Ile
2.763IleAla: 2.763 ± 0.48
0.868IleCys: 0.868 ± 0.449
2.21IleAsp: 2.21 ± 0.435
1.895IleGlu: 1.895 ± 0.527
1.895IlePhe: 1.895 ± 0.349
2.684IleGly: 2.684 ± 0.592
0.553IleHis: 0.553 ± 0.209
2.132IleIle: 2.132 ± 0.577
2.605IleLys: 2.605 ± 0.416
3.395IleLeu: 3.395 ± 0.641
1.026IleMet: 1.026 ± 0.265
2.289IleAsn: 2.289 ± 0.622
1.974IlePro: 1.974 ± 0.504
2.053IleGln: 2.053 ± 0.418
2.289IleArg: 2.289 ± 0.409
3.079IleSer: 3.079 ± 0.535
2.684IleThr: 2.684 ± 0.386
2.447IleVal: 2.447 ± 0.442
0.632IleTrp: 0.632 ± 0.268
1.658IleTyr: 1.658 ± 0.354
0.0IleXaa: 0.0 ± 0.0
Lys
2.526LysAla: 2.526 ± 0.641
1.184LysCys: 1.184 ± 0.323
1.737LysAsp: 1.737 ± 0.319
1.737LysGlu: 1.737 ± 0.446
1.737LysPhe: 1.737 ± 0.353
2.447LysGly: 2.447 ± 0.624
0.711LysHis: 0.711 ± 0.221
3.237LysIle: 3.237 ± 0.43
2.447LysLys: 2.447 ± 0.468
3.789LysLeu: 3.789 ± 0.567
1.263LysMet: 1.263 ± 0.432
2.053LysAsn: 2.053 ± 0.312
2.368LysPro: 2.368 ± 0.584
1.579LysGln: 1.579 ± 0.417
3.868LysArg: 3.868 ± 0.697
2.842LysSer: 2.842 ± 0.431
2.526LysThr: 2.526 ± 0.615
2.21LysVal: 2.21 ± 0.557
0.316LysTrp: 0.316 ± 0.219
1.026LysTyr: 1.026 ± 0.277
0.0LysXaa: 0.0 ± 0.0
Leu
8.21LeuAla: 8.21 ± 1.101
1.737LeuCys: 1.737 ± 0.405
4.421LeuAsp: 4.421 ± 0.693
5.842LeuGlu: 5.842 ± 0.736
3.395LeuPhe: 3.395 ± 0.481
4.579LeuGly: 4.579 ± 0.52
3.079LeuHis: 3.079 ± 0.685
3.237LeuIle: 3.237 ± 0.966
6.474LeuLys: 6.474 ± 0.793
10.342LeuLeu: 10.342 ± 1.221
2.605LeuMet: 2.605 ± 0.527
4.105LeuAsn: 4.105 ± 0.546
5.842LeuPro: 5.842 ± 0.599
5.131LeuGln: 5.131 ± 0.595
6.237LeuArg: 6.237 ± 1.03
5.131LeuSer: 5.131 ± 0.668
5.131LeuThr: 5.131 ± 0.843
5.842LeuVal: 5.842 ± 0.647
1.658LeuTrp: 1.658 ± 0.307
2.921LeuTyr: 2.921 ± 0.587
0.0LeuXaa: 0.0 ± 0.0
Met
1.658MetAla: 1.658 ± 0.254
0.079MetCys: 0.079 ± 0.095
1.342MetAsp: 1.342 ± 0.345
1.342MetGlu: 1.342 ± 0.394
0.947MetPhe: 0.947 ± 0.31
0.395MetGly: 0.395 ± 0.147
0.553MetHis: 0.553 ± 0.252
0.868MetIle: 0.868 ± 0.284
0.947MetLys: 0.947 ± 0.297
1.816MetLeu: 1.816 ± 0.536
0.474MetMet: 0.474 ± 0.173
1.105MetAsn: 1.105 ± 0.279
1.5MetPro: 1.5 ± 0.374
0.947MetGln: 0.947 ± 0.269
1.105MetArg: 1.105 ± 0.523
2.053MetSer: 2.053 ± 0.339
1.026MetThr: 1.026 ± 0.309
0.789MetVal: 0.789 ± 0.251
0.395MetTrp: 0.395 ± 0.204
0.947MetTyr: 0.947 ± 0.203
0.0MetXaa: 0.0 ± 0.0
Asn
3.158AsnAla: 3.158 ± 0.576
0.474AsnCys: 0.474 ± 0.156
1.816AsnAsp: 1.816 ± 0.336
1.895AsnGlu: 1.895 ± 0.36
2.289AsnPhe: 2.289 ± 0.279
3.079AsnGly: 3.079 ± 0.605
0.711AsnHis: 0.711 ± 0.202
2.526AsnIle: 2.526 ± 0.589
1.816AsnLys: 1.816 ± 0.382
5.052AsnLeu: 5.052 ± 0.856
0.711AsnMet: 0.711 ± 0.267
2.289AsnAsn: 2.289 ± 0.756
2.921AsnPro: 2.921 ± 0.471
2.526AsnGln: 2.526 ± 0.606
3.0AsnArg: 3.0 ± 0.412
3.158AsnSer: 3.158 ± 0.587
2.684AsnThr: 2.684 ± 0.495
3.316AsnVal: 3.316 ± 0.56
0.947AsnTrp: 0.947 ± 0.278
2.684AsnTyr: 2.684 ± 0.562
0.0AsnXaa: 0.0 ± 0.0
Pro
6.0ProAla: 6.0 ± 0.967
0.474ProCys: 0.474 ± 0.196
3.395ProAsp: 3.395 ± 0.523
5.131ProGlu: 5.131 ± 0.799
2.526ProPhe: 2.526 ± 0.513
4.105ProGly: 4.105 ± 0.713
1.026ProHis: 1.026 ± 0.26
2.605ProIle: 2.605 ± 0.483
2.526ProLys: 2.526 ± 0.542
5.605ProLeu: 5.605 ± 0.812
1.105ProMet: 1.105 ± 0.291
2.526ProAsn: 2.526 ± 0.671
6.395ProPro: 6.395 ± 1.11
3.474ProGln: 3.474 ± 0.814
3.631ProArg: 3.631 ± 0.622
4.658ProSer: 4.658 ± 0.624
3.395ProThr: 3.395 ± 0.541
4.026ProVal: 4.026 ± 0.727
0.474ProTrp: 0.474 ± 0.181
1.263ProTyr: 1.263 ± 0.409
0.0ProXaa: 0.0 ± 0.0
Gln
3.553GlnAla: 3.553 ± 0.494
0.711GlnCys: 0.711 ± 0.186
1.895GlnAsp: 1.895 ± 0.435
2.605GlnGlu: 2.605 ± 0.486
1.263GlnPhe: 1.263 ± 0.386
2.447GlnGly: 2.447 ± 0.35
0.947GlnHis: 0.947 ± 0.265
1.974GlnIle: 1.974 ± 0.396
1.658GlnLys: 1.658 ± 0.3
5.526GlnLeu: 5.526 ± 0.558
1.105GlnMet: 1.105 ± 0.258
2.605GlnAsn: 2.605 ± 0.432
2.763GlnPro: 2.763 ± 0.484
3.0GlnGln: 3.0 ± 1.081
3.395GlnArg: 3.395 ± 0.459
2.684GlnSer: 2.684 ± 0.566
3.237GlnThr: 3.237 ± 0.488
3.237GlnVal: 3.237 ± 0.408
0.474GlnTrp: 0.474 ± 0.213
1.026GlnTyr: 1.026 ± 0.28
0.0GlnXaa: 0.0 ± 0.0
Arg
6.0ArgAla: 6.0 ± 0.851
1.5ArgCys: 1.5 ± 0.304
3.316ArgAsp: 3.316 ± 0.716
3.631ArgGlu: 3.631 ± 0.815
2.684ArgPhe: 2.684 ± 0.363
5.605ArgGly: 5.605 ± 1.249
1.421ArgHis: 1.421 ± 0.339
2.21ArgIle: 2.21 ± 0.371
2.289ArgLys: 2.289 ± 0.466
4.974ArgLeu: 4.974 ± 0.748
0.711ArgMet: 0.711 ± 0.198
2.526ArgAsn: 2.526 ± 0.353
4.026ArgPro: 4.026 ± 0.794
3.474ArgGln: 3.474 ± 0.528
8.921ArgArg: 8.921 ± 2.293
4.184ArgSer: 4.184 ± 0.746
3.079ArgThr: 3.079 ± 0.559
4.263ArgVal: 4.263 ± 0.955
1.184ArgTrp: 1.184 ± 0.315
2.763ArgTyr: 2.763 ± 0.585
0.0ArgXaa: 0.0 ± 0.0
Ser
5.368SerAla: 5.368 ± 0.582
2.053SerCys: 2.053 ± 0.48
3.553SerAsp: 3.553 ± 0.645
5.052SerGlu: 5.052 ± 1.061
2.921SerPhe: 2.921 ± 0.711
5.289SerGly: 5.289 ± 0.947
1.184SerHis: 1.184 ± 0.241
3.316SerIle: 3.316 ± 0.469
2.289SerLys: 2.289 ± 0.452
6.631SerLeu: 6.631 ± 0.712
1.263SerMet: 1.263 ± 0.299
3.474SerAsn: 3.474 ± 0.508
4.816SerPro: 4.816 ± 0.773
2.763SerGln: 2.763 ± 0.36
4.421SerArg: 4.421 ± 0.806
6.552SerSer: 6.552 ± 0.864
3.474SerThr: 3.474 ± 0.528
4.579SerVal: 4.579 ± 0.505
1.026SerTrp: 1.026 ± 0.351
2.921SerTyr: 2.921 ± 0.651
0.0SerXaa: 0.0 ± 0.0
Thr
3.947ThrAla: 3.947 ± 0.611
1.342ThrCys: 1.342 ± 0.273
1.974ThrAsp: 1.974 ± 0.379
2.447ThrGlu: 2.447 ± 0.423
2.605ThrPhe: 2.605 ± 0.495
3.0ThrGly: 3.0 ± 0.509
1.5ThrHis: 1.5 ± 0.363
2.842ThrIle: 2.842 ± 0.657
1.658ThrLys: 1.658 ± 0.417
7.105ThrLeu: 7.105 ± 0.58
0.316ThrMet: 0.316 ± 0.152
3.079ThrAsn: 3.079 ± 0.853
4.263ThrPro: 4.263 ± 0.543
2.842ThrGln: 2.842 ± 0.548
2.605ThrArg: 2.605 ± 0.543
3.237ThrSer: 3.237 ± 0.584
3.079ThrThr: 3.079 ± 0.592
3.868ThrVal: 3.868 ± 0.627
0.947ThrTrp: 0.947 ± 0.297
3.0ThrTyr: 3.0 ± 0.622
0.0ThrXaa: 0.0 ± 0.0
Val
5.131ValAla: 5.131 ± 0.741
1.5ValCys: 1.5 ± 0.312
3.316ValAsp: 3.316 ± 0.487
3.789ValGlu: 3.789 ± 0.55
2.21ValPhe: 2.21 ± 0.398
3.079ValGly: 3.079 ± 0.481
1.816ValHis: 1.816 ± 0.349
3.079ValIle: 3.079 ± 0.493
2.842ValLys: 2.842 ± 0.577
5.21ValLeu: 5.21 ± 0.625
2.053ValMet: 2.053 ± 0.39
3.0ValAsn: 3.0 ± 0.638
4.026ValPro: 4.026 ± 0.684
3.395ValGln: 3.395 ± 0.313
3.947ValArg: 3.947 ± 0.976
4.579ValSer: 4.579 ± 0.777
3.71ValThr: 3.71 ± 0.586
4.895ValVal: 4.895 ± 0.791
0.711ValTrp: 0.711 ± 0.28
2.526ValTyr: 2.526 ± 0.433
0.0ValXaa: 0.0 ± 0.0
Trp
0.553TrpAla: 0.553 ± 0.188
0.158TrpCys: 0.158 ± 0.105
0.789TrpAsp: 0.789 ± 0.238
0.868TrpGlu: 0.868 ± 0.246
0.316TrpPhe: 0.316 ± 0.164
1.263TrpGly: 1.263 ± 0.257
0.395TrpHis: 0.395 ± 0.211
0.711TrpIle: 0.711 ± 0.2
0.711TrpLys: 0.711 ± 0.248
1.421TrpLeu: 1.421 ± 0.392
0.316TrpMet: 0.316 ± 0.154
0.395TrpAsn: 0.395 ± 0.206
0.553TrpPro: 0.553 ± 0.213
0.632TrpGln: 0.632 ± 0.229
0.868TrpArg: 0.868 ± 0.237
1.263TrpSer: 1.263 ± 0.403
0.789TrpThr: 0.789 ± 0.275
0.553TrpVal: 0.553 ± 0.24
0.789TrpTrp: 0.789 ± 0.602
0.316TrpTyr: 0.316 ± 0.17
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.132TyrAla: 2.132 ± 0.424
0.711TyrCys: 0.711 ± 0.211
1.5TyrAsp: 1.5 ± 0.404
1.816TyrGlu: 1.816 ± 0.533
1.5TyrPhe: 1.5 ± 0.452
1.5TyrGly: 1.5 ± 0.284
1.579TyrHis: 1.579 ± 0.396
0.632TyrIle: 0.632 ± 0.168
1.421TyrLys: 1.421 ± 0.401
4.026TyrLeu: 4.026 ± 0.606
0.711TyrMet: 0.711 ± 0.254
1.974TyrAsn: 1.974 ± 0.401
2.289TyrPro: 2.289 ± 0.329
2.053TyrGln: 2.053 ± 0.37
2.684TyrArg: 2.684 ± 0.503
3.474TyrSer: 3.474 ± 0.541
2.289TyrThr: 2.289 ± 0.433
2.763TyrVal: 2.763 ± 0.471
0.395TyrTrp: 0.395 ± 0.211
1.342TyrTyr: 1.342 ± 0.352
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 33 proteins (12668 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski