Amino acid dipeptide frequency for Mengla dianlovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 standard dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
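The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function names (`dipeptide_permille`, `classify`), the toy sequences, and the choices to count overlapping pairs within each protein and to map every non-standard letter to Xaa are all assumptions. The uniform expectation of 1/400 is the one stated in the text.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the same order as the table header; Xaa is the catch-all.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}
RESIDUES = list(ONE_TO_THREE.values()) + ["Xaa"]


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: any letter outside the 20 standard ones becomes Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Assumption: overlapping adjacent pairs, never spanning two proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(RESIDUES, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide as over- or underrepresented relative to `expected`."""
    if value_permille > expected:
        return "over"   # red in the table
    if value_permille < expected:
        return "under"  # blue in the table
    return "expected"


toy = ["MKLLA", "LLXA"]  # hypothetical toy sequences, not the real proteome
freqs = dipeptide_permille(toy)
print(freqs["LeuLeu"], classify(freqs["LeuLeu"]))
```

With a threshold of 2.5‰, a table value such as AlaAla (5.194‰) would be classified as overrepresented and AlaCys (0.416‰) as underrepresented.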

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.194 ± 1.178
AlaCys: 0.416 ± 0.27
AlaAsp: 2.701 ± 0.838
AlaGlu: 3.532 ± 0.54
AlaPhe: 3.324 ± 0.619
AlaGly: 2.701 ± 0.956
AlaHis: 0.831 ± 0.374
AlaIle: 4.571 ± 0.95
AlaLys: 4.571 ± 1.338
AlaLeu: 6.441 ± 1.763
AlaMet: 0.416 ± 0.282
AlaAsn: 2.909 ± 0.838
AlaPro: 2.493 ± 0.907
AlaGln: 2.078 ± 0.872
AlaArg: 2.701 ± 0.814
AlaSer: 3.948 ± 0.82
AlaThr: 2.493 ± 0.618
AlaVal: 2.285 ± 0.683
AlaTrp: 0.623 ± 0.574
AlaTyr: 2.078 ± 0.504
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.831 ± 0.414
CysCys: 0.623 ± 0.239
CysAsp: 0.416 ± 0.191
CysGlu: 0.831 ± 0.414
CysPhe: 0.416 ± 0.348
CysGly: 0.623 ± 0.379
CysHis: 0.0 ± 0.0
CysIle: 1.454 ± 0.272
CysLys: 1.039 ± 0.332
CysLeu: 1.039 ± 0.384
CysMet: 0.0 ± 0.0
CysAsn: 1.454 ± 0.362
CysPro: 0.416 ± 0.279
CysGln: 0.208 ± 0.297
CysArg: 1.039 ± 0.576
CysSer: 0.831 ± 0.258
CysThr: 1.039 ± 0.511
CysVal: 0.416 ± 0.191
CysTrp: 0.208 ± 0.126
CysTyr: 0.416 ± 0.252
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.324 ± 1.355
AspCys: 0.416 ± 0.438
AspAsp: 2.493 ± 0.47
AspGlu: 2.909 ± 0.903
AspPhe: 2.493 ± 0.575
AspGly: 2.078 ± 0.488
AspHis: 0.416 ± 0.296
AspIle: 4.155 ± 1.127
AspLys: 3.117 ± 0.962
AspLeu: 8.103 ± 1.201
AspMet: 0.623 ± 0.405
AspAsn: 2.493 ± 0.516
AspPro: 3.117 ± 0.739
AspGln: 4.155 ± 0.687
AspArg: 1.247 ± 0.353
AspSer: 2.701 ± 0.948
AspThr: 1.662 ± 0.401
AspVal: 2.078 ± 0.46
AspTrp: 0.831 ± 0.374
AspTyr: 1.247 ± 0.556
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.701 ± 0.735
GluCys: 0.623 ± 0.239
GluAsp: 2.078 ± 0.395
GluGlu: 3.74 ± 1.539
GluPhe: 0.831 ± 0.3
GluGly: 3.532 ± 1.0
GluHis: 1.454 ± 0.918
GluIle: 3.117 ± 0.848
GluLys: 4.155 ± 0.642
GluLeu: 5.818 ± 0.537
GluMet: 0.416 ± 0.372
GluAsn: 3.948 ± 0.544
GluPro: 1.039 ± 0.377
GluGln: 3.948 ± 1.277
GluArg: 2.909 ± 0.432
GluSer: 4.779 ± 1.748
GluThr: 3.948 ± 0.813
GluVal: 2.078 ± 1.067
GluTrp: 0.831 ± 0.558
GluTyr: 3.324 ± 0.463
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.078 ± 0.253
PheCys: 0.416 ± 0.187
PheAsp: 1.454 ± 0.574
PheGlu: 1.247 ± 0.447
PhePhe: 1.454 ± 0.619
PheGly: 2.078 ± 0.535
PheHis: 1.247 ± 0.605
PheIle: 2.701 ± 1.009
PheLys: 3.948 ± 0.478
PheLeu: 6.856 ± 0.914
PheMet: 0.831 ± 0.467
PheAsn: 2.701 ± 0.429
PhePro: 2.701 ± 0.767
PheGln: 2.701 ± 0.621
PheArg: 0.416 ± 0.191
PheSer: 4.779 ± 1.2
PheThr: 2.701 ± 0.884
PheVal: 1.662 ± 0.554
PheTrp: 0.208 ± 0.126
PheTyr: 0.831 ± 0.505
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.909 ± 0.404
GlyCys: 0.416 ± 0.392
GlyAsp: 3.532 ± 1.351
GlyGlu: 2.909 ± 0.821
GlyPhe: 2.909 ± 0.494
GlyGly: 3.532 ± 1.119
GlyHis: 1.039 ± 0.275
GlyIle: 4.571 ± 0.634
GlyLys: 2.909 ± 0.854
GlyLeu: 4.779 ± 0.912
GlyMet: 0.623 ± 0.476
GlyAsn: 2.701 ± 0.979
GlyPro: 1.662 ± 0.757
GlyGln: 2.909 ± 0.608
GlyArg: 3.532 ± 0.969
GlySer: 4.155 ± 1.134
GlyThr: 2.285 ± 0.514
GlyVal: 3.74 ± 0.848
GlyTrp: 0.831 ± 0.375
GlyTyr: 1.247 ± 0.522
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.454 ± 0.567
HisCys: 0.623 ± 0.335
HisAsp: 0.831 ± 0.496
HisGlu: 0.416 ± 0.252
HisPhe: 1.247 ± 0.415
HisGly: 0.831 ± 0.778
HisHis: 1.039 ± 0.436
HisIle: 1.454 ± 0.272
HisLys: 0.623 ± 0.283
HisLeu: 3.532 ± 0.711
HisMet: 0.416 ± 0.348
HisAsn: 0.623 ± 0.379
HisPro: 1.662 ± 0.557
HisGln: 1.662 ± 0.64
HisArg: 1.662 ± 0.486
HisSer: 1.247 ± 0.757
HisThr: 0.623 ± 0.55
HisVal: 1.039 ± 0.34
HisTrp: 0.831 ± 0.505
HisTyr: 1.454 ± 0.255
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.117 ± 0.838
IleCys: 1.247 ± 0.434
IleAsp: 3.74 ± 0.845
IleGlu: 4.363 ± 0.77
IlePhe: 4.155 ± 0.791
IleGly: 3.74 ± 0.537
IleHis: 1.662 ± 0.803
IleIle: 7.272 ± 1.17
IleLys: 3.324 ± 0.63
IleLeu: 5.194 ± 1.251
IleMet: 1.87 ± 0.694
IleAsn: 4.363 ± 0.586
IlePro: 4.571 ± 0.504
IleGln: 3.324 ± 0.993
IleArg: 1.87 ± 0.784
IleSer: 6.649 ± 0.563
IleThr: 4.363 ± 1.196
IleVal: 4.571 ± 0.595
IleTrp: 0.623 ± 0.31
IleTyr: 2.078 ± 0.785
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.493 ± 0.469
LysCys: 0.208 ± 0.126
LysAsp: 4.155 ± 1.108
LysGlu: 3.532 ± 0.984
LysPhe: 2.909 ± 0.747
LysGly: 3.324 ± 1.281
LysHis: 1.454 ± 0.698
LysIle: 5.818 ± 0.964
LysLys: 3.948 ± 0.599
LysLeu: 5.61 ± 1.392
LysMet: 1.039 ± 0.526
LysAsn: 2.701 ± 0.376
LysPro: 3.324 ± 0.933
LysGln: 2.701 ± 0.375
LysArg: 3.324 ± 0.688
LysSer: 4.571 ± 0.722
LysThr: 3.532 ± 1.141
LysVal: 3.117 ± 0.71
LysTrp: 0.208 ± 0.126
LysTyr: 2.285 ± 0.613
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.064 ± 1.046
LeuCys: 1.039 ± 0.351
LeuAsp: 5.194 ± 0.705
LeuGlu: 5.61 ± 2.105
LeuPhe: 4.986 ± 0.678
LeuGly: 5.61 ± 0.538
LeuHis: 3.117 ± 0.837
LeuIle: 8.519 ± 1.01
LeuLys: 6.649 ± 1.474
LeuLeu: 9.557 ± 1.068
LeuMet: 1.662 ± 0.551
LeuAsn: 6.233 ± 1.964
LeuPro: 5.402 ± 1.023
LeuGln: 3.948 ± 0.771
LeuArg: 6.025 ± 0.686
LeuSer: 8.726 ± 1.48
LeuThr: 8.726 ± 1.05
LeuVal: 4.155 ± 0.912
LeuTrp: 1.454 ± 0.516
LeuTyr: 2.909 ± 0.852
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.87 ± 0.812
MetCys: 0.0 ± 0.0
MetAsp: 1.039 ± 0.407
MetGlu: 1.454 ± 0.604
MetPhe: 0.623 ± 0.269
MetGly: 1.039 ± 0.414
MetHis: 0.831 ± 0.383
MetIle: 1.454 ± 0.508
MetLys: 1.039 ± 0.513
MetLeu: 1.247 ± 0.395
MetMet: 0.623 ± 0.29
MetAsn: 1.247 ± 0.516
MetPro: 1.039 ± 0.514
MetGln: 0.416 ± 0.438
MetArg: 1.039 ± 0.377
MetSer: 1.039 ± 0.377
MetThr: 1.662 ± 0.628
MetVal: 0.623 ± 0.283
MetTrp: 0.208 ± 0.297
MetTyr: 0.208 ± 0.126
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.909 ± 1.07
AsnCys: 0.831 ± 0.505
AsnAsp: 2.701 ± 0.599
AsnGlu: 3.532 ± 0.848
AsnPhe: 2.701 ± 1.144
AsnGly: 2.493 ± 0.79
AsnHis: 1.454 ± 0.516
AsnIle: 3.324 ± 0.735
AsnLys: 1.87 ± 0.493
AsnLeu: 6.025 ± 1.136
AsnMet: 1.454 ± 0.44
AsnAsn: 1.662 ± 0.26
AsnPro: 2.909 ± 0.688
AsnGln: 4.779 ± 0.961
AsnArg: 2.909 ± 0.763
AsnSer: 5.402 ± 0.837
AsnThr: 2.909 ± 0.92
AsnVal: 2.285 ± 0.725
AsnTrp: 0.831 ± 0.414
AsnTyr: 2.285 ± 0.486
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.87 ± 0.772
ProCys: 1.247 ± 0.599
ProAsp: 3.324 ± 1.562
ProGlu: 4.363 ± 0.384
ProPhe: 2.078 ± 0.539
ProGly: 1.87 ± 0.536
ProHis: 0.831 ± 0.48
ProIle: 2.078 ± 0.75
ProLys: 2.493 ± 0.443
ProLeu: 5.61 ± 1.124
ProMet: 0.208 ± 0.219
ProAsn: 2.078 ± 0.339
ProPro: 6.025 ± 2.772
ProGln: 2.909 ± 1.337
ProArg: 2.701 ± 0.573
ProSer: 5.402 ± 1.072
ProThr: 3.117 ± 1.217
ProVal: 1.662 ± 0.596
ProTrp: 0.0 ± 0.0
ProTyr: 2.078 ± 0.852
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.324 ± 1.632
GlnCys: 0.831 ± 0.505
GlnAsp: 2.285 ± 0.781
GlnGlu: 1.87 ± 0.872
GlnPhe: 2.078 ± 0.323
GlnGly: 2.909 ± 0.396
GlnHis: 1.454 ± 0.6
GlnIle: 3.74 ± 0.568
GlnLys: 3.532 ± 0.995
GlnLeu: 3.74 ± 1.064
GlnMet: 2.493 ± 0.718
GlnAsn: 3.324 ± 0.638
GlnPro: 1.662 ± 0.859
GlnGln: 2.285 ± 0.77
GlnArg: 2.285 ± 1.01
GlnSer: 3.324 ± 0.373
GlnThr: 3.948 ± 0.741
GlnVal: 2.701 ± 1.211
GlnTrp: 0.208 ± 0.274
GlnTyr: 2.078 ± 0.5
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.078 ± 0.536
ArgCys: 0.416 ± 0.191
ArgAsp: 1.454 ± 0.296
ArgGlu: 2.909 ± 0.739
ArgPhe: 1.247 ± 0.347
ArgGly: 2.078 ± 0.592
ArgHis: 1.039 ± 0.27
ArgIle: 3.117 ± 0.969
ArgLys: 3.324 ± 0.427
ArgLeu: 4.155 ± 1.275
ArgMet: 1.247 ± 0.556
ArgAsn: 3.532 ± 1.152
ArgPro: 1.454 ± 0.716
ArgGln: 2.909 ± 1.078
ArgArg: 2.701 ± 1.044
ArgSer: 3.948 ± 1.881
ArgThr: 4.155 ± 1.2
ArgVal: 4.155 ± 0.644
ArgTrp: 1.039 ± 0.414
ArgTyr: 1.87 ± 0.994
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.779 ± 1.741
SerCys: 1.662 ± 0.474
SerAsp: 4.571 ± 1.039
SerGlu: 4.155 ± 1.287
SerPhe: 3.948 ± 0.817
SerGly: 4.363 ± 1.054
SerHis: 2.285 ± 0.336
SerIle: 4.155 ± 0.839
SerLys: 3.117 ± 0.708
SerLeu: 8.311 ± 1.501
SerMet: 1.247 ± 0.98
SerAsn: 5.194 ± 1.614
SerPro: 2.909 ± 1.004
SerGln: 3.117 ± 0.696
SerArg: 4.779 ± 1.187
SerSer: 9.557 ± 1.574
SerThr: 6.025 ± 1.346
SerVal: 4.779 ± 1.401
SerTrp: 1.039 ± 0.44
SerTyr: 2.909 ± 0.984
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.324 ± 0.74
ThrCys: 0.623 ± 0.285
ThrAsp: 4.363 ± 0.909
ThrGlu: 3.324 ± 0.917
ThrPhe: 2.285 ± 0.527
ThrGly: 4.155 ± 1.235
ThrHis: 0.831 ± 0.3
ThrIle: 3.117 ± 0.536
ThrLys: 3.117 ± 0.468
ThrLeu: 7.272 ± 0.983
ThrMet: 1.247 ± 0.391
ThrAsn: 2.909 ± 1.226
ThrPro: 4.155 ± 0.337
ThrGln: 2.909 ± 0.803
ThrArg: 3.74 ± 0.503
ThrSer: 5.818 ± 0.869
ThrThr: 5.402 ± 1.195
ThrVal: 3.74 ± 0.873
ThrTrp: 0.416 ± 0.187
ThrTyr: 1.247 ± 0.39
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.324 ± 0.676
ValCys: 1.039 ± 0.406
ValAsp: 0.831 ± 0.281
ValGlu: 1.454 ± 0.376
ValPhe: 0.831 ± 0.398
ValGly: 3.532 ± 1.324
ValHis: 1.039 ± 0.377
ValIle: 5.194 ± 1.331
ValLys: 4.986 ± 1.02
ValLeu: 6.025 ± 1.139
ValMet: 0.831 ± 0.442
ValAsn: 2.701 ± 1.095
ValPro: 3.324 ± 0.96
ValGln: 1.662 ± 0.665
ValArg: 2.493 ± 0.415
ValSer: 2.909 ± 0.631
ValThr: 2.909 ± 0.621
ValVal: 3.117 ± 1.327
ValTrp: 0.208 ± 0.126
ValTyr: 1.039 ± 0.247
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.208 ± 0.219
TrpCys: 0.0 ± 0.0
TrpAsp: 1.039 ± 0.435
TrpGlu: 1.039 ± 0.526
TrpPhe: 0.831 ± 0.33
TrpGly: 1.039 ± 0.649
TrpHis: 0.416 ± 0.252
TrpIle: 0.831 ± 0.322
TrpLys: 0.416 ± 0.366
TrpLeu: 1.247 ± 0.359
TrpMet: 0.416 ± 0.252
TrpAsn: 0.0 ± 0.0
TrpPro: 0.416 ± 0.47
TrpGln: 0.0 ± 0.0
TrpArg: 0.831 ± 0.282
TrpSer: 0.208 ± 0.126
TrpThr: 0.831 ± 0.33
TrpVal: 0.623 ± 0.299
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.831 ± 0.505
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.454 ± 0.445
TyrCys: 0.623 ± 0.379
TyrAsp: 1.454 ± 0.575
TyrGlu: 2.078 ± 0.506
TyrPhe: 1.662 ± 0.488
TyrGly: 1.662 ± 0.474
TyrHis: 0.831 ± 0.505
TyrIle: 1.454 ± 0.754
TyrLys: 2.078 ± 1.047
TyrLeu: 6.025 ± 1.853
TyrMet: 0.831 ± 0.285
TyrAsn: 2.493 ± 0.579
TyrPro: 1.662 ± 0.531
TyrGln: 1.454 ± 0.407
TyrArg: 0.623 ± 0.299
TyrSer: 2.909 ± 1.201
TyrThr: 1.662 ± 0.56
TyrVal: 0.831 ± 0.684
TyrTrp: 0.623 ± 0.239
TyrTyr: 2.285 ± 0.909
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4814 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
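Bootstrapping at the protein level can be illustrated as follows. This is a hedged sketch rather than the procedure actually used by Proteome-pI: the function names, the toy sequences, the fixed random seed, and the use of the standard deviation of the resampled estimates as the reported error are assumptions. Only the number of resamples (100) and the resampling unit (whole proteins) come from the note above.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, spread) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement; the spread is the
    standard deviation of the resampled estimates (an assumption).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


toy = ["MKLLALLS", "AAGLLK", "MSSTLL"]  # hypothetical one-letter sequences
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.3f} ± {err:.3f}")
```

Because whole proteins are resampled, a dipeptide concentrated in a single protein gets a larger error than one spread evenly across the proteome, which matters for a small proteome such as this one (7 proteins).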

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski