Amino acid dipeptide frequency for Liao ning virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
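The comparison against the random expectation can be sketched as follows. This is only an illustration: it assumes the threshold is the uniform expectation over the 400 standard dipeptides (2.5 parts per thousand), which may differ from the exact criterion used to color the table. The function name `classify` is hypothetical; the three example values are copied from the table.

```python
# Uniform expectation: 1 of 400 possible dipeptides, expressed in parts per thousand.
N_STANDARD = 20
expected_permille = 1000 / N_STANDARD**2  # 2.5 per mille, i.e. 0.25%


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# A few values copied from the table (per mille).
examples = {"AlaLeu": 9.214, "CysCys": 0.165, "GluLys": 2.139}
for pair, value in examples.items():
    print(pair, value, classify(value))
```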

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
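As a rough sketch of how such values could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the normalization by the total number of counted dipeptides are assumptions made for illustration; the database may count or normalize differently.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Count overlapping dipeptides inside each protein; return per-mille frequencies."""
    counts = Counter()
    for seq in proteins:
        # Pairs never span two proteins: each sequence is scanned on its own.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Made-up one-letter sequences, not taken from the proteome.
toy_proteins = ["MAAL", "AALK"]
freqs = dipeptide_permille(toy_proteins)

# Converting a per-mille value back to a fraction means multiplying by 10^-3.
fraction_aa = freqs["AA"] * 1e-3
print(freqs, fraction_aa)
```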

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.278 ± 1.018
AlaCys: 0.658 ± 0.44
AlaAsp: 4.936 ± 0.904
AlaGlu: 3.291 ± 0.56
AlaPhe: 1.645 ± 0.355
AlaGly: 2.139 ± 0.737
AlaHis: 0.823 ± 0.35
AlaIle: 3.62 ± 0.891
AlaLys: 3.291 ± 0.5
AlaLeu: 9.214 ± 0.71
AlaMet: 1.81 ± 0.638
AlaAsn: 3.62 ± 0.781
AlaPro: 2.139 ± 0.468
AlaGln: 1.81 ± 0.591
AlaArg: 2.962 ± 0.691
AlaSer: 4.442 ± 0.784
AlaThr: 3.949 ± 0.72
AlaVal: 5.429 ± 1.291
AlaTrp: 0.658 ± 0.271
AlaTyr: 2.139 ± 0.605
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.494 ± 0.192
CysCys: 0.165 ± 0.141
CysAsp: 0.658 ± 0.253
CysGlu: 0.658 ± 0.451
CysPhe: 0.494 ± 0.244
CysGly: 0.823 ± 0.261
CysHis: 0.494 ± 0.211
CysIle: 0.494 ± 0.328
CysLys: 0.494 ± 0.293
CysLeu: 1.645 ± 0.502
CysMet: 0.165 ± 0.143
CysAsn: 0.987 ± 0.342
CysPro: 0.0 ± 0.0
CysGln: 0.329 ± 0.231
CysArg: 1.81 ± 0.562
CysSer: 0.987 ± 0.281
CysThr: 0.494 ± 0.215
CysVal: 0.658 ± 0.329
CysTrp: 0.329 ± 0.156
CysTyr: 0.329 ± 0.244
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.455 ± 0.659
AspCys: 0.823 ± 0.336
AspAsp: 3.949 ± 0.832
AspGlu: 2.797 ± 0.625
AspPhe: 2.962 ± 0.842
AspGly: 3.949 ± 0.765
AspHis: 1.481 ± 0.506
AspIle: 4.278 ± 0.9
AspLys: 2.139 ± 0.886
AspLeu: 6.417 ± 0.812
AspMet: 1.974 ± 0.358
AspAsn: 4.936 ± 0.799
AspPro: 2.962 ± 0.38
AspGln: 0.823 ± 0.434
AspArg: 3.126 ± 0.89
AspSer: 4.936 ± 0.691
AspThr: 4.113 ± 0.855
AspVal: 4.771 ± 0.801
AspTrp: 0.0 ± 0.0
AspTyr: 3.291 ± 0.507
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.442 ± 0.875
GluCys: 0.658 ± 0.312
GluAsp: 2.468 ± 0.555
GluGlu: 2.632 ± 0.707
GluPhe: 1.974 ± 0.465
GluGly: 2.468 ± 0.694
GluHis: 1.316 ± 0.608
GluIle: 3.62 ± 0.762
GluLys: 2.139 ± 0.546
GluLeu: 5.594 ± 1.07
GluMet: 1.316 ± 0.381
GluAsn: 2.303 ± 0.715
GluPro: 2.139 ± 0.381
GluGln: 1.316 ± 0.61
GluArg: 2.468 ± 0.514
GluSer: 2.468 ± 0.624
GluThr: 2.468 ± 0.565
GluVal: 4.442 ± 0.747
GluTrp: 0.329 ± 0.174
GluTyr: 2.962 ± 0.485
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.126 ± 1.232
PheCys: 0.987 ± 0.386
PheAsp: 3.291 ± 0.831
PheGlu: 1.645 ± 0.264
PhePhe: 0.165 ± 0.135
PheGly: 2.962 ± 0.915
PheHis: 0.494 ± 0.385
PheIle: 4.113 ± 0.501
PheLys: 2.139 ± 0.663
PheLeu: 1.974 ± 0.589
PheMet: 0.987 ± 0.526
PheAsn: 2.962 ± 0.426
PhePro: 0.823 ± 0.27
PheGln: 0.987 ± 0.298
PheArg: 2.303 ± 0.518
PheSer: 2.139 ± 0.449
PheThr: 3.62 ± 0.931
PheVal: 3.291 ± 0.66
PheTrp: 0.165 ± 0.135
PheTyr: 0.494 ± 0.234
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.468 ± 0.441
GlyCys: 0.329 ± 0.229
GlyAsp: 2.962 ± 0.694
GlyGlu: 2.797 ± 0.35
GlyPhe: 3.291 ± 0.714
GlyGly: 2.797 ± 0.655
GlyHis: 1.645 ± 0.471
GlyIle: 2.632 ± 0.517
GlyLys: 2.632 ± 0.679
GlyLeu: 6.91 ± 0.759
GlyMet: 1.974 ± 0.477
GlyAsn: 3.126 ± 0.453
GlyPro: 3.126 ± 0.609
GlyGln: 1.152 ± 0.582
GlyArg: 2.797 ± 0.705
GlySer: 5.923 ± 1.019
GlyThr: 3.455 ± 0.62
GlyVal: 5.265 ± 0.695
GlyTrp: 0.329 ± 0.202
GlyTyr: 2.303 ± 0.584
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.987 ± 0.353
HisCys: 0.329 ± 0.209
HisAsp: 1.316 ± 0.415
HisGlu: 1.81 ± 0.548
HisPhe: 1.152 ± 0.38
HisGly: 1.81 ± 0.809
HisHis: 0.329 ± 0.179
HisIle: 1.481 ± 0.367
HisLys: 1.481 ± 0.386
HisLeu: 1.645 ± 0.661
HisMet: 1.645 ± 0.66
HisAsn: 0.987 ± 0.494
HisPro: 1.316 ± 0.476
HisGln: 0.329 ± 0.156
HisArg: 2.139 ± 0.394
HisSer: 2.797 ± 0.893
HisThr: 1.316 ± 0.434
HisVal: 1.152 ± 0.43
HisTrp: 0.165 ± 0.141
HisTyr: 0.823 ± 0.477
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.455 ± 0.522
IleCys: 0.494 ± 0.263
IleAsp: 4.113 ± 0.709
IleGlu: 3.62 ± 0.666
IlePhe: 1.645 ± 0.484
IleGly: 4.936 ± 0.794
IleHis: 1.152 ± 0.618
IleIle: 4.442 ± 1.022
IleLys: 3.949 ± 0.672
IleLeu: 3.126 ± 0.619
IleMet: 3.784 ± 1.061
IleAsn: 4.936 ± 0.596
IlePro: 3.126 ± 0.437
IleGln: 1.481 ± 0.452
IleArg: 3.62 ± 0.404
IleSer: 4.771 ± 0.702
IleThr: 4.771 ± 0.556
IleVal: 5.758 ± 0.788
IleTrp: 0.658 ± 0.348
IleTyr: 1.152 ± 0.258
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.632 ± 0.979
LysCys: 0.329 ± 0.262
LysAsp: 3.784 ± 0.837
LysGlu: 1.974 ± 0.763
LysPhe: 2.962 ± 0.854
LysGly: 2.468 ± 0.827
LysHis: 1.316 ± 0.367
LysIle: 2.632 ± 0.615
LysLys: 2.632 ± 0.813
LysLeu: 6.088 ± 0.965
LysMet: 0.658 ± 0.46
LysAsn: 3.455 ± 0.705
LysPro: 2.303 ± 0.608
LysGln: 1.974 ± 0.388
LysArg: 2.962 ± 1.149
LysSer: 3.949 ± 0.608
LysThr: 2.303 ± 0.451
LysVal: 3.455 ± 0.606
LysTrp: 0.0 ± 0.0
LysTyr: 2.962 ± 0.886
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.594 ± 0.879
LeuCys: 1.152 ± 0.374
LeuAsp: 5.1 ± 0.696
LeuGlu: 4.442 ± 0.715
LeuPhe: 2.139 ± 0.425
LeuGly: 4.607 ± 0.577
LeuHis: 2.303 ± 0.626
LeuIle: 5.923 ± 1.162
LeuLys: 6.91 ± 1.07
LeuLeu: 6.91 ± 0.8
LeuMet: 3.784 ± 0.986
LeuAsn: 7.239 ± 1.264
LeuPro: 3.62 ± 0.679
LeuGln: 1.645 ± 0.336
LeuArg: 6.581 ± 0.966
LeuSer: 7.404 ± 0.349
LeuThr: 6.417 ± 0.872
LeuVal: 4.936 ± 1.192
LeuTrp: 0.165 ± 0.217
LeuTyr: 3.455 ± 0.668
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.784 ± 1.073
MetCys: 0.823 ± 0.398
MetAsp: 1.974 ± 0.57
MetGlu: 1.152 ± 0.237
MetPhe: 1.81 ± 0.656
MetGly: 0.987 ± 0.408
MetHis: 0.823 ± 0.304
MetIle: 1.152 ± 0.484
MetLys: 1.152 ± 0.285
MetLeu: 3.455 ± 0.443
MetMet: 0.658 ± 0.277
MetAsn: 1.152 ± 0.338
MetPro: 1.152 ± 0.371
MetGln: 0.658 ± 0.235
MetArg: 1.481 ± 0.665
MetSer: 4.113 ± 0.843
MetThr: 1.974 ± 0.561
MetVal: 1.81 ± 0.507
MetTrp: 0.329 ± 0.182
MetTyr: 1.152 ± 0.552
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.936 ± 0.713
AsnCys: 1.316 ± 0.386
AsnAsp: 4.278 ± 0.831
AsnGlu: 4.442 ± 0.912
AsnPhe: 2.468 ± 0.364
AsnGly: 3.784 ± 0.802
AsnHis: 1.316 ± 0.543
AsnIle: 5.265 ± 0.959
AsnLys: 2.797 ± 0.522
AsnLeu: 4.607 ± 0.869
AsnMet: 1.81 ± 0.516
AsnAsn: 3.784 ± 0.604
AsnPro: 2.139 ± 0.838
AsnGln: 2.303 ± 0.598
AsnArg: 3.291 ± 0.629
AsnSer: 5.265 ± 0.684
AsnThr: 4.278 ± 0.7
AsnVal: 4.936 ± 0.709
AsnTrp: 0.658 ± 0.232
AsnTyr: 2.962 ± 0.392
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.468 ± 0.866
ProCys: 0.329 ± 0.203
ProAsp: 2.139 ± 0.549
ProGlu: 2.303 ± 0.967
ProPhe: 2.632 ± 0.574
ProGly: 2.303 ± 0.707
ProHis: 0.987 ± 0.529
ProIle: 2.468 ± 0.49
ProLys: 1.645 ± 0.575
ProLeu: 3.126 ± 0.522
ProMet: 0.987 ± 0.371
ProAsn: 3.62 ± 0.665
ProPro: 1.481 ± 0.444
ProGln: 0.494 ± 0.328
ProArg: 1.481 ± 0.306
ProSer: 3.949 ± 0.542
ProThr: 1.81 ± 0.408
ProVal: 2.797 ± 0.525
ProTrp: 0.329 ± 0.222
ProTyr: 1.81 ± 0.542
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.645 ± 0.353
GlnCys: 0.165 ± 0.151
GlnAsp: 0.987 ± 0.379
GlnGlu: 0.658 ± 0.281
GlnPhe: 0.658 ± 0.27
GlnGly: 0.823 ± 0.298
GlnHis: 1.152 ± 0.4
GlnIle: 2.468 ± 0.54
GlnLys: 0.658 ± 0.3
GlnLeu: 3.291 ± 0.774
GlnMet: 0.987 ± 0.352
GlnAsn: 1.481 ± 0.456
GlnPro: 1.316 ± 0.333
GlnGln: 1.481 ± 0.479
GlnArg: 1.316 ± 0.426
GlnSer: 1.645 ± 0.464
GlnThr: 1.645 ± 0.345
GlnVal: 2.139 ± 0.592
GlnTrp: 0.165 ± 0.135
GlnTyr: 1.645 ± 0.488
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.126 ± 0.9
ArgCys: 0.823 ± 0.322
ArgAsp: 2.962 ± 0.641
ArgGlu: 1.974 ± 0.623
ArgPhe: 2.962 ± 0.943
ArgGly: 2.962 ± 0.839
ArgHis: 0.987 ± 0.278
ArgIle: 3.455 ± 0.825
ArgLys: 2.139 ± 0.511
ArgLeu: 5.1 ± 0.701
ArgMet: 2.303 ± 0.729
ArgAsn: 3.784 ± 0.974
ArgPro: 3.455 ± 0.708
ArgGln: 1.81 ± 0.464
ArgArg: 3.455 ± 0.649
ArgSer: 3.949 ± 0.698
ArgThr: 3.126 ± 0.534
ArgVal: 5.265 ± 0.731
ArgTrp: 0.494 ± 0.281
ArgTyr: 1.81 ± 0.553
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.455 ± 0.635
SerCys: 1.152 ± 0.614
SerAsp: 4.607 ± 0.985
SerGlu: 4.113 ± 0.621
SerPhe: 2.303 ± 0.44
SerGly: 6.746 ± 0.915
SerHis: 2.962 ± 0.522
SerIle: 5.265 ± 0.847
SerLys: 4.113 ± 1.376
SerLeu: 8.062 ± 0.755
SerMet: 1.81 ± 0.511
SerAsn: 4.936 ± 0.861
SerPro: 2.468 ± 0.419
SerGln: 2.632 ± 0.585
SerArg: 4.936 ± 1.266
SerSer: 7.239 ± 1.234
SerThr: 3.949 ± 0.545
SerVal: 5.758 ± 0.955
SerTrp: 0.165 ± 0.135
SerTyr: 4.442 ± 1.088
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.278 ± 0.802
ThrCys: 0.494 ± 0.229
ThrAsp: 3.455 ± 0.533
ThrGlu: 2.468 ± 0.595
ThrPhe: 2.139 ± 0.422
ThrGly: 3.949 ± 0.63
ThrHis: 1.974 ± 0.708
ThrIle: 4.607 ± 1.012
ThrLys: 3.126 ± 0.668
ThrLeu: 4.771 ± 0.478
ThrMet: 1.974 ± 0.63
ThrAsn: 3.291 ± 0.712
ThrPro: 1.645 ± 0.522
ThrGln: 2.303 ± 0.788
ThrArg: 3.455 ± 0.62
ThrSer: 4.442 ± 0.58
ThrThr: 3.126 ± 0.367
ThrVal: 5.594 ± 1.146
ThrTrp: 0.165 ± 0.14
ThrTyr: 2.962 ± 0.837
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.923 ± 0.986
ValCys: 0.823 ± 0.312
ValAsp: 5.923 ± 0.707
ValGlu: 4.607 ± 0.468
ValPhe: 2.303 ± 0.587
ValGly: 4.936 ± 0.966
ValHis: 1.481 ± 0.514
ValIle: 5.1 ± 1.367
ValLys: 5.1 ± 0.882
ValLeu: 4.771 ± 0.98
ValMet: 2.139 ± 0.536
ValAsn: 6.91 ± 1.261
ValPro: 1.974 ± 0.424
ValGln: 1.974 ± 0.819
ValArg: 3.949 ± 0.596
ValSer: 5.923 ± 0.855
ValThr: 4.936 ± 0.998
ValVal: 5.594 ± 0.707
ValTrp: 0.658 ± 0.233
ValTyr: 2.139 ± 0.692
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.658 ± 0.311
TrpGlu: 0.494 ± 0.193
TrpPhe: 0.329 ± 0.227
TrpGly: 0.165 ± 0.143
TrpHis: 0.165 ± 0.151
TrpIle: 0.0 ± 0.0
TrpLys: 0.329 ± 0.218
TrpLeu: 1.316 ± 0.564
TrpMet: 0.165 ± 0.164
TrpAsn: 0.329 ± 0.184
TrpPro: 0.165 ± 0.143
TrpGln: 0.0 ± 0.0
TrpArg: 0.658 ± 0.264
TrpSer: 0.329 ± 0.199
TrpThr: 0.329 ± 0.216
TrpVal: 0.165 ± 0.135
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.329 ± 0.192
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.303 ± 0.381
TyrCys: 0.658 ± 0.3
TyrAsp: 3.62 ± 0.643
TyrGlu: 1.645 ± 0.667
TyrPhe: 2.632 ± 0.565
TyrGly: 2.303 ± 0.336
TyrHis: 1.81 ± 0.433
TyrIle: 1.974 ± 0.545
TyrLys: 1.974 ± 0.681
TyrLeu: 2.468 ± 0.673
TyrMet: 0.494 ± 0.254
TyrAsn: 2.797 ± 0.753
TyrPro: 1.81 ± 0.394
TyrGln: 0.823 ± 0.386
TyrArg: 1.152 ± 0.304
TyrSer: 4.442 ± 0.429
TyrThr: 2.139 ± 0.516
TyrVal: 3.949 ± 0.542
TyrTrp: 0.165 ± 0.164
TyrTyr: 0.987 ± 0.328
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (6079 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
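A protein-level bootstrap of this kind might look like the sketch below. It resamples whole proteins with replacement, recomputes one dipeptide's frequency for each resample, and reports the spread of those estimates. The helper names (`pair_permille`, `bootstrap_error`), the use of the standard deviation as the error, the normalization by total dipeptide count, and the toy sequences are all assumptions for illustration, not the database's documented procedure.

```python
import random
from collections import Counter
from statistics import stdev


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over resamples drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return stdev(estimates)


# Made-up one-letter sequences, not taken from the proteome.
toy_proteins = ["MAAL", "AALK", "GGGG", "MKAA"]
print(bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.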

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski