Amino acid dipepetide frequency for Hubei picorna-like virus 70

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.578AlaAla: 5.578 ± 0.846
1.174AlaCys: 1.174 ± 0.791
2.349AlaAsp: 2.349 ± 1.237
2.642AlaGlu: 2.642 ± 0.569
2.936AlaPhe: 2.936 ± 0.445
4.991AlaGly: 4.991 ± 2.933
1.468AlaHis: 1.468 ± 0.728
5.578AlaIle: 5.578 ± 0.876
4.11AlaLys: 4.11 ± 1.516
4.698AlaLeu: 4.698 ± 0.654
1.468AlaMet: 1.468 ± 0.772
2.349AlaAsn: 2.349 ± 0.61
1.762AlaPro: 1.762 ± 0.449
3.23AlaGln: 3.23 ± 0.526
2.642AlaArg: 2.642 ± 0.823
3.817AlaSer: 3.817 ± 0.665
4.698AlaThr: 4.698 ± 1.359
3.523AlaVal: 3.523 ± 1.629
0.294AlaTrp: 0.294 ± 0.452
3.817AlaTyr: 3.817 ± 0.173
0.0AlaXaa: 0.0 ± 0.0
Cys
2.055CysAla: 2.055 ± 0.326
0.294CysCys: 0.294 ± 0.455
1.468CysAsp: 1.468 ± 0.772
1.762CysGlu: 1.762 ± 0.927
1.762CysPhe: 1.762 ± 0.263
1.174CysGly: 1.174 ± 0.33
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.587CysLys: 0.587 ± 0.309
3.23CysLeu: 3.23 ± 0.856
0.587CysMet: 0.587 ± 0.309
1.174CysAsn: 1.174 ± 0.371
1.468CysPro: 1.468 ± 0.54
0.294CysGln: 0.294 ± 0.155
0.294CysArg: 0.294 ± 0.155
2.055CysSer: 2.055 ± 0.825
1.468CysThr: 1.468 ± 0.282
1.468CysVal: 1.468 ± 0.772
0.0CysTrp: 0.0 ± 0.0
0.587CysTyr: 0.587 ± 0.353
0.0CysXaa: 0.0 ± 0.0
Asp
2.642AspAla: 2.642 ± 0.735
1.468AspCys: 1.468 ± 0.773
2.642AspAsp: 2.642 ± 1.078
2.936AspGlu: 2.936 ± 0.673
4.11AspPhe: 4.11 ± 1.0
2.936AspGly: 2.936 ± 1.1
1.174AspHis: 1.174 ± 0.618
3.523AspIle: 3.523 ± 1.4
4.404AspLys: 4.404 ± 0.928
4.11AspLeu: 4.11 ± 1.059
0.587AspMet: 0.587 ± 0.309
2.055AspAsn: 2.055 ± 1.077
3.523AspPro: 3.523 ± 0.592
2.055AspGln: 2.055 ± 1.215
1.468AspArg: 1.468 ± 0.773
4.404AspSer: 4.404 ± 0.845
3.523AspThr: 3.523 ± 0.525
3.523AspVal: 3.523 ± 0.993
0.294AspTrp: 0.294 ± 0.155
2.055AspTyr: 2.055 ± 0.667
0.0AspXaa: 0.0 ± 0.0
Glu
3.23GluAla: 3.23 ± 1.193
0.881GluCys: 0.881 ± 0.464
3.23GluAsp: 3.23 ± 1.714
3.23GluGlu: 3.23 ± 1.549
3.523GluPhe: 3.523 ± 1.315
2.055GluGly: 2.055 ± 0.788
1.174GluHis: 1.174 ± 0.618
3.523GluIle: 3.523 ± 1.4
4.991GluLys: 4.991 ± 1.759
2.055GluLeu: 2.055 ± 0.788
2.349GluMet: 2.349 ± 0.398
3.23GluAsn: 3.23 ± 1.218
2.642GluPro: 2.642 ± 0.569
2.055GluGln: 2.055 ± 0.667
1.762GluArg: 1.762 ± 0.657
3.523GluSer: 3.523 ± 1.005
2.936GluThr: 2.936 ± 0.445
2.936GluVal: 2.936 ± 0.832
1.174GluTrp: 1.174 ± 0.33
2.936GluTyr: 2.936 ± 1.08
0.0GluXaa: 0.0 ± 0.0
Phe
4.404PheAla: 4.404 ± 1.711
0.881PheCys: 0.881 ± 0.464
3.23PheAsp: 3.23 ± 0.307
3.523PheGlu: 3.523 ± 0.819
3.23PhePhe: 3.23 ± 0.526
2.055PheGly: 2.055 ± 0.724
0.587PheHis: 0.587 ± 0.353
3.23PheIle: 3.23 ± 0.307
4.698PheLys: 4.698 ± 1.045
5.285PheLeu: 5.285 ± 0.485
1.174PheMet: 1.174 ± 0.722
4.11PheAsn: 4.11 ± 2.861
2.349PhePro: 2.349 ± 0.864
2.055PheGln: 2.055 ± 0.705
1.468PheArg: 1.468 ± 0.773
3.817PheSer: 3.817 ± 0.733
2.936PheThr: 2.936 ± 0.447
3.817PheVal: 3.817 ± 1.289
0.587PheTrp: 0.587 ± 0.309
0.881PheTyr: 0.881 ± 0.494
0.0PheXaa: 0.0 ± 0.0
Gly
3.817GlyAla: 3.817 ± 0.86
1.174GlyCys: 1.174 ± 0.618
2.349GlyAsp: 2.349 ± 1.237
1.468GlyGlu: 1.468 ± 0.416
2.642GlyPhe: 2.642 ± 0.592
3.23GlyGly: 3.23 ± 2.73
0.587GlyHis: 0.587 ± 0.309
3.523GlyIle: 3.523 ± 1.855
3.23GlyLys: 3.23 ± 0.945
4.991GlyLeu: 4.991 ± 2.912
1.762GlyMet: 1.762 ± 0.927
2.642GlyAsn: 2.642 ± 0.948
2.349GlyPro: 2.349 ± 0.454
1.762GlyGln: 1.762 ± 1.458
0.587GlyArg: 0.587 ± 0.876
1.468GlySer: 1.468 ± 0.56
5.285GlyThr: 5.285 ± 1.347
5.872GlyVal: 5.872 ± 2.046
0.881GlyTrp: 0.881 ± 1.272
1.762GlyTyr: 1.762 ± 1.168
0.0GlyXaa: 0.0 ± 0.0
His
0.881HisAla: 0.881 ± 0.305
0.587HisCys: 0.587 ± 0.396
1.174HisAsp: 1.174 ± 0.618
2.055HisGlu: 2.055 ± 0.788
1.468HisPhe: 1.468 ± 0.54
0.294HisGly: 0.294 ± 0.155
0.881HisHis: 0.881 ± 0.464
1.762HisIle: 1.762 ± 0.534
1.762HisLys: 1.762 ± 0.927
2.936HisLeu: 2.936 ± 0.447
0.587HisMet: 0.587 ± 0.309
1.174HisAsn: 1.174 ± 0.446
0.881HisPro: 0.881 ± 0.305
0.881HisGln: 0.881 ± 0.464
1.762HisArg: 1.762 ± 0.534
2.349HisSer: 2.349 ± 0.892
0.881HisThr: 0.881 ± 0.305
0.881HisVal: 0.881 ± 0.464
0.0HisTrp: 0.0 ± 0.0
0.881HisTyr: 0.881 ± 0.392
0.0HisXaa: 0.0 ± 0.0
Ile
3.817IleAla: 3.817 ± 1.507
1.762IleCys: 1.762 ± 0.927
4.11IleAsp: 4.11 ± 1.306
2.936IleGlu: 2.936 ± 0.447
1.762IlePhe: 1.762 ± 0.263
3.523IleGly: 3.523 ± 0.676
0.881IleHis: 0.881 ± 0.464
4.404IleIle: 4.404 ± 1.962
4.11IleLys: 4.11 ± 0.663
4.698IleLeu: 4.698 ± 1.414
1.468IleMet: 1.468 ± 0.773
4.991IleAsn: 4.991 ± 0.894
3.523IlePro: 3.523 ± 0.676
4.11IleGln: 4.11 ± 1.719
2.055IleArg: 2.055 ± 0.825
6.753IleSer: 6.753 ± 0.887
3.23IleThr: 3.23 ± 0.307
2.349IleVal: 2.349 ± 0.807
0.587IleTrp: 0.587 ± 0.309
2.642IleTyr: 2.642 ± 0.915
0.0IleXaa: 0.0 ± 0.0
Lys
1.468LysAla: 1.468 ± 0.54
0.294LysCys: 0.294 ± 0.155
4.991LysAsp: 4.991 ± 1.477
4.698LysGlu: 4.698 ± 2.113
3.23LysPhe: 3.23 ± 1.193
2.055LysGly: 2.055 ± 1.06
2.055LysHis: 2.055 ± 1.082
4.991LysIle: 4.991 ± 1.993
2.642LysLys: 2.642 ± 0.594
7.046LysLeu: 7.046 ± 2.934
2.936LysMet: 2.936 ± 0.711
2.936LysAsn: 2.936 ± 0.915
1.762LysPro: 1.762 ± 0.534
2.642LysGln: 2.642 ± 1.068
1.174LysArg: 1.174 ± 0.791
3.817LysSer: 3.817 ± 1.202
5.872LysThr: 5.872 ± 1.285
3.523LysVal: 3.523 ± 1.51
0.881LysTrp: 0.881 ± 0.494
4.11LysTyr: 4.11 ± 1.411
0.0LysXaa: 0.0 ± 0.0
Leu
7.34LeuAla: 7.34 ± 1.706
1.468LeuCys: 1.468 ± 0.416
4.991LeuAsp: 4.991 ± 0.395
4.991LeuGlu: 4.991 ± 0.938
2.349LeuPhe: 2.349 ± 0.807
3.523LeuGly: 3.523 ± 1.13
2.936LeuHis: 2.936 ± 1.1
4.698LeuIle: 4.698 ± 1.413
4.991LeuLys: 4.991 ± 0.777
6.753LeuLeu: 6.753 ± 1.588
2.936LeuMet: 2.936 ± 0.447
6.753LeuAsn: 6.753 ± 2.431
4.404LeuPro: 4.404 ± 1.839
4.698LeuGln: 4.698 ± 0.827
3.23LeuArg: 3.23 ± 0.938
4.698LeuSer: 4.698 ± 0.875
4.11LeuThr: 4.11 ± 0.652
5.578LeuVal: 5.578 ± 1.279
1.174LeuTrp: 1.174 ± 0.618
2.642LeuTyr: 2.642 ± 0.569
0.0LeuXaa: 0.0 ± 0.0
Met
1.762MetAla: 1.762 ± 0.628
0.881MetCys: 0.881 ± 0.464
1.468MetAsp: 1.468 ± 1.281
1.762MetGlu: 1.762 ± 0.263
1.468MetPhe: 1.468 ± 0.773
0.881MetGly: 0.881 ± 0.464
0.294MetHis: 0.294 ± 0.155
1.174MetIle: 1.174 ± 0.618
1.468MetLys: 1.468 ± 0.54
1.468MetLeu: 1.468 ± 0.54
0.587MetMet: 0.587 ± 0.353
1.762MetAsn: 1.762 ± 0.628
1.468MetPro: 1.468 ± 0.772
0.881MetGln: 0.881 ± 0.744
1.762MetArg: 1.762 ± 0.657
1.468MetSer: 1.468 ± 0.416
1.762MetThr: 1.762 ± 0.657
0.587MetVal: 0.587 ± 0.309
0.294MetTrp: 0.294 ± 0.452
0.294MetTyr: 0.294 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
3.23AsnAla: 3.23 ± 0.526
0.881AsnCys: 0.881 ± 0.839
2.349AsnAsp: 2.349 ± 0.61
3.523AsnGlu: 3.523 ± 1.005
5.285AsnPhe: 5.285 ± 3.052
2.936AsnGly: 2.936 ± 2.334
0.294AsnHis: 0.294 ± 0.155
3.817AsnIle: 3.817 ± 2.599
3.817AsnLys: 3.817 ± 0.855
7.634AsnLeu: 7.634 ± 1.221
1.762AsnMet: 1.762 ± 1.067
3.817AsnAsn: 3.817 ± 4.813
2.349AsnPro: 2.349 ± 1.247
2.642AsnGln: 2.642 ± 1.859
3.817AsnArg: 3.817 ± 0.973
5.578AsnSer: 5.578 ± 0.597
4.11AsnThr: 4.11 ± 1.543
3.523AsnVal: 3.523 ± 1.213
1.762AsnTrp: 1.762 ± 0.606
2.349AsnTyr: 2.349 ± 1.663
0.0AsnXaa: 0.0 ± 0.0
Pro
2.936ProAla: 2.936 ± 0.711
1.468ProCys: 1.468 ± 0.282
2.349ProAsp: 2.349 ± 1.157
1.174ProGlu: 1.174 ± 0.33
1.762ProPhe: 1.762 ± 1.264
3.817ProGly: 3.817 ± 0.681
0.587ProHis: 0.587 ± 0.309
2.055ProIle: 2.055 ± 0.326
2.349ProLys: 2.349 ± 0.437
4.698ProLeu: 4.698 ± 1.169
0.0ProMet: 0.0 ± 0.0
4.698ProAsn: 4.698 ± 2.493
4.404ProPro: 4.404 ± 2.395
1.762ProGln: 1.762 ± 1.718
2.349ProArg: 2.349 ± 1.003
2.642ProSer: 2.642 ± 1.693
2.349ProThr: 2.349 ± 0.905
2.936ProVal: 2.936 ± 0.447
0.881ProTrp: 0.881 ± 0.392
1.468ProTyr: 1.468 ± 0.773
0.0ProXaa: 0.0 ± 0.0
Gln
4.11GlnAla: 4.11 ± 0.624
0.881GlnCys: 0.881 ± 0.494
2.642GlnAsp: 2.642 ± 0.735
1.468GlnGlu: 1.468 ± 0.776
3.523GlnPhe: 3.523 ± 0.959
1.174GlnGly: 1.174 ± 0.722
2.349GlnHis: 2.349 ± 0.437
2.055GlnIle: 2.055 ± 1.031
2.349GlnLys: 2.349 ± 0.807
2.642GlnLeu: 2.642 ± 0.746
0.587GlnMet: 0.587 ± 0.353
3.523GlnAsn: 3.523 ± 2.049
1.468GlnPro: 1.468 ± 0.776
1.762GlnGln: 1.762 ± 1.488
0.881GlnArg: 0.881 ± 0.305
2.642GlnSer: 2.642 ± 2.485
2.349GlnThr: 2.349 ± 0.741
3.817GlnVal: 3.817 ± 1.634
0.0GlnTrp: 0.0 ± 0.0
2.055GlnTyr: 2.055 ± 1.41
0.0GlnXaa: 0.0 ± 0.0
Arg
1.468ArgAla: 1.468 ± 0.56
1.174ArgCys: 1.174 ± 0.371
0.881ArgAsp: 0.881 ± 0.542
2.055ArgGlu: 2.055 ± 0.632
2.055ArgPhe: 2.055 ± 0.825
1.468ArgGly: 1.468 ± 0.773
0.881ArgHis: 0.881 ± 0.305
2.349ArgIle: 2.349 ± 0.661
2.936ArgLys: 2.936 ± 1.1
2.349ArgLeu: 2.349 ± 0.807
0.294ArgMet: 0.294 ± 0.155
3.523ArgAsn: 3.523 ± 0.525
0.587ArgPro: 0.587 ± 0.309
2.349ArgGln: 2.349 ± 0.661
3.23ArgArg: 3.23 ± 1.361
2.936ArgSer: 2.936 ± 0.963
2.642ArgThr: 2.642 ± 0.529
1.468ArgVal: 1.468 ± 0.54
0.587ArgTrp: 0.587 ± 0.596
1.762ArgTyr: 1.762 ± 0.657
0.0ArgXaa: 0.0 ± 0.0
Ser
4.11SerAla: 4.11 ± 1.978
0.294SerCys: 0.294 ± 0.155
5.285SerAsp: 5.285 ± 1.355
3.523SerGlu: 3.523 ± 1.4
3.817SerPhe: 3.817 ± 0.851
4.991SerGly: 4.991 ± 1.024
0.881SerHis: 0.881 ± 0.392
4.404SerIle: 4.404 ± 0.83
3.523SerLys: 3.523 ± 1.073
4.404SerLeu: 4.404 ± 1.192
0.587SerMet: 0.587 ± 0.396
3.23SerAsn: 3.23 ± 1.785
3.817SerPro: 3.817 ± 1.695
2.349SerGln: 2.349 ± 1.096
3.817SerArg: 3.817 ± 1.155
5.872SerSer: 5.872 ± 0.919
3.523SerThr: 3.523 ± 1.75
5.578SerVal: 5.578 ± 1.124
2.349SerTrp: 2.349 ± 1.003
2.936SerTyr: 2.936 ± 1.434
0.0SerXaa: 0.0 ± 0.0
Thr
3.817ThrAla: 3.817 ± 1.022
1.468ThrCys: 1.468 ± 0.416
2.642ThrAsp: 2.642 ± 0.641
2.349ThrGlu: 2.349 ± 0.807
3.817ThrPhe: 3.817 ± 1.198
4.11ThrGly: 4.11 ± 1.012
2.055ThrHis: 2.055 ± 1.163
5.578ThrIle: 5.578 ± 0.521
2.936ThrLys: 2.936 ± 0.516
5.578ThrLeu: 5.578 ± 0.876
1.468ThrMet: 1.468 ± 0.507
3.817ThrAsn: 3.817 ± 1.182
2.936ThrPro: 2.936 ± 0.349
2.349ThrGln: 2.349 ± 0.92
1.468ThrArg: 1.468 ± 0.416
3.817ThrSer: 3.817 ± 1.182
5.872ThrThr: 5.872 ± 1.606
5.578ThrVal: 5.578 ± 0.883
1.468ThrTrp: 1.468 ± 0.416
2.055ThrTyr: 2.055 ± 1.193
0.0ThrXaa: 0.0 ± 0.0
Val
3.23ValAla: 3.23 ± 2.252
2.936ValCys: 2.936 ± 0.641
3.523ValAsp: 3.523 ± 1.51
4.11ValGlu: 4.11 ± 1.575
3.23ValPhe: 3.23 ± 0.695
3.817ValGly: 3.817 ± 1.503
1.174ValHis: 1.174 ± 0.618
3.523ValIle: 3.523 ± 1.083
4.404ValLys: 4.404 ± 1.362
6.459ValLeu: 6.459 ± 1.438
1.468ValMet: 1.468 ± 0.773
5.578ValAsn: 5.578 ± 1.573
2.936ValPro: 2.936 ± 0.968
2.349ValGln: 2.349 ± 0.742
1.468ValArg: 1.468 ± 0.773
3.523ValSer: 3.523 ± 0.514
3.523ValThr: 3.523 ± 1.005
2.642ValVal: 2.642 ± 1.063
0.587ValTrp: 0.587 ± 0.396
2.936ValTyr: 2.936 ± 0.861
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.294TrpCys: 0.294 ± 0.455
0.881TrpAsp: 0.881 ± 0.744
0.881TrpGlu: 0.881 ± 0.796
0.881TrpPhe: 0.881 ± 0.464
1.174TrpGly: 1.174 ± 0.623
1.468TrpHis: 1.468 ± 0.772
0.881TrpIle: 0.881 ± 0.464
1.174TrpLys: 1.174 ± 0.33
0.881TrpLeu: 0.881 ± 0.464
0.294TrpMet: 0.294 ± 0.452
1.174TrpAsn: 1.174 ± 0.618
0.294TrpPro: 0.294 ± 0.155
0.587TrpGln: 0.587 ± 0.596
0.0TrpArg: 0.0 ± 0.0
1.468TrpSer: 1.468 ± 0.776
0.587TrpThr: 0.587 ± 0.876
0.587TrpVal: 0.587 ± 0.309
0.294TrpTrp: 0.294 ± 0.155
0.881TrpTyr: 0.881 ± 0.464
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.23TyrAla: 3.23 ± 1.229
1.468TyrCys: 1.468 ± 0.728
0.881TyrAsp: 0.881 ± 0.796
2.349TyrGlu: 2.349 ± 0.585
1.468TyrPhe: 1.468 ± 2.05
1.174TyrGly: 1.174 ± 0.623
2.349TyrHis: 2.349 ± 0.807
2.642TyrIle: 2.642 ± 0.823
2.642TyrLys: 2.642 ± 0.978
2.349TyrLeu: 2.349 ± 0.92
0.294TyrMet: 0.294 ± 0.155
2.936TyrAsn: 2.936 ± 0.861
1.762TyrPro: 1.762 ± 0.641
1.468TyrGln: 1.468 ± 1.474
1.762TyrArg: 1.762 ± 0.927
2.642TyrSer: 2.642 ± 1.391
3.817TyrThr: 3.817 ± 1.636
3.23TyrVal: 3.23 ± 0.856
0.587TyrTrp: 0.587 ± 0.309
0.881TyrTyr: 0.881 ± 0.542
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3407 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski