Amino acid dipeptide frequency for Shark River virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
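The counting described above can be sketched as follows. This is a minimal illustration, not the database's own implementation: the function names `dipeptide_permille` and `classify`, the use of one-letter codes, the mapping of unknown residues to `X`, and the choice to count overlapping pairs within each protein only are all assumptions. The red/blue split is likewise assumed to be a simple comparison against the uniform expectation of 1/400.

```python
from collections import Counter
from itertools import product

# Assumption: sequences use one-letter codes for the 20 standard residues.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Overlapping pairs are counted inside each protein; a pair never spans
    two proteins. Residues outside the alphabet are mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in alphabet else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    letters = alphabet + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(letters, repeat=2)
    }


def classify(freq_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

For example, `classify(2.964)` returns `"over"` and `classify(1.976)` returns `"under"`, since the uniform expectation is 2.5‰.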

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions (for example, 2.5‰ = 0.0025 = 0.25%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.964 ± 2.124
AlaCys: 1.976 ± 0.862
AlaAsp: 2.223 ± 0.764
AlaGlu: 2.47 ± 0.618
AlaPhe: 2.47 ± 0.733
AlaGly: 0.988 ± 0.904
AlaHis: 0.988 ± 0.578
AlaIle: 3.706 ± 0.79
AlaLys: 3.458 ± 0.733
AlaLeu: 3.953 ± 2.233
AlaMet: 0.741 ± 0.795
AlaAsn: 3.211 ± 1.44
AlaPro: 0.988 ± 0.472
AlaGln: 1.482 ± 0.497
AlaArg: 2.47 ± 0.85
AlaSer: 1.482 ± 0.475
AlaThr: 2.47 ± 0.923
AlaVal: 1.235 ± 1.148
AlaTrp: 0.247 ± 0.154
AlaTyr: 1.976 ± 1.251
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.223 ± 0.806
CysCys: 0.247 ± 0.154
CysAsp: 0.494 ± 0.158
CysGlu: 0.988 ± 0.904
CysPhe: 1.729 ± 0.935
CysGly: 2.223 ± 1.075
CysHis: 1.729 ± 0.862
CysIle: 3.458 ± 1.87
CysLys: 2.223 ± 1.379
CysLeu: 1.235 ± 0.506
CysMet: 0.494 ± 0.158
CysAsn: 2.47 ± 0.791
CysPro: 0.741 ± 0.358
CysGln: 0.741 ± 0.216
CysArg: 1.482 ± 0.717
CysSer: 1.482 ± 1.026
CysThr: 1.729 ± 0.658
CysVal: 0.988 ± 0.904
CysTrp: 0.247 ± 0.154
CysTyr: 1.235 ± 0.506
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.976 ± 1.032
AspCys: 0.988 ± 0.578
AspAsp: 2.717 ± 0.767
AspGlu: 3.211 ± 1.16
AspPhe: 3.706 ± 2.351
AspGly: 2.223 ± 0.475
AspHis: 0.741 ± 0.358
AspIle: 5.435 ± 1.741
AspLys: 2.964 ± 1.021
AspLeu: 8.399 ± 0.741
AspMet: 2.717 ± 1.564
AspAsn: 2.47 ± 0.762
AspPro: 2.47 ± 1.266
AspGln: 2.717 ± 0.375
AspArg: 1.729 ± 0.879
AspSer: 1.482 ± 1.026
AspThr: 3.211 ± 0.853
AspVal: 1.482 ± 0.63
AspTrp: 0.741 ± 1.502
AspTyr: 2.717 ± 0.767
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.458 ± 1.167
GluCys: 1.235 ± 0.801
GluAsp: 3.211 ± 1.027
GluGlu: 3.706 ± 1.112
GluPhe: 5.435 ± 2.332
GluGly: 1.729 ± 0.493
GluHis: 1.482 ± 0.717
GluIle: 7.658 ± 1.644
GluLys: 4.941 ± 1.93
GluLeu: 7.164 ± 2.581
GluMet: 2.223 ± 1.131
GluAsn: 3.458 ± 0.753
GluPro: 1.235 ± 0.772
GluGln: 3.458 ± 0.495
GluArg: 3.458 ± 0.972
GluSer: 2.964 ± 0.836
GluThr: 3.706 ± 1.054
GluVal: 3.211 ± 1.249
GluTrp: 0.988 ± 0.61
GluTyr: 1.976 ± 0.414
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.988 ± 0.34
PheCys: 1.976 ± 0.933
PheAsp: 3.706 ± 1.07
PheGlu: 4.447 ± 0.75
PhePhe: 2.964 ± 0.412
PheGly: 2.47 ± 0.967
PheHis: 0.741 ± 0.358
PheIle: 3.211 ± 0.497
PheLys: 5.929 ± 0.958
PheLeu: 4.447 ± 3.402
PheMet: 1.729 ± 0.874
PheAsn: 3.706 ± 1.08
PhePro: 1.235 ± 1.154
PheGln: 0.494 ± 0.158
PheArg: 1.729 ± 0.371
PheSer: 4.2 ± 0.88
PheThr: 3.458 ± 0.985
PheVal: 1.729 ± 0.869
PheTrp: 0.988 ± 0.617
PheTyr: 2.964 ± 1.698
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.482 ± 1.726
GlyCys: 2.223 ± 1.075
GlyAsp: 3.211 ± 0.662
GlyGlu: 2.964 ± 1.122
GlyPhe: 1.482 ± 1.182
GlyGly: 0.0 ± 0.0
GlyHis: 0.247 ± 0.226
GlyIle: 4.694 ± 1.798
GlyLys: 2.717 ± 0.776
GlyLeu: 3.458 ± 1.317
GlyMet: 0.741 ± 0.678
GlyAsn: 3.211 ± 1.16
GlyPro: 0.741 ± 0.358
GlyGln: 1.729 ± 0.575
GlyArg: 1.729 ± 0.658
GlySer: 2.964 ± 2.603
GlyThr: 2.717 ± 1.517
GlyVal: 1.729 ± 0.44
GlyTrp: 0.494 ± 0.158
GlyTyr: 0.988 ± 0.317
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.494 ± 0.452
HisCys: 0.988 ± 1.032
HisAsp: 0.988 ± 0.34
HisGlu: 0.741 ± 0.358
HisPhe: 0.988 ± 0.34
HisGly: 1.482 ± 0.497
HisHis: 0.494 ± 0.158
HisIle: 1.729 ± 0.781
HisLys: 0.741 ± 0.216
HisLeu: 0.741 ± 0.578
HisMet: 0.247 ± 0.226
HisAsn: 2.717 ± 0.747
HisPro: 0.494 ± 0.158
HisGln: 0.741 ± 1.126
HisArg: 0.494 ± 0.624
HisSer: 2.223 ± 0.645
HisThr: 2.717 ± 0.77
HisVal: 1.729 ± 0.44
HisTrp: 0.247 ± 0.154
HisTyr: 0.741 ± 0.358
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.458 ± 1.997
IleCys: 2.47 ± 1.927
IleAsp: 4.2 ± 0.88
IleGlu: 6.423 ± 1.952
IlePhe: 3.953 ± 1.725
IleGly: 3.458 ± 1.745
IleHis: 1.976 ± 0.933
IleIle: 5.929 ± 1.553
IleLys: 8.399 ± 1.824
IleLeu: 9.634 ± 2.403
IleMet: 1.729 ± 0.493
IleAsn: 4.447 ± 1.29
IlePro: 2.223 ± 0.648
IleGln: 2.47 ± 1.012
IleArg: 3.706 ± 1.037
IleSer: 9.14 ± 1.956
IleThr: 5.188 ± 1.299
IleVal: 4.694 ± 1.328
IleTrp: 0.988 ± 0.317
IleTyr: 2.47 ± 0.918
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.223 ± 1.023
LysCys: 1.976 ± 1.156
LysAsp: 5.682 ± 1.042
LysGlu: 6.67 ± 1.469
LysPhe: 5.188 ± 0.627
LysGly: 3.706 ± 0.77
LysHis: 2.47 ± 1.175
LysIle: 5.435 ± 0.97
LysLys: 4.447 ± 1.005
LysLeu: 9.387 ± 0.916
LysMet: 3.211 ± 0.976
LysAsn: 2.717 ± 0.799
LysPro: 3.458 ± 0.61
LysGln: 2.964 ± 0.676
LysArg: 1.976 ± 0.582
LysSer: 6.423 ± 1.842
LysThr: 5.682 ± 0.528
LysVal: 4.694 ± 0.854
LysTrp: 0.988 ± 0.472
LysTyr: 4.2 ± 1.196
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.694 ± 1.002
LeuCys: 1.729 ± 0.493
LeuAsp: 4.941 ± 1.54
LeuGlu: 7.411 ± 0.914
LeuPhe: 4.447 ± 1.289
LeuGly: 1.482 ± 1.377
LeuHis: 2.47 ± 1.06
LeuIle: 5.682 ± 1.629
LeuLys: 10.375 ± 4.507
LeuLeu: 8.646 ± 0.887
LeuMet: 3.211 ± 1.126
LeuAsn: 6.917 ± 1.505
LeuPro: 3.953 ± 0.825
LeuGln: 2.223 ± 2.21
LeuArg: 4.941 ± 2.419
LeuSer: 8.399 ± 2.428
LeuThr: 5.435 ± 4.028
LeuVal: 5.929 ± 2.095
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.706 ± 0.832
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.741 ± 0.216
MetCys: 0.988 ± 0.317
MetAsp: 2.223 ± 1.087
MetGlu: 2.47 ± 0.754
MetPhe: 1.235 ± 0.459
MetGly: 1.235 ± 1.057
MetHis: 0.247 ± 0.226
MetIle: 3.211 ± 1.126
MetLys: 1.482 ± 0.475
MetLeu: 3.211 ± 1.411
MetMet: 0.741 ± 0.216
MetAsn: 1.482 ± 0.63
MetPro: 2.47 ± 0.791
MetGln: 1.235 ± 1.382
MetArg: 1.235 ± 0.483
MetSer: 2.964 ± 0.593
MetThr: 1.976 ± 0.633
MetVal: 1.482 ± 0.63
MetTrp: 0.0 ± 0.0
MetTyr: 0.247 ± 0.226
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.717 ± 0.375
AsnCys: 1.729 ± 0.658
AsnAsp: 3.458 ± 1.852
AsnGlu: 3.953 ± 0.665
AsnPhe: 1.482 ± 0.475
AsnGly: 2.717 ± 1.545
AsnHis: 2.223 ± 0.502
AsnIle: 4.2 ± 1.31
AsnLys: 5.435 ± 1.527
AsnLeu: 5.188 ± 2.676
AsnMet: 2.964 ± 0.836
AsnAsn: 4.447 ± 0.881
AsnPro: 1.976 ± 0.943
AsnGln: 1.976 ± 1.235
AsnArg: 2.717 ± 0.767
AsnSer: 4.2 ± 0.923
AsnThr: 3.953 ± 0.618
AsnVal: 0.741 ± 0.358
AsnTrp: 0.741 ± 0.463
AsnTyr: 2.717 ± 0.767
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.482 ± 0.972
ProCys: 0.0 ± 0.0
ProAsp: 2.223 ± 0.648
ProGlu: 1.729 ± 1.081
ProPhe: 1.729 ± 0.658
ProGly: 2.223 ± 0.905
ProHis: 0.494 ± 0.452
ProIle: 4.2 ± 1.44
ProLys: 3.706 ± 0.65
ProLeu: 1.976 ± 0.943
ProMet: 0.741 ± 1.102
ProAsn: 1.729 ± 0.371
ProPro: 0.247 ± 0.154
ProGln: 0.741 ± 0.216
ProArg: 1.482 ± 1.156
ProSer: 2.223 ± 1.085
ProThr: 2.717 ± 1.516
ProVal: 1.482 ± 0.553
ProTrp: 0.494 ± 0.309
ProTyr: 1.235 ± 0.346
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.223 ± 1.056
GlnCys: 0.988 ± 0.34
GlnAsp: 1.729 ± 2.497
GlnGlu: 3.706 ± 1.943
GlnPhe: 1.729 ± 0.493
GlnGly: 1.482 ± 0.717
GlnHis: 0.494 ± 0.158
GlnIle: 2.964 ± 0.819
GlnLys: 2.717 ± 0.776
GlnLeu: 3.211 ± 1.824
GlnMet: 1.235 ± 0.346
GlnAsn: 1.729 ± 2.077
GlnPro: 0.741 ± 0.358
GlnGln: 1.235 ± 1.419
GlnArg: 0.988 ± 0.34
GlnSer: 1.482 ± 0.432
GlnThr: 1.976 ± 0.855
GlnVal: 1.976 ± 1.251
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.482 ± 0.497
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.729 ± 0.371
ArgCys: 1.976 ± 0.862
ArgAsp: 2.47 ± 0.931
ArgGlu: 2.223 ± 0.338
ArgPhe: 2.223 ± 0.918
ArgGly: 1.482 ± 0.475
ArgHis: 1.235 ± 1.096
ArgIle: 4.2 ± 1.477
ArgLys: 3.458 ± 1.108
ArgLeu: 4.447 ± 1.073
ArgMet: 1.482 ± 0.622
ArgAsn: 2.223 ± 1.085
ArgPro: 1.235 ± 2.173
ArgGln: 1.729 ± 2.436
ArgArg: 1.729 ± 0.658
ArgSer: 2.47 ± 0.918
ArgThr: 0.988 ± 0.34
ArgVal: 1.976 ± 0.982
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.235 ± 0.483
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.717 ± 0.888
SerCys: 1.976 ± 1.156
SerAsp: 4.2 ± 0.722
SerGlu: 3.458 ± 0.495
SerPhe: 4.694 ± 0.65
SerGly: 1.976 ± 0.361
SerHis: 1.482 ± 0.497
SerIle: 6.917 ± 2.383
SerLys: 8.152 ± 1.443
SerLeu: 8.152 ± 3.224
SerMet: 1.482 ± 0.63
SerAsn: 3.706 ± 1.07
SerPro: 2.47 ± 0.692
SerGln: 2.47 ± 0.382
SerArg: 2.223 ± 0.338
SerSer: 4.694 ± 1.294
SerThr: 4.694 ± 0.514
SerVal: 4.2 ± 1.606
SerTrp: 0.741 ± 0.463
SerTyr: 2.964 ± 0.746
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.964 ± 0.593
ThrCys: 2.223 ± 1.237
ThrAsp: 3.953 ± 1.109
ThrGlu: 3.953 ± 1.829
ThrPhe: 3.211 ± 1.819
ThrGly: 2.717 ± 1.513
ThrHis: 0.247 ± 0.226
ThrIle: 6.423 ± 0.588
ThrLys: 5.188 ± 0.948
ThrLeu: 4.941 ± 3.406
ThrMet: 1.482 ± 0.432
ThrAsn: 1.976 ± 0.554
ThrPro: 2.223 ± 0.876
ThrGln: 1.976 ± 2.012
ThrArg: 3.211 ± 0.698
ThrSer: 5.929 ± 1.132
ThrThr: 4.2 ± 1.43
ThrVal: 0.988 ± 0.317
ThrTrp: 0.988 ± 0.568
ThrTyr: 3.953 ± 1.109
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.482 ± 0.553
ValCys: 0.988 ± 0.578
ValAsp: 1.235 ± 0.682
ValGlu: 2.717 ± 0.993
ValPhe: 2.47 ± 0.692
ValGly: 3.211 ± 1.126
ValHis: 0.988 ± 0.34
ValIle: 2.964 ± 0.692
ValLys: 3.458 ± 0.879
ValLeu: 3.953 ± 0.691
ValMet: 1.976 ± 0.603
ValAsn: 2.717 ± 0.375
ValPro: 1.482 ± 0.475
ValGln: 1.976 ± 1.159
ValArg: 1.482 ± 0.972
ValSer: 4.694 ± 0.514
ValThr: 1.976 ± 0.681
ValVal: 1.235 ± 0.682
ValTrp: 0.247 ± 0.226
ValTyr: 2.47 ± 0.762
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.494 ± 0.586
TrpCys: 0.0 ± 0.0
TrpAsp: 0.247 ± 0.154
TrpGlu: 0.494 ± 0.309
TrpPhe: 0.494 ± 0.158
TrpGly: 0.988 ± 0.568
TrpHis: 0.0 ± 0.0
TrpIle: 0.247 ± 0.154
TrpLys: 0.0 ± 0.0
TrpLeu: 0.988 ± 0.317
TrpMet: 0.0 ± 0.0
TrpAsn: 0.988 ± 0.34
TrpPro: 0.0 ± 0.0
TrpGln: 0.494 ± 0.309
TrpArg: 0.247 ± 0.633
TrpSer: 1.482 ± 0.497
TrpThr: 0.494 ± 0.309
TrpVal: 0.741 ± 1.102
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.741 ± 0.216
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.988 ± 0.819
TyrCys: 1.482 ± 1.026
TyrAsp: 0.988 ± 0.34
TyrGlu: 2.964 ± 0.836
TyrPhe: 1.482 ± 0.63
TyrGly: 1.729 ± 0.44
TyrHis: 0.988 ± 0.317
TyrIle: 4.694 ± 0.73
TyrLys: 3.706 ± 0.787
TyrLeu: 3.458 ± 0.9
TyrMet: 1.482 ± 0.432
TyrAsn: 3.211 ± 0.899
TyrPro: 2.47 ± 1.109
TyrGln: 1.235 ± 0.977
TyrArg: 1.729 ± 1.268
TyrSer: 2.47 ± 0.762
TyrThr: 3.458 ± 1.852
TyrVal: 1.482 ± 0.63
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.235 ± 0.346
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4049 amino acids)
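The table entries can be read into a dictionary for further analysis. The sketch below is an assumption about how one might parse the text as shown on this page (three-letter codes, a colon, then value ± error); the names `ENTRY` and `parse_entries` are hypothetical, and the format of the downloadable CSV file may differ.

```python
import re

# Matches e.g. "AlaAla: 2.964 ± 2.124"; any text before the dipeptide
# name on the same line is ignored because the pattern is searched, not anchored.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entries(text):
    """Return {(first, second): (value_permille, error_permille)}."""
    table = {}
    for first, second, value, error in ENTRY.findall(text):
        table[(first, second)] = (float(value), float(error))
    return table


sample = "AlaAla: 2.964 ± 2.124\nLeuLys: 10.375 ± 4.507\n"
entries = parse_entries(sample)
# Dipeptide with the highest frequency among the parsed entries.
most_frequent = max(entries, key=lambda pair: entries[pair][0])
```

With the two sample lines, `most_frequent` is `("Leu", "Lys")`.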

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
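Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is only an illustration under assumptions. The function names are hypothetical, sequences are assumed to use one-letter codes, and using the sample standard deviation as the error is an assumption, since the page does not state which spread measure is reported.

```python
import random
from collections import Counter
from statistics import stdev


def frequency_permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within a protein; pairs never span proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_permille(resample, dipeptide))
    return stdev(estimates)
```

With only a few proteins in the proteome, each resample can differ strongly from the original set, which is consistent with errors that are large relative to the values themselves.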

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski