Amino acid dipeptide frequency for Shayang Fly Virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
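As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the one-letter input format, and the choice to normalise by the total number of dipeptides are assumptions made for this example, not necessarily how Proteome-pI computes its tables.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids (one-letter codes)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Non-standard or unknown residues are mapped to 'X' (Xaa).
    Pairs are counted within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # zip pairs each residue with its successor, giving overlapping dipeptides
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input (hypothetical sequences, not the Shayang Fly Virus 3 proteome)
freqs = dipeptide_permille(["MSSLA", "SSB"])
print(freqs)
```

Dipeptides that never occur are simply absent from the returned dictionary; a full 21 x 21 table would fill those entries with 0.0.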

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
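To make the comparison with the uniform expectation concrete, the following sketch converts a few per mille values taken from the table into fractions and labels each as over- or underrepresented relative to 1/400. The helper name `classify` and the simple threshold (with no allowance for the reported error) are assumptions of this example; the page does not state the exact rule used for colouring.

```python
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely

def classify(permille):
    """Label a dipeptide frequency (in per mille) relative to the uniform expectation."""
    if permille > EXPECTED_PERMILLE:
        return "over"   # assumed to correspond to the red marking
    if permille < EXPECTED_PERMILLE:
        return "under"  # assumed to correspond to the blue marking
    return "expected"

# Values copied from the table (per mille)
table = {"SerSer": 16.173, "LeuIle": 11.517, "AlaAla": 3.431, "CysCys": 0.245, "HisTrp": 0.0}

for name, permille in table.items():
    fraction = permille * 1e-3            # per mille -> fraction
    ratio = permille / EXPECTED_PERMILLE  # fold change versus the uniform expectation
    print(f"{name}: fraction={fraction:.6f}, ratio={ratio:.2f}, {classify(permille)}")
```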

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.431 ± 0.969
AlaCys: 0.98 ± 0.349
AlaAsp: 1.225 ± 0.356
AlaGlu: 2.205 ± 1.025
AlaPhe: 1.225 ± 0.258
AlaGly: 2.94 ± 0.844
AlaHis: 0.735 ± 0.298
AlaIle: 3.431 ± 0.546
AlaLys: 5.391 ± 1.012
AlaLeu: 3.431 ± 0.75
AlaMet: 1.715 ± 0.616
AlaAsn: 2.695 ± 0.812
AlaPro: 0.98 ± 0.645
AlaGln: 0.735 ± 0.594
AlaArg: 1.96 ± 0.551
AlaSer: 2.695 ± 1.168
AlaThr: 3.676 ± 0.697
AlaVal: 2.94 ± 1.18
AlaTrp: 0.245 ± 0.147
AlaTyr: 2.695 ± 0.506
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.49 ± 0.233
CysCys: 0.245 ± 0.358
CysAsp: 0.49 ± 0.294
CysGlu: 0.735 ± 0.652
CysPhe: 1.225 ± 0.482
CysGly: 0.245 ± 0.147
CysHis: 0.49 ± 0.229
CysIle: 1.96 ± 0.622
CysLys: 0.98 ± 1.063
CysLeu: 1.715 ± 0.442
CysMet: 0.735 ± 0.298
CysAsn: 0.735 ± 0.588
CysPro: 1.225 ± 0.356
CysGln: 0.49 ± 0.292
CysArg: 1.715 ± 0.225
CysSer: 1.715 ± 0.672
CysThr: 0.49 ± 0.639
CysVal: 1.225 ± 0.55
CysTrp: 0.245 ± 0.147
CysTyr: 0.735 ± 0.44
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.205 ± 0.648
AspCys: 1.96 ± 0.329
AspAsp: 1.47 ± 0.518
AspGlu: 1.715 ± 0.399
AspPhe: 2.695 ± 0.112
AspGly: 2.45 ± 0.324
AspHis: 0.735 ± 0.215
AspIle: 4.901 ± 1.102
AspLys: 3.676 ± 0.518
AspLeu: 5.636 ± 0.343
AspMet: 1.47 ± 0.206
AspAsn: 3.676 ± 1.076
AspPro: 3.431 ± 0.769
AspGln: 1.225 ± 0.516
AspArg: 1.47 ± 0.429
AspSer: 5.636 ± 1.019
AspThr: 1.225 ± 0.654
AspVal: 1.96 ± 0.702
AspTrp: 0.735 ± 0.44
AspTyr: 1.715 ± 0.486
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.94 ± 1.024
GluCys: 0.245 ± 0.319
GluAsp: 3.185 ± 0.98
GluGlu: 4.166 ± 0.845
GluPhe: 3.431 ± 0.867
GluGly: 2.45 ± 0.873
GluHis: 1.47 ± 0.888
GluIle: 3.921 ± 1.075
GluLys: 3.431 ± 1.308
GluLeu: 5.636 ± 0.866
GluMet: 0.98 ± 0.304
GluAsn: 2.45 ± 1.166
GluPro: 1.96 ± 0.632
GluGln: 2.695 ± 1.42
GluArg: 1.96 ± 0.987
GluSer: 3.676 ± 1.092
GluThr: 3.676 ± 0.984
GluVal: 2.45 ± 0.352
GluTrp: 0.245 ± 0.147
GluTyr: 0.98 ± 0.645
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.96 ± 0.539
PheCys: 1.47 ± 0.63
PheAsp: 1.715 ± 0.874
PheGlu: 3.921 ± 0.979
PhePhe: 3.676 ± 0.933
PheGly: 3.185 ± 0.692
PheHis: 0.735 ± 0.324
PheIle: 3.676 ± 0.728
PheLys: 4.411 ± 0.7
PheLeu: 4.166 ± 1.343
PheMet: 1.715 ± 0.34
PheAsn: 3.431 ± 0.752
PhePro: 3.921 ± 1.057
PheGln: 3.185 ± 0.515
PheArg: 1.96 ± 0.406
PheSer: 5.881 ± 1.124
PheThr: 2.695 ± 0.564
PheVal: 1.715 ± 1.178
PheTrp: 1.715 ± 0.409
PheTyr: 2.45 ± 0.595
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.45 ± 0.948
GlyCys: 0.49 ± 0.294
GlyAsp: 2.94 ± 1.063
GlyGlu: 1.715 ± 0.394
GlyPhe: 3.921 ± 0.717
GlyGly: 2.205 ± 0.757
GlyHis: 0.98 ± 0.36
GlyIle: 2.695 ± 0.512
GlyLys: 2.205 ± 0.473
GlyLeu: 6.371 ± 1.192
GlyMet: 1.47 ± 0.692
GlyAsn: 1.225 ± 0.403
GlyPro: 1.47 ± 0.996
GlyGln: 0.98 ± 0.573
GlyArg: 1.225 ± 0.582
GlySer: 4.656 ± 1.263
GlyThr: 2.45 ± 0.839
GlyVal: 1.715 ± 0.399
GlyTrp: 1.715 ± 0.415
GlyTyr: 1.96 ± 0.697
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.98 ± 0.28
HisCys: 0.49 ± 0.294
HisAsp: 0.245 ± 0.147
HisGlu: 0.98 ± 0.46
HisPhe: 2.45 ± 0.686
HisGly: 1.715 ± 0.622
HisHis: 1.225 ± 0.482
HisIle: 0.735 ± 0.298
HisLys: 1.225 ± 0.539
HisLeu: 3.676 ± 0.685
HisMet: 0.245 ± 0.147
HisAsn: 0.98 ± 0.587
HisPro: 0.98 ± 0.46
HisGln: 1.225 ± 0.582
HisArg: 0.735 ± 0.215
HisSer: 2.205 ± 0.611
HisThr: 0.735 ± 0.365
HisVal: 1.225 ± 0.726
HisTrp: 0.0 ± 0.0
HisTyr: 1.225 ± 0.356
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.431 ± 0.183
IleCys: 2.94 ± 0.623
IleAsp: 3.185 ± 0.557
IleGlu: 3.921 ± 1.203
IlePhe: 4.656 ± 1.404
IleGly: 5.146 ± 0.65
IleHis: 1.715 ± 0.809
IleIle: 6.126 ± 1.035
IleLys: 6.861 ± 1.273
IleLeu: 8.821 ± 1.148
IleMet: 2.45 ± 0.681
IleAsn: 5.146 ± 1.552
IlePro: 4.166 ± 1.003
IleGln: 2.45 ± 0.586
IleArg: 3.431 ± 0.697
IleSer: 9.802 ± 1.159
IleThr: 2.94 ± 0.858
IleVal: 3.921 ± 0.807
IleTrp: 0.245 ± 0.147
IleTyr: 1.715 ± 0.507
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.431 ± 0.338
LysCys: 0.245 ± 0.147
LysAsp: 3.676 ± 0.509
LysGlu: 5.146 ± 1.242
LysPhe: 3.431 ± 0.421
LysGly: 3.431 ± 1.53
LysHis: 1.47 ± 0.531
LysIle: 4.411 ± 0.448
LysLys: 4.166 ± 0.488
LysLeu: 8.576 ± 0.975
LysMet: 1.47 ± 0.381
LysAsn: 1.96 ± 0.813
LysPro: 2.205 ± 1.11
LysGln: 1.715 ± 0.76
LysArg: 2.695 ± 0.726
LysSer: 6.616 ± 1.029
LysThr: 6.371 ± 1.011
LysVal: 4.411 ± 0.994
LysTrp: 0.735 ± 0.356
LysTyr: 2.205 ± 0.675
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.146 ± 1.4
LeuCys: 0.98 ± 0.723
LeuAsp: 5.881 ± 0.929
LeuGlu: 3.676 ± 0.435
LeuPhe: 5.881 ± 0.647
LeuGly: 4.656 ± 0.478
LeuHis: 2.205 ± 0.518
LeuIle: 11.517 ± 1.498
LeuLys: 6.861 ± 1.326
LeuLeu: 11.272 ± 1.565
LeuMet: 2.45 ± 0.395
LeuAsn: 6.616 ± 0.768
LeuPro: 4.656 ± 0.482
LeuGln: 4.656 ± 0.563
LeuArg: 5.391 ± 1.036
LeuSer: 10.537 ± 0.923
LeuThr: 7.841 ± 1.438
LeuVal: 5.146 ± 0.902
LeuTrp: 0.49 ± 0.308
LeuTyr: 2.45 ± 0.918
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.98 ± 0.398
MetCys: 0.0 ± 0.0
MetAsp: 1.715 ± 0.672
MetGlu: 1.96 ± 0.622
MetPhe: 1.47 ± 0.714
MetGly: 1.47 ± 0.7
MetHis: 0.49 ± 0.233
MetIle: 2.45 ± 0.686
MetLys: 2.205 ± 1.186
MetLeu: 1.96 ± 0.182
MetMet: 0.98 ± 0.36
MetAsn: 1.225 ± 0.516
MetPro: 0.245 ± 0.295
MetGln: 1.225 ± 0.539
MetArg: 1.225 ± 0.403
MetSer: 2.45 ± 0.395
MetThr: 1.715 ± 0.88
MetVal: 0.245 ± 0.147
MetTrp: 0.49 ± 0.229
MetTyr: 1.47 ± 0.313
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.715 ± 0.466
AsnCys: 1.715 ± 0.579
AsnAsp: 2.205 ± 0.535
AsnGlu: 2.94 ± 0.823
AsnPhe: 3.185 ± 0.82
AsnGly: 1.47 ± 0.429
AsnHis: 1.47 ± 0.534
AsnIle: 2.695 ± 0.726
AsnLys: 2.94 ± 0.759
AsnLeu: 8.821 ± 1.467
AsnMet: 1.96 ± 0.707
AsnAsn: 1.715 ± 0.225
AsnPro: 3.921 ± 0.255
AsnGln: 2.695 ± 1.034
AsnArg: 0.735 ± 0.392
AsnSer: 5.636 ± 1.072
AsnThr: 2.94 ± 0.805
AsnVal: 3.676 ± 1.049
AsnTrp: 0.49 ± 0.294
AsnTyr: 2.205 ± 0.453
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.96 ± 1.237
ProCys: 0.49 ± 0.229
ProAsp: 2.94 ± 0.869
ProGlu: 1.715 ± 0.549
ProPhe: 1.715 ± 0.409
ProGly: 2.205 ± 0.675
ProHis: 1.225 ± 0.549
ProIle: 4.411 ± 0.989
ProLys: 3.431 ± 0.686
ProLeu: 4.656 ± 0.811
ProMet: 0.98 ± 0.46
ProAsn: 2.205 ± 0.466
ProPro: 2.695 ± 1.17
ProGln: 1.96 ± 1.024
ProArg: 1.225 ± 0.869
ProSer: 6.126 ± 0.602
ProThr: 1.96 ± 0.295
ProVal: 0.98 ± 0.274
ProTrp: 0.49 ± 0.292
ProTyr: 1.47 ± 0.534
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.715 ± 0.55
GlnCys: 0.245 ± 0.319
GlnAsp: 1.96 ± 0.413
GlnGlu: 1.47 ± 0.809
GlnPhe: 3.185 ± 0.515
GlnGly: 0.98 ± 0.28
GlnHis: 0.98 ± 0.587
GlnIle: 3.921 ± 1.167
GlnLys: 2.45 ± 0.896
GlnLeu: 3.431 ± 0.787
GlnMet: 0.735 ± 0.346
GlnAsn: 2.45 ± 0.698
GlnPro: 0.735 ± 0.765
GlnGln: 0.49 ± 0.481
GlnArg: 1.225 ± 0.951
GlnSer: 1.96 ± 0.782
GlnThr: 2.695 ± 0.112
GlnVal: 1.96 ± 0.529
GlnTrp: 0.735 ± 0.215
GlnTyr: 1.715 ± 1.051
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.45 ± 0.527
ArgCys: 0.98 ± 0.645
ArgAsp: 2.94 ± 1.013
ArgGlu: 2.45 ± 0.586
ArgPhe: 2.94 ± 0.385
ArgGly: 1.225 ± 0.582
ArgHis: 0.49 ± 0.294
ArgIle: 4.656 ± 1.018
ArgLys: 1.96 ± 0.717
ArgLeu: 2.695 ± 0.472
ArgMet: 1.225 ± 0.403
ArgAsn: 3.185 ± 0.528
ArgPro: 1.225 ± 0.356
ArgGln: 2.205 ± 0.757
ArgArg: 0.98 ± 0.28
ArgSer: 4.166 ± 0.649
ArgThr: 0.735 ± 0.356
ArgVal: 2.205 ± 0.909
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.735 ± 0.559
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.96 ± 0.345
SerCys: 2.205 ± 1.132
SerAsp: 5.881 ± 0.836
SerGlu: 4.901 ± 0.775
SerPhe: 4.901 ± 2.497
SerGly: 3.676 ± 1.129
SerHis: 2.205 ± 1.039
SerIle: 10.292 ± 1.98
SerLys: 6.371 ± 1.192
SerLeu: 11.762 ± 1.749
SerMet: 1.715 ± 0.704
SerAsn: 5.636 ± 1.019
SerPro: 3.676 ± 1.121
SerGln: 2.94 ± 0.761
SerArg: 4.656 ± 1.087
SerSer: 16.173 ± 3.335
SerThr: 4.656 ± 1.184
SerVal: 3.676 ± 1.033
SerTrp: 1.225 ± 0.258
SerTyr: 4.411 ± 0.492
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.921 ± 1.344
ThrCys: 0.735 ± 0.392
ThrAsp: 1.715 ± 0.486
ThrGlu: 3.676 ± 1.255
ThrPhe: 2.45 ± 0.545
ThrGly: 2.45 ± 0.334
ThrHis: 1.47 ± 1.0
ThrIle: 4.901 ± 0.389
ThrLys: 2.94 ± 0.638
ThrLeu: 6.371 ± 1.199
ThrMet: 0.98 ± 0.398
ThrAsn: 2.94 ± 0.794
ThrPro: 1.715 ± 0.606
ThrGln: 1.47 ± 0.784
ThrArg: 2.45 ± 0.827
ThrSer: 4.656 ± 1.418
ThrThr: 1.96 ± 0.538
ThrVal: 2.205 ± 0.828
ThrTrp: 1.225 ± 0.419
ThrTyr: 2.45 ± 1.179
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.225 ± 0.356
ValCys: 1.225 ± 0.273
ValAsp: 3.921 ± 0.846
ValGlu: 2.205 ± 1.15
ValPhe: 3.431 ± 0.486
ValGly: 0.245 ± 0.285
ValHis: 1.715 ± 0.634
ValIle: 2.45 ± 0.497
ValLys: 4.656 ± 0.817
ValLeu: 3.921 ± 1.063
ValMet: 0.49 ± 0.294
ValAsn: 3.431 ± 1.13
ValPro: 3.921 ± 0.714
ValGln: 0.98 ± 0.587
ValArg: 1.47 ± 0.429
ValSer: 4.166 ± 1.151
ValThr: 2.695 ± 0.722
ValVal: 0.735 ± 0.44
ValTrp: 0.98 ± 0.28
ValTyr: 1.47 ± 0.647
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.98 ± 0.584
TrpCys: 0.0 ± 0.0
TrpAsp: 0.49 ± 0.229
TrpGlu: 0.49 ± 0.294
TrpPhe: 0.49 ± 0.229
TrpGly: 0.98 ± 0.36
TrpHis: 0.245 ± 0.319
TrpIle: 0.49 ± 0.294
TrpLys: 0.98 ± 0.587
TrpLeu: 0.98 ± 0.288
TrpMet: 0.49 ± 0.481
TrpAsn: 0.735 ± 0.365
TrpPro: 0.245 ± 0.147
TrpGln: 0.0 ± 0.0
TrpArg: 1.225 ± 0.396
TrpSer: 1.47 ± 0.714
TrpThr: 0.98 ± 0.458
TrpVal: 0.735 ± 0.44
TrpTrp: 0.245 ± 0.147
TrpTyr: 0.245 ± 0.147
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.96 ± 0.538
TyrCys: 0.245 ± 0.295
TyrAsp: 2.205 ± 0.952
TyrGlu: 1.96 ± 1.393
TyrPhe: 1.47 ± 0.902
TyrGly: 1.715 ± 0.225
TyrHis: 1.225 ± 0.403
TyrIle: 3.431 ± 0.83
TyrLys: 1.225 ± 0.635
TyrLeu: 4.411 ± 0.673
TyrMet: 1.47 ± 0.313
TyrAsn: 2.695 ± 0.601
TyrPro: 1.47 ± 0.429
TyrGln: 1.715 ± 0.309
TyrArg: 1.715 ± 0.386
TyrSer: 2.695 ± 0.95
TyrThr: 0.245 ± 0.147
TyrVal: 2.205 ± 0.173
TyrTrp: 0.245 ± 0.295
TyrTyr: 1.225 ± 0.419
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (4082 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
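A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread across resamples serves as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions for illustration; the note above only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics

def permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes, e.g. 'SS') in per mille."""
    total = sum(max(len(p) - 1, 0) for p in proteins)
    hits = sum(
        1 for p in proteins for a, b in zip(p, p[1:]) if a + b == dipeptide
    )
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences, not the real viral proteins)
toy = ["MSSLASS", "MKLLIS", "MSSSSK", "MALKT", "MIISL"]
print(permille(toy, "SS"), bootstrap_error(toy, "SS"))
```

With only a handful of proteins, as here, each resample can differ strongly from the original set, which is consistent with the relatively large errors seen for some dipeptides in the table.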

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski