Amino acid dipepetide frequency for Imjin River virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.222AlaAla: 5.222 ± 2.638
1.451AlaCys: 1.451 ± 0.521
3.771AlaAsp: 3.771 ± 0.299
2.611AlaGlu: 2.611 ± 0.26
2.321AlaPhe: 2.321 ± 0.672
4.352AlaGly: 4.352 ± 1.653
1.451AlaHis: 1.451 ± 0.454
4.642AlaIle: 4.642 ± 2.25
2.321AlaLys: 2.321 ± 0.284
5.802AlaLeu: 5.802 ± 0.217
3.481AlaMet: 3.481 ± 0.324
2.901AlaAsn: 2.901 ± 1.118
2.321AlaPro: 2.321 ± 0.262
1.16AlaGln: 1.16 ± 0.343
3.191AlaArg: 3.191 ± 1.859
5.512AlaSer: 5.512 ± 0.71
3.191AlaThr: 3.191 ± 0.525
2.901AlaVal: 2.901 ± 1.043
0.29AlaTrp: 0.29 ± 0.152
4.062AlaTyr: 4.062 ± 0.276
0.0AlaXaa: 0.0 ± 0.0
Cys
1.741CysAla: 1.741 ± 0.717
0.29CysCys: 0.29 ± 0.384
1.16CysAsp: 1.16 ± 0.206
1.451CysGlu: 1.451 ± 0.762
0.0CysPhe: 0.0 ± 0.0
1.16CysGly: 1.16 ± 1.456
0.29CysHis: 0.29 ± 0.152
0.87CysIle: 0.87 ± 0.358
1.741CysLys: 1.741 ± 0.583
2.901CysLeu: 2.901 ± 0.405
0.29CysMet: 0.29 ± 0.364
2.031CysAsn: 2.031 ± 0.251
0.87CysPro: 0.87 ± 0.636
0.87CysGln: 0.87 ± 0.358
1.451CysArg: 1.451 ± 0.441
0.87CysSer: 0.87 ± 0.276
1.16CysThr: 1.16 ± 0.57
0.58CysVal: 0.58 ± 0.305
0.0CysTrp: 0.0 ± 0.0
2.901CysTyr: 2.901 ± 0.875
0.0CysXaa: 0.0 ± 0.0
Asp
4.062AspAla: 4.062 ± 0.942
1.451AspCys: 1.451 ± 1.001
2.901AspAsp: 2.901 ± 0.108
4.062AspGlu: 4.062 ± 0.601
3.191AspPhe: 3.191 ± 0.926
2.901AspGly: 2.901 ± 0.108
0.87AspHis: 0.87 ± 0.279
4.352AspIle: 4.352 ± 0.86
3.771AspLys: 3.771 ± 0.35
6.382AspLeu: 6.382 ± 1.668
1.741AspMet: 1.741 ± 0.099
3.191AspAsn: 3.191 ± 2.347
1.741AspPro: 1.741 ± 0.552
2.901AspGln: 2.901 ± 0.565
2.031AspArg: 2.031 ± 0.811
2.321AspSer: 2.321 ± 1.195
3.771AspThr: 3.771 ± 0.35
2.611AspVal: 2.611 ± 0.307
0.87AspTrp: 0.87 ± 0.276
2.611AspTyr: 2.611 ± 0.628
0.0AspXaa: 0.0 ± 0.0
Glu
5.222GluAla: 5.222 ± 0.631
1.16GluCys: 1.16 ± 0.57
4.642GluAsp: 4.642 ± 2.027
4.932GluGlu: 4.932 ± 2.756
1.741GluPhe: 1.741 ± 0.583
4.062GluGly: 4.062 ± 0.314
1.741GluHis: 1.741 ± 1.355
3.771GluIle: 3.771 ± 0.843
3.771GluLys: 3.771 ± 1.149
4.932GluLeu: 4.932 ± 1.385
1.451GluMet: 1.451 ± 0.966
2.031GluAsn: 2.031 ± 0.852
1.451GluPro: 1.451 ± 0.551
2.611GluGln: 2.611 ± 0.992
2.321GluArg: 2.321 ± 0.848
3.771GluSer: 3.771 ± 1.606
3.771GluThr: 3.771 ± 1.152
3.191GluVal: 3.191 ± 1.586
0.29GluTrp: 0.29 ± 0.152
2.321GluTyr: 2.321 ± 0.284
0.0GluXaa: 0.0 ± 0.0
Phe
1.741PheAla: 1.741 ± 0.855
2.321PheCys: 2.321 ± 1.218
1.741PheAsp: 1.741 ± 0.898
2.321PheGlu: 2.321 ± 0.847
1.16PhePhe: 1.16 ± 0.609
2.611PheGly: 2.611 ± 0.315
0.58PheHis: 0.58 ± 0.305
2.611PheIle: 2.611 ± 0.583
2.611PheLys: 2.611 ± 0.628
2.611PheLeu: 2.611 ± 1.011
0.58PheMet: 0.58 ± 0.305
2.031PheAsn: 2.031 ± 1.066
2.031PhePro: 2.031 ± 0.811
0.0PheGln: 0.0 ± 0.0
1.741PheArg: 1.741 ± 0.914
3.191PheSer: 3.191 ± 0.943
2.031PheThr: 2.031 ± 0.995
2.031PheVal: 2.031 ± 0.604
1.16PheTrp: 1.16 ± 1.021
0.58PheTyr: 0.58 ± 0.285
0.0PheXaa: 0.0 ± 0.0
Gly
2.611GlyAla: 2.611 ± 2.073
1.451GlyCys: 1.451 ± 0.54
2.611GlyAsp: 2.611 ± 0.704
2.611GlyGlu: 2.611 ± 0.704
4.062GlyPhe: 4.062 ± 0.502
3.771GlyGly: 3.771 ± 1.695
1.451GlyHis: 1.451 ± 0.762
1.741GlyIle: 1.741 ± 0.099
1.451GlyLys: 1.451 ± 0.521
4.932GlyLeu: 4.932 ± 1.096
2.901GlyMet: 2.901 ± 0.923
2.611GlyAsn: 2.611 ± 0.738
1.741GlyPro: 1.741 ± 0.717
2.031GlyGln: 2.031 ± 0.33
0.87GlyArg: 0.87 ± 0.672
4.642GlySer: 4.642 ± 0.882
2.321GlyThr: 2.321 ± 0.412
3.191GlyVal: 3.191 ± 0.047
0.29GlyTrp: 0.29 ± 0.364
3.191GlyTyr: 3.191 ± 0.474
0.0GlyXaa: 0.0 ± 0.0
His
0.58HisAla: 0.58 ± 0.511
0.58HisCys: 0.58 ± 0.285
1.741HisAsp: 1.741 ± 0.898
0.29HisGlu: 0.29 ± 0.364
0.58HisPhe: 0.58 ± 0.305
1.16HisGly: 1.16 ± 0.343
0.29HisHis: 0.29 ± 0.152
1.451HisIle: 1.451 ± 0.454
2.031HisLys: 2.031 ± 0.604
2.901HisLeu: 2.901 ± 0.565
0.87HisMet: 0.87 ± 0.276
0.87HisAsn: 0.87 ± 0.276
1.16HisPro: 1.16 ± 0.57
0.58HisGln: 0.58 ± 0.305
0.87HisArg: 0.87 ± 0.276
1.741HisSer: 1.741 ± 0.855
1.451HisThr: 1.451 ± 0.762
3.481HisVal: 3.481 ± 1.03
0.0HisTrp: 0.0 ± 0.0
1.16HisTyr: 1.16 ± 0.609
0.0HisXaa: 0.0 ± 0.0
Ile
2.901IleAla: 2.901 ± 0.576
1.741IleCys: 1.741 ± 0.428
2.611IleAsp: 2.611 ± 0.556
3.771IleGlu: 3.771 ± 0.812
2.321IlePhe: 2.321 ± 0.709
2.321IleGly: 2.321 ± 1.195
1.16IleHis: 1.16 ± 0.57
3.481IleIle: 3.481 ± 0.366
4.062IleLys: 4.062 ± 0.957
5.802IleLeu: 5.802 ± 1.502
2.031IleMet: 2.031 ± 0.704
2.901IleAsn: 2.901 ± 0.576
4.932IlePro: 4.932 ± 0.901
2.901IleGln: 2.901 ± 0.936
6.092IleArg: 6.092 ± 1.405
4.352IleSer: 4.352 ± 1.469
6.672IleThr: 6.672 ± 0.998
4.642IleVal: 4.642 ± 1.373
1.16IleTrp: 1.16 ± 0.343
2.901IleTyr: 2.901 ± 0.407
0.0IleXaa: 0.0 ± 0.0
Lys
3.771LysAla: 3.771 ± 0.466
0.29LysCys: 0.29 ± 0.152
4.062LysAsp: 4.062 ± 1.637
4.062LysGlu: 4.062 ± 0.303
2.321LysPhe: 2.321 ± 0.809
2.031LysGly: 2.031 ± 0.565
1.741LysHis: 1.741 ± 0.567
4.642LysIle: 4.642 ± 1.236
2.901LysLys: 2.901 ± 1.001
7.253LysLeu: 7.253 ± 1.48
0.87LysMet: 0.87 ± 0.276
3.481LysAsn: 3.481 ± 0.658
2.031LysPro: 2.031 ± 0.818
1.451LysGln: 1.451 ± 0.551
1.741LysArg: 1.741 ± 0.099
3.191LysSer: 3.191 ± 0.532
3.771LysThr: 3.771 ± 0.299
4.352LysVal: 4.352 ± 0.654
0.58LysTrp: 0.58 ± 0.305
1.451LysTyr: 1.451 ± 0.521
0.0LysXaa: 0.0 ± 0.0
Leu
5.802LeuAla: 5.802 ± 0.811
1.741LeuCys: 1.741 ± 0.583
6.092LeuAsp: 6.092 ± 1.186
6.382LeuGlu: 6.382 ± 1.447
2.611LeuPhe: 2.611 ± 0.307
3.481LeuGly: 3.481 ± 1.079
0.87LeuHis: 0.87 ± 0.457
5.222LeuIle: 5.222 ± 1.284
4.932LeuLys: 4.932 ± 1.115
7.833LeuLeu: 7.833 ± 2.018
2.321LeuMet: 2.321 ± 0.262
3.771LeuAsn: 3.771 ± 1.148
4.062LeuPro: 4.062 ± 1.317
3.481LeuGln: 3.481 ± 0.198
8.413LeuArg: 8.413 ± 0.477
8.993LeuSer: 8.993 ± 1.461
4.352LeuThr: 4.352 ± 0.652
6.382LeuVal: 6.382 ± 0.565
0.29LeuTrp: 0.29 ± 0.152
6.672LeuTyr: 6.672 ± 0.798
0.0LeuXaa: 0.0 ± 0.0
Met
1.741MetAla: 1.741 ± 0.717
1.16MetCys: 1.16 ± 0.206
2.321MetAsp: 2.321 ± 0.262
2.031MetGlu: 2.031 ± 0.33
1.451MetPhe: 1.451 ± 1.001
0.58MetGly: 0.58 ± 0.728
0.58MetHis: 0.58 ± 0.285
1.741MetIle: 1.741 ± 0.927
2.031MetLys: 2.031 ± 0.3
2.031MetLeu: 2.031 ± 0.599
1.16MetMet: 1.16 ± 0.579
2.031MetAsn: 2.031 ± 0.704
1.741MetPro: 1.741 ± 0.583
0.87MetGln: 0.87 ± 0.279
0.87MetArg: 0.87 ± 0.672
2.321MetSer: 2.321 ± 0.847
2.031MetThr: 2.031 ± 0.704
1.16MetVal: 1.16 ± 0.599
0.87MetTrp: 0.87 ± 0.276
2.321MetTyr: 2.321 ± 0.687
0.0MetXaa: 0.0 ± 0.0
Asn
3.481AsnAla: 3.481 ± 0.588
0.58AsnCys: 0.58 ± 0.305
2.031AsnAsp: 2.031 ± 0.3
3.191AsnGlu: 3.191 ± 1.476
1.741AsnPhe: 1.741 ± 0.583
2.321AsnGly: 2.321 ± 0.284
2.031AsnHis: 2.031 ± 0.704
2.901AsnIle: 2.901 ± 1.158
3.191AsnLys: 3.191 ± 0.943
3.771AsnLeu: 3.771 ± 0.65
1.451AsnMet: 1.451 ± 0.054
2.321AsnAsn: 2.321 ± 1.197
1.741AsnPro: 1.741 ± 0.717
2.031AsnGln: 2.031 ± 0.599
1.741AsnArg: 1.741 ± 0.898
2.611AsnSer: 2.611 ± 0.829
1.741AsnThr: 1.741 ± 0.583
2.611AsnVal: 2.611 ± 0.26
0.58AsnTrp: 0.58 ± 0.305
2.901AsnTyr: 2.901 ± 1.118
0.0AsnXaa: 0.0 ± 0.0
Pro
0.29ProAla: 0.29 ± 0.152
0.29ProCys: 0.29 ± 0.364
3.191ProAsp: 3.191 ± 0.474
2.611ProGlu: 2.611 ± 1.561
1.16ProPhe: 1.16 ± 0.343
1.741ProGly: 1.741 ± 0.875
1.16ProHis: 1.16 ± 0.343
4.062ProIle: 4.062 ± 0.502
2.031ProLys: 2.031 ± 1.198
3.771ProLeu: 3.771 ± 0.65
1.16ProMet: 1.16 ± 0.599
1.451ProAsn: 1.451 ± 0.054
1.741ProPro: 1.741 ± 0.914
1.741ProGln: 1.741 ± 0.428
1.451ProArg: 1.451 ± 0.551
4.352ProSer: 4.352 ± 0.849
3.771ProThr: 3.771 ± 1.303
2.031ProVal: 2.031 ± 0.604
0.58ProTrp: 0.58 ± 0.299
2.321ProTyr: 2.321 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
4.062GlnAla: 4.062 ± 1.17
0.58GlnCys: 0.58 ± 0.728
2.611GlnAsp: 2.611 ± 0.769
1.741GlnGlu: 1.741 ± 0.099
0.29GlnPhe: 0.29 ± 0.152
2.901GlnGly: 2.901 ± 1.001
1.16GlnHis: 1.16 ± 0.343
2.901GlnIle: 2.901 ± 0.407
1.741GlnLys: 1.741 ± 0.428
3.481GlnLeu: 3.481 ± 1.166
1.16GlnMet: 1.16 ± 0.206
0.87GlnAsn: 0.87 ± 0.279
1.451GlnPro: 1.451 ± 0.054
1.451GlnGln: 1.451 ± 0.054
1.451GlnArg: 1.451 ± 0.762
1.741GlnSer: 1.741 ± 1.428
1.16GlnThr: 1.16 ± 0.996
2.901GlnVal: 2.901 ± 1.043
0.0GlnTrp: 0.0 ± 0.0
2.611GlnTyr: 2.611 ± 0.583
0.0GlnXaa: 0.0 ± 0.0
Arg
1.741ArgAla: 1.741 ± 0.927
0.58ArgCys: 0.58 ± 0.285
2.901ArgAsp: 2.901 ± 1.475
4.352ArgGlu: 4.352 ± 1.037
2.611ArgPhe: 2.611 ± 0.992
1.741ArgGly: 1.741 ± 0.099
1.741ArgHis: 1.741 ± 0.875
4.062ArgIle: 4.062 ± 0.942
3.191ArgLys: 3.191 ± 1.005
6.092ArgLeu: 6.092 ± 0.448
1.741ArgMet: 1.741 ± 0.927
1.741ArgAsn: 1.741 ± 0.583
2.031ArgPro: 2.031 ± 0.599
2.031ArgGln: 2.031 ± 0.852
3.771ArgArg: 3.771 ± 2.158
4.352ArgSer: 4.352 ± 1.361
3.191ArgThr: 3.191 ± 1.394
2.611ArgVal: 2.611 ± 1.148
0.29ArgTrp: 0.29 ± 0.152
2.031ArgTyr: 2.031 ± 0.852
0.0ArgXaa: 0.0 ± 0.0
Ser
4.062SerAla: 4.062 ± 0.746
1.741SerCys: 1.741 ± 0.552
4.642SerAsp: 4.642 ± 1.345
5.802SerGlu: 5.802 ± 0.86
2.321SerPhe: 2.321 ± 0.865
3.771SerGly: 3.771 ± 0.688
1.741SerHis: 1.741 ± 0.552
6.382SerIle: 6.382 ± 0.955
4.352SerLys: 4.352 ± 1.089
7.543SerLeu: 7.543 ± 2.713
1.16SerMet: 1.16 ± 0.609
2.321SerAsn: 2.321 ± 0.262
2.031SerPro: 2.031 ± 1.066
2.611SerGln: 2.611 ± 0.769
4.352SerArg: 4.352 ± 0.977
7.543SerSer: 7.543 ± 0.415
5.802SerThr: 5.802 ± 0.321
3.771SerVal: 3.771 ± 1.165
1.451SerTrp: 1.451 ± 0.54
2.611SerTyr: 2.611 ± 1.011
0.0SerXaa: 0.0 ± 0.0
Thr
4.932ThrAla: 4.932 ± 1.236
2.321ThrCys: 2.321 ± 0.709
2.031ThrAsp: 2.031 ± 0.811
2.611ThrGlu: 2.611 ± 0.556
1.741ThrPhe: 1.741 ± 0.552
3.191ThrGly: 3.191 ± 0.435
1.16ThrHis: 1.16 ± 0.609
4.062ThrIle: 4.062 ± 0.276
3.481ThrLys: 3.481 ± 0.795
5.222ThrLeu: 5.222 ± 0.631
2.321ThrMet: 2.321 ± 0.262
2.611ThrAsn: 2.611 ± 0.307
2.321ThrPro: 2.321 ± 0.88
2.611ThrGln: 2.611 ± 1.959
4.352ThrArg: 4.352 ± 0.362
5.222ThrSer: 5.222 ± 0.751
2.901ThrThr: 2.901 ± 0.565
4.352ThrVal: 4.352 ± 1.104
1.741ThrTrp: 1.741 ± 0.855
2.901ThrTyr: 2.901 ± 1.08
0.0ThrXaa: 0.0 ± 0.0
Val
5.512ValAla: 5.512 ± 1.667
1.741ValCys: 1.741 ± 0.099
3.771ValAsp: 3.771 ± 0.812
2.321ValGlu: 2.321 ± 1.55
2.031ValPhe: 2.031 ± 0.33
2.901ValGly: 2.901 ± 0.708
1.451ValHis: 1.451 ± 0.916
3.771ValIle: 3.771 ± 1.27
1.741ValLys: 1.741 ± 0.855
5.222ValLeu: 5.222 ± 0.52
1.451ValMet: 1.451 ± 0.54
2.901ValAsn: 2.901 ± 0.708
3.191ValPro: 3.191 ± 0.86
1.741ValGln: 1.741 ± 0.397
4.062ValArg: 4.062 ± 0.502
5.222ValSer: 5.222 ± 1.342
4.352ValThr: 4.352 ± 0.654
4.642ValVal: 4.642 ± 1.389
0.29ValTrp: 0.29 ± 0.152
3.191ValTyr: 3.191 ± 0.536
0.0ValXaa: 0.0 ± 0.0
Trp
1.16TrpAla: 1.16 ± 0.206
0.29TrpCys: 0.29 ± 0.152
0.29TrpAsp: 0.29 ± 0.364
0.29TrpGlu: 0.29 ± 0.364
1.451TrpPhe: 1.451 ± 0.54
0.29TrpGly: 0.29 ± 0.364
0.29TrpHis: 0.29 ± 0.152
0.87TrpIle: 0.87 ± 0.276
0.87TrpLys: 0.87 ± 0.276
0.58TrpLeu: 0.58 ± 0.285
0.87TrpMet: 0.87 ± 0.279
0.58TrpAsn: 0.58 ± 0.285
0.29TrpPro: 0.29 ± 0.152
0.0TrpGln: 0.0 ± 0.0
0.29TrpArg: 0.29 ± 0.152
0.87TrpSer: 0.87 ± 0.457
0.87TrpThr: 0.87 ± 0.279
0.87TrpVal: 0.87 ± 0.457
0.0TrpTrp: 0.0 ± 0.0
0.87TrpTyr: 0.87 ± 0.799
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.321TyrAla: 2.321 ± 0.847
0.87TyrCys: 0.87 ± 0.457
2.611TyrAsp: 2.611 ± 0.704
1.741TyrGlu: 1.741 ± 0.099
0.58TyrPhe: 0.58 ± 0.299
3.191TyrGly: 3.191 ± 0.474
2.031TyrHis: 2.031 ± 0.76
5.222TyrIle: 5.222 ± 1.534
3.771TyrLys: 3.771 ± 0.26
4.642TyrLeu: 4.642 ± 0.525
1.741TyrMet: 1.741 ± 0.37
2.321TyrAsn: 2.321 ± 0.687
2.031TyrPro: 2.031 ± 0.565
3.191TyrGln: 3.191 ± 0.047
1.741TyrArg: 1.741 ± 0.567
3.191TyrSer: 3.191 ± 0.771
3.771TyrThr: 3.771 ± 1.571
3.191TyrVal: 3.191 ± 0.867
1.16TyrTrp: 1.16 ± 0.206
2.321TyrTyr: 2.321 ± 0.403
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3448 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski