Amino acid dipepetide frequency for Castlerea virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.375AlaAla: 5.375 ± 5.443
0.672AlaCys: 0.672 ± 0.341
3.359AlaAsp: 3.359 ± 1.008
2.015AlaGlu: 2.015 ± 1.024
2.351AlaPhe: 2.351 ± 1.306
3.695AlaGly: 3.695 ± 1.267
1.008AlaHis: 1.008 ± 0.512
2.351AlaIle: 2.351 ± 2.128
2.687AlaLys: 2.687 ± 1.365
9.07AlaLeu: 9.07 ± 1.587
1.344AlaMet: 1.344 ± 0.683
1.344AlaAsn: 1.344 ± 0.683
3.023AlaPro: 3.023 ± 3.384
2.015AlaGln: 2.015 ± 1.024
2.687AlaArg: 2.687 ± 2.055
5.375AlaSer: 5.375 ± 1.967
3.023AlaThr: 3.023 ± 1.045
4.031AlaVal: 4.031 ± 3.036
0.0AlaTrp: 0.0 ± 0.0
1.68AlaTyr: 1.68 ± 0.853
0.0AlaXaa: 0.0 ± 0.0
Cys
1.68CysAla: 1.68 ± 1.048
0.336CysCys: 0.336 ± 0.171
1.68CysAsp: 1.68 ± 0.504
1.344CysGlu: 1.344 ± 0.683
1.68CysPhe: 1.68 ± 0.504
0.336CysGly: 0.336 ± 0.171
0.672CysHis: 0.672 ± 0.604
1.008CysIle: 1.008 ± 0.52
0.672CysLys: 0.672 ± 0.341
2.687CysLeu: 2.687 ± 0.965
0.336CysMet: 0.336 ± 0.171
1.344CysAsn: 1.344 ± 0.683
1.344CysPro: 1.344 ± 0.683
0.0CysGln: 0.0 ± 0.0
1.344CysArg: 1.344 ± 0.683
2.015CysSer: 2.015 ± 0.578
1.68CysThr: 1.68 ± 0.853
2.351CysVal: 2.351 ± 0.757
0.0CysTrp: 0.0 ± 0.0
0.672CysTyr: 0.672 ± 0.341
0.0CysXaa: 0.0 ± 0.0
Asp
3.695AspAla: 3.695 ± 1.877
2.015AspCys: 2.015 ± 1.024
2.687AspAsp: 2.687 ± 1.365
2.351AspGlu: 2.351 ± 0.687
3.695AspPhe: 3.695 ± 0.469
2.687AspGly: 2.687 ± 0.965
0.0AspHis: 0.0 ± 0.0
3.695AspIle: 3.695 ± 0.469
2.351AspLys: 2.351 ± 0.687
5.039AspLeu: 5.039 ± 0.894
1.344AspMet: 1.344 ± 0.683
2.687AspAsn: 2.687 ± 1.365
1.68AspPro: 1.68 ± 0.504
1.344AspGln: 1.344 ± 0.683
2.015AspArg: 2.015 ± 0.578
4.031AspSer: 4.031 ± 0.453
2.687AspThr: 2.687 ± 0.984
4.367AspVal: 4.367 ± 1.257
0.336AspTrp: 0.336 ± 0.72
1.344AspTyr: 1.344 ± 0.683
0.0AspXaa: 0.0 ± 0.0
Glu
1.344GluAla: 1.344 ± 0.683
2.351GluCys: 2.351 ± 0.687
1.68GluAsp: 1.68 ± 0.853
3.023GluGlu: 3.023 ± 1.536
4.031GluPhe: 4.031 ± 2.829
1.68GluGly: 1.68 ± 0.853
1.344GluHis: 1.344 ± 0.483
5.71GluIle: 5.71 ± 1.645
4.031GluLys: 4.031 ± 1.346
5.039GluLeu: 5.039 ± 1.5
1.68GluMet: 1.68 ± 0.853
2.351GluAsn: 2.351 ± 0.687
3.695GluPro: 3.695 ± 1.071
1.344GluGln: 1.344 ± 0.683
2.015GluArg: 2.015 ± 0.578
4.367GluSer: 4.367 ± 1.588
3.695GluThr: 3.695 ± 1.466
4.031GluVal: 4.031 ± 2.048
0.672GluTrp: 0.672 ± 0.604
2.351GluTyr: 2.351 ± 0.95
0.0GluXaa: 0.0 ± 0.0
Phe
1.68PheAla: 1.68 ± 0.973
1.344PheCys: 1.344 ± 0.683
4.031PheAsp: 4.031 ± 0.453
3.359PheGlu: 3.359 ± 1.503
3.695PhePhe: 3.695 ± 1.466
2.687PheGly: 2.687 ± 0.817
2.015PheHis: 2.015 ± 0.578
2.015PheIle: 2.015 ± 1.039
2.015PheLys: 2.015 ± 1.813
7.054PheLeu: 7.054 ± 2.164
0.672PheMet: 0.672 ± 0.655
5.039PheAsn: 5.039 ± 1.946
2.015PhePro: 2.015 ± 1.813
1.68PheGln: 1.68 ± 0.853
3.695PheArg: 3.695 ± 1.23
4.703PheSer: 4.703 ± 1.373
5.039PheThr: 5.039 ± 1.382
5.039PheVal: 5.039 ± 1.946
1.008PheTrp: 1.008 ± 1.106
3.359PheTyr: 3.359 ± 3.022
0.0PheXaa: 0.0 ± 0.0
Gly
2.015GlyAla: 2.015 ± 0.578
1.344GlyCys: 1.344 ± 0.683
4.703GlyAsp: 4.703 ± 1.751
3.023GlyGlu: 3.023 ± 0.972
1.344GlyPhe: 1.344 ± 1.027
2.351GlyGly: 2.351 ± 0.687
1.008GlyHis: 1.008 ± 1.106
2.687GlyIle: 2.687 ± 1.365
3.359GlyLys: 3.359 ± 1.833
3.359GlyLeu: 3.359 ± 1.833
1.008GlyMet: 1.008 ± 0.512
3.023GlyAsn: 3.023 ± 0.96
0.672GlyPro: 0.672 ± 0.341
2.015GlyGln: 2.015 ± 0.578
1.68GlyArg: 1.68 ± 0.504
2.351GlySer: 2.351 ± 1.306
1.008GlyThr: 1.008 ± 0.512
5.039GlyVal: 5.039 ± 1.382
0.336GlyTrp: 0.336 ± 0.171
2.015GlyTyr: 2.015 ± 1.024
0.0GlyXaa: 0.0 ± 0.0
His
1.344HisAla: 1.344 ± 0.683
0.672HisCys: 0.672 ± 0.341
0.672HisAsp: 0.672 ± 0.341
1.008HisGlu: 1.008 ± 0.512
1.68HisPhe: 1.68 ± 0.504
0.0HisGly: 0.0 ± 0.0
0.672HisHis: 0.672 ± 0.604
2.015HisIle: 2.015 ± 0.898
1.344HisLys: 1.344 ± 0.683
3.359HisLeu: 3.359 ± 2.228
0.0HisMet: 0.0 ± 0.0
1.344HisAsn: 1.344 ± 0.483
1.008HisPro: 1.008 ± 0.512
0.672HisGln: 0.672 ± 1.525
2.015HisArg: 2.015 ± 0.578
3.023HisSer: 3.023 ± 2.572
2.015HisThr: 2.015 ± 0.946
2.015HisVal: 2.015 ± 1.024
0.672HisTrp: 0.672 ± 0.341
0.672HisTyr: 0.672 ± 0.604
0.0HisXaa: 0.0 ± 0.0
Ile
5.039IleAla: 5.039 ± 1.5
1.68IleCys: 1.68 ± 1.114
3.695IleAsp: 3.695 ± 1.877
2.687IleGlu: 2.687 ± 0.965
2.687IlePhe: 2.687 ± 1.908
3.359IleGly: 3.359 ± 1.111
1.008IleHis: 1.008 ± 0.512
1.344IleIle: 1.344 ± 0.683
4.031IleLys: 4.031 ± 1.426
3.695IleLeu: 3.695 ± 3.366
0.672IleMet: 0.672 ± 0.403
1.68IleAsn: 1.68 ± 0.853
4.031IlePro: 4.031 ± 2.108
0.336IleGln: 0.336 ± 1.317
2.351IleArg: 2.351 ± 0.687
6.718IleSer: 6.718 ± 0.544
4.703IleThr: 4.703 ± 2.832
6.718IleVal: 6.718 ± 2.017
0.0IleTrp: 0.0 ± 0.0
1.344IleTyr: 1.344 ± 1.204
0.0IleXaa: 0.0 ± 0.0
Lys
2.015LysAla: 2.015 ± 1.024
0.336LysCys: 0.336 ± 0.171
1.68LysAsp: 1.68 ± 0.504
3.695LysGlu: 3.695 ± 1.267
5.039LysPhe: 5.039 ± 1.916
1.344LysGly: 1.344 ± 1.027
1.344LysHis: 1.344 ± 0.483
3.023LysIle: 3.023 ± 0.972
1.344LysLys: 1.344 ± 1.027
7.054LysLeu: 7.054 ± 2.06
1.68LysMet: 1.68 ± 0.853
5.039LysAsn: 5.039 ± 1.672
3.023LysPro: 3.023 ± 1.536
2.687LysGln: 2.687 ± 0.63
5.039LysArg: 5.039 ± 1.926
3.023LysSer: 3.023 ± 0.529
2.687LysThr: 2.687 ± 0.63
3.359LysVal: 3.359 ± 1.008
0.336LysTrp: 0.336 ± 1.317
1.344LysTyr: 1.344 ± 0.683
0.0LysXaa: 0.0 ± 0.0
Leu
5.71LeuAla: 5.71 ± 1.406
1.008LeuCys: 1.008 ± 0.512
4.031LeuAsp: 4.031 ± 1.346
5.71LeuGlu: 5.71 ± 1.645
5.71LeuPhe: 5.71 ± 1.645
3.695LeuGly: 3.695 ± 3.366
3.023LeuHis: 3.023 ± 0.529
6.046LeuIle: 6.046 ± 1.929
5.039LeuLys: 5.039 ± 1.5
8.062LeuLeu: 8.062 ± 1.693
2.351LeuMet: 2.351 ± 1.195
4.031LeuAsn: 4.031 ± 1.635
5.039LeuPro: 5.039 ± 1.672
3.023LeuGln: 3.023 ± 0.529
5.71LeuArg: 5.71 ± 2.248
8.062LeuSer: 8.062 ± 0.484
8.398LeuThr: 8.398 ± 4.066
4.703LeuVal: 4.703 ± 1.514
0.672LeuTrp: 0.672 ± 1.204
3.023LeuTyr: 3.023 ± 0.964
0.0LeuXaa: 0.0 ± 0.0
Met
0.336MetAla: 0.336 ± 0.171
1.008MetCys: 1.008 ± 0.512
0.336MetAsp: 0.336 ± 0.171
0.336MetGlu: 0.336 ± 0.171
1.008MetPhe: 1.008 ± 0.512
0.672MetGly: 0.672 ± 0.341
1.344MetHis: 1.344 ± 0.683
2.015MetIle: 2.015 ± 1.024
1.008MetLys: 1.008 ± 0.512
3.359MetLeu: 3.359 ± 0.47
0.0MetMet: 0.0 ± 0.0
1.344MetAsn: 1.344 ± 0.483
1.344MetPro: 1.344 ± 0.683
0.0MetGln: 0.0 ± 0.0
2.015MetArg: 2.015 ± 1.024
3.359MetSer: 3.359 ± 0.47
1.344MetThr: 1.344 ± 1.209
0.336MetVal: 0.336 ± 0.171
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.359AsnAla: 3.359 ± 0.47
1.68AsnCys: 1.68 ± 0.504
2.351AsnAsp: 2.351 ± 0.988
2.015AsnGlu: 2.015 ± 1.024
3.023AsnPhe: 3.023 ± 0.972
3.359AsnGly: 3.359 ± 0.47
2.015AsnHis: 2.015 ± 0.578
3.359AsnIle: 3.359 ± 1.707
1.68AsnLys: 1.68 ± 0.504
4.367AsnLeu: 4.367 ± 1.959
1.344AsnMet: 1.344 ± 0.679
2.687AsnAsn: 2.687 ± 0.965
2.351AsnPro: 2.351 ± 0.687
2.687AsnGln: 2.687 ± 1.908
2.015AsnArg: 2.015 ± 0.946
5.039AsnSer: 5.039 ± 3.643
4.031AsnThr: 4.031 ± 1.426
3.695AsnVal: 3.695 ± 1.267
0.0AsnTrp: 0.0 ± 0.0
3.359AsnTyr: 3.359 ± 1.707
0.0AsnXaa: 0.0 ± 0.0
Pro
2.351ProAla: 2.351 ± 3.508
0.672ProCys: 0.672 ± 0.341
2.351ProAsp: 2.351 ± 0.95
0.672ProGlu: 0.672 ± 0.604
2.687ProPhe: 2.687 ± 0.965
3.359ProGly: 3.359 ± 1.128
1.344ProHis: 1.344 ± 0.483
5.375ProIle: 5.375 ± 1.931
5.71ProLys: 5.71 ± 3.313
5.039ProLeu: 5.039 ± 1.79
1.008ProMet: 1.008 ± 0.512
1.008ProAsn: 1.008 ± 0.512
1.008ProPro: 1.008 ± 0.512
2.351ProGln: 2.351 ± 0.95
1.68ProArg: 1.68 ± 0.504
2.351ProSer: 2.351 ± 1.716
3.359ProThr: 3.359 ± 0.47
3.023ProVal: 3.023 ± 0.96
0.336ProTrp: 0.336 ± 0.171
1.68ProTyr: 1.68 ± 1.114
0.0ProXaa: 0.0 ± 0.0
Gln
1.68GlnAla: 1.68 ± 2.306
0.672GlnCys: 0.672 ± 1.441
0.672GlnAsp: 0.672 ± 0.341
2.015GlnGlu: 2.015 ± 0.578
1.344GlnPhe: 1.344 ± 0.483
2.015GlnGly: 2.015 ± 0.578
0.336GlnHis: 0.336 ± 0.171
0.336GlnIle: 0.336 ± 0.171
4.031GlnLys: 4.031 ± 0.453
5.039GlnLeu: 5.039 ± 0.063
0.672GlnMet: 0.672 ± 0.341
3.023GlnAsn: 3.023 ± 3.318
0.672GlnPro: 0.672 ± 0.604
1.008GlnGln: 1.008 ± 0.512
2.687GlnArg: 2.687 ± 0.817
1.68GlnSer: 1.68 ± 0.853
2.015GlnThr: 2.015 ± 0.946
0.336GlnVal: 0.336 ± 0.171
0.0GlnTrp: 0.0 ± 0.0
1.344GlnTyr: 1.344 ± 0.683
0.0GlnXaa: 0.0 ± 0.0
Arg
3.359ArgAla: 3.359 ± 1.128
1.008ArgCys: 1.008 ± 0.512
3.359ArgAsp: 3.359 ± 1.008
4.367ArgGlu: 4.367 ± 2.219
3.023ArgPhe: 3.023 ± 1.045
0.672ArgGly: 0.672 ± 0.341
1.68ArgHis: 1.68 ± 0.504
3.695ArgIle: 3.695 ± 0.469
2.687ArgLys: 2.687 ± 0.965
3.023ArgLeu: 3.023 ± 1.536
1.68ArgMet: 1.68 ± 0.853
3.695ArgAsn: 3.695 ± 1.071
2.687ArgPro: 2.687 ± 3.41
1.008ArgGln: 1.008 ± 0.52
3.359ArgArg: 3.359 ± 1.128
3.023ArgSer: 3.023 ± 0.529
3.359ArgThr: 3.359 ± 0.47
6.046ArgVal: 6.046 ± 1.418
0.0ArgTrp: 0.0 ± 0.0
1.68ArgTyr: 1.68 ± 0.504
0.0ArgXaa: 0.0 ± 0.0
Ser
6.382SerAla: 6.382 ± 0.986
1.344SerCys: 1.344 ± 1.209
3.359SerAsp: 3.359 ± 1.833
4.703SerGlu: 4.703 ± 0.959
6.382SerPhe: 6.382 ± 3.058
4.703SerGly: 4.703 ± 1.607
1.344SerHis: 1.344 ± 0.483
5.039SerIle: 5.039 ± 0.818
3.359SerLys: 3.359 ± 1.707
5.71SerLeu: 5.71 ± 0.588
1.344SerMet: 1.344 ± 0.683
4.031SerAsn: 4.031 ± 1.155
3.695SerPro: 3.695 ± 1.728
3.359SerGln: 3.359 ± 0.794
3.695SerArg: 3.695 ± 3.147
5.039SerSer: 5.039 ± 2.741
5.039SerThr: 5.039 ± 0.818
7.054SerVal: 7.054 ± 2.271
0.672SerTrp: 0.672 ± 0.341
3.695SerTyr: 3.695 ± 2.148
0.0SerXaa: 0.0 ± 0.0
Thr
4.031ThrAla: 4.031 ± 1.892
2.015ThrCys: 2.015 ± 1.024
2.687ThrAsp: 2.687 ± 1.365
4.031ThrGlu: 4.031 ± 1.426
3.359ThrPhe: 3.359 ± 0.47
2.015ThrGly: 2.015 ± 1.024
2.687ThrHis: 2.687 ± 1.908
2.015ThrIle: 2.015 ± 0.946
3.359ThrLys: 3.359 ± 1.128
4.703ThrLeu: 4.703 ± 0.959
1.008ThrMet: 1.008 ± 0.512
4.367ThrAsn: 4.367 ± 2.782
5.039ThrPro: 5.039 ± 1.79
2.687ThrGln: 2.687 ± 1.365
3.023ThrArg: 3.023 ± 0.972
6.382ThrSer: 6.382 ± 1.758
4.031ThrThr: 4.031 ± 1.635
5.039ThrVal: 5.039 ± 1.382
0.0ThrTrp: 0.0 ± 0.0
3.359ThrTyr: 3.359 ± 0.794
0.0ThrXaa: 0.0 ± 0.0
Val
4.703ValAla: 4.703 ± 1.49
2.351ValCys: 2.351 ± 2.197
3.359ValAsp: 3.359 ± 1.111
6.382ValGlu: 6.382 ± 2.582
5.375ValPhe: 5.375 ± 0.692
3.023ValGly: 3.023 ± 2.794
2.015ValHis: 2.015 ± 1.476
4.367ValIle: 4.367 ± 1.108
4.367ValLys: 4.367 ± 1.588
5.375ValLeu: 5.375 ± 1.414
1.68ValMet: 1.68 ± 1.048
3.023ValAsn: 3.023 ± 1.045
4.703ValPro: 4.703 ± 1.373
2.687ValGln: 2.687 ± 0.984
4.031ValArg: 4.031 ± 0.527
7.054ValSer: 7.054 ± 0.836
3.695ValThr: 3.695 ± 1.23
7.39ValVal: 7.39 ± 3.457
0.0ValTrp: 0.0 ± 0.0
3.023ValTyr: 3.023 ± 1.559
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.672TrpAsp: 0.672 ± 0.341
0.336TrpGlu: 0.336 ± 0.171
1.008TrpPhe: 1.008 ± 1.319
0.0TrpGly: 0.0 ± 0.0
0.336TrpHis: 0.336 ± 0.171
0.336TrpIle: 0.336 ± 1.317
0.336TrpLys: 0.336 ± 0.171
0.336TrpLeu: 0.336 ± 0.171
0.0TrpMet: 0.0 ± 0.0
0.336TrpAsn: 0.336 ± 0.171
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.336TrpArg: 0.336 ± 0.171
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.672TrpVal: 0.672 ± 2.633
0.0TrpTrp: 0.0 ± 0.0
0.336TrpTyr: 0.336 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.008TyrAla: 1.008 ± 0.52
0.672TyrCys: 0.672 ± 0.341
2.687TyrAsp: 2.687 ± 0.817
3.359TyrGlu: 3.359 ± 0.794
3.023TyrPhe: 3.023 ± 1.947
2.687TyrGly: 2.687 ± 0.817
1.008TyrHis: 1.008 ± 0.512
1.68TyrIle: 1.68 ± 1.921
1.68TyrLys: 1.68 ± 0.853
1.68TyrLeu: 1.68 ± 0.504
1.008TyrMet: 1.008 ± 1.319
3.023TyrAsn: 3.023 ± 0.972
0.336TyrPro: 0.336 ± 0.171
0.672TyrGln: 0.672 ± 0.604
2.015TyrArg: 2.015 ± 0.578
2.351TyrSer: 2.351 ± 0.988
4.031TyrThr: 4.031 ± 1.155
3.359TyrVal: 3.359 ± 0.47
0.0TyrTrp: 0.0 ± 0.0
1.008TyrTyr: 1.008 ± 0.512
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2978 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski