Amino acid dipeptide frequency for Acaryochloris marina (strain MBIC 11017)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
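
As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences and reports each pair's share in per mille. It is a minimal sketch, not the method used to build this page: the function name `dipeptide_frequencies`, the one-letter sequence encoding, the toy sequences, and the choice to count pairs only within a protein (never across protein boundaries) are all assumptions.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein
    only, so no pair spans the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping adjacent residue pair
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (made-up sequences, not from this proteome)
toy = ["MKLLA", "MALL"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The table below uses three-letter codes (for example LeuLeu rather than LL); mapping between the two notations is left out of the sketch.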

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
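
The uniform baseline can be checked with a few lines of arithmetic. The sketch below derives the expected per mille value for 400 equally likely dipeptides and labels a handful of values taken from the table as over- or underrepresented. The exact rule the page uses for its colouring is not stated, so the plain greater-than or less-than comparison in the hypothetical `classify` helper is an assumption.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa

# 1 / 400 of all dipeptides, expressed in per mille: 2.5 (equal to 0.25 %)
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table on this page (per mille)
examples = {"LeuLeu": 10.981, "AlaAla": 7.473, "CysCys": 0.205, "TrpTrp": 0.248}
labels = {name: classify(value) for name, value in examples.items()}
for name, label in labels.items():
    print(name, examples[name], label)
```

With 21 symbols instead of 20 the baseline drops to 1000 / 441, roughly 2.27‰.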

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
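
For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are hypothetical, and the example value is the LeuLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


leu_leu = 10.981  # LeuLeu value from the table, in per mille
fraction = permille_to_fraction(leu_leu)  # roughly 0.010981
percent = permille_to_percent(leu_leu)    # roughly 1.0981
print(fraction, percent)
```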

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.473 ± 0.073
AlaCys: 0.956 ± 0.022
AlaAsp: 4.698 ± 0.065
AlaGlu: 5.269 ± 0.055
AlaPhe: 3.093 ± 0.033
AlaGly: 5.371 ± 0.07
AlaHis: 1.63 ± 0.03
AlaIle: 5.994 ± 0.06
AlaLys: 3.898 ± 0.045
AlaLeu: 9.128 ± 0.086
AlaMet: 1.881 ± 0.027
AlaAsn: 3.302 ± 0.064
AlaPro: 3.467 ± 0.055
AlaGln: 4.966 ± 0.056
AlaArg: 3.564 ± 0.042
AlaSer: 4.982 ± 0.055
AlaThr: 4.703 ± 0.063
AlaVal: 5.319 ± 0.051
AlaTrp: 1.134 ± 0.025
AlaTyr: 2.354 ± 0.035
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.652 ± 0.019
CysCys: 0.205 ± 0.01
CysAsp: 0.705 ± 0.018
CysGlu: 0.537 ± 0.015
CysPhe: 0.478 ± 0.013
CysGly: 0.827 ± 0.024
CysHis: 0.374 ± 0.012
CysIle: 0.591 ± 0.017
CysLys: 0.384 ± 0.013
CysLeu: 1.318 ± 0.029
CysMet: 0.224 ± 0.011
CysAsn: 0.383 ± 0.013
CysPro: 0.618 ± 0.017
CysGln: 0.74 ± 0.018
CysArg: 0.634 ± 0.018
CysSer: 0.757 ± 0.021
CysThr: 0.488 ± 0.015
CysVal: 0.566 ± 0.017
CysTrp: 0.208 ± 0.01
CysTyr: 0.33 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.124 ± 0.054
AspCys: 0.668 ± 0.018
AspAsp: 2.6 ± 0.046
AspGlu: 2.847 ± 0.044
AspPhe: 2.39 ± 0.038
AspGly: 3.554 ± 0.068
AspHis: 1.258 ± 0.028
AspIle: 3.394 ± 0.043
AspLys: 1.798 ± 0.028
AspLeu: 6.5 ± 0.058
AspMet: 0.894 ± 0.021
AspAsn: 1.771 ± 0.033
AspPro: 2.957 ± 0.052
AspGln: 3.249 ± 0.036
AspArg: 3.169 ± 0.044
AspSer: 2.993 ± 0.043
AspThr: 2.638 ± 0.053
AspVal: 3.15 ± 0.047
AspTrp: 1.008 ± 0.021
AspTyr: 1.85 ± 0.03
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.288 ± 0.049
GluCys: 0.465 ± 0.017
GluAsp: 2.895 ± 0.039
GluGlu: 3.361 ± 0.044
GluPhe: 2.201 ± 0.036
GluGly: 3.45 ± 0.048
GluHis: 1.219 ± 0.022
GluIle: 3.809 ± 0.038
GluLys: 2.619 ± 0.04
GluLeu: 6.422 ± 0.063
GluMet: 1.31 ± 0.025
GluAsn: 1.997 ± 0.033
GluPro: 2.384 ± 0.038
GluGln: 3.863 ± 0.055
GluArg: 3.219 ± 0.046
GluSer: 3.413 ± 0.042
GluThr: 3.506 ± 0.045
GluVal: 4.012 ± 0.05
GluTrp: 0.819 ± 0.019
GluTyr: 1.564 ± 0.033
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.108 ± 0.043
PheCys: 0.583 ± 0.017
PheAsp: 2.229 ± 0.034
PheGlu: 2.302 ± 0.034
PhePhe: 1.637 ± 0.03
PheGly: 2.932 ± 0.039
PheHis: 0.827 ± 0.022
PheIle: 2.166 ± 0.04
PheLys: 1.537 ± 0.026
PheLeu: 4.019 ± 0.046
PheMet: 0.751 ± 0.017
PheAsn: 1.641 ± 0.034
PhePro: 1.752 ± 0.027
PheGln: 1.969 ± 0.028
PheArg: 1.897 ± 0.029
PheSer: 3.069 ± 0.04
PheThr: 2.197 ± 0.034
PheVal: 2.37 ± 0.036
PheTrp: 0.702 ± 0.018
PheTyr: 1.241 ± 0.027
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.909 ± 0.068
GlyCys: 0.877 ± 0.021
GlyAsp: 3.56 ± 0.083
GlyGlu: 3.647 ± 0.048
GlyPhe: 3.043 ± 0.043
GlyGly: 4.81 ± 0.117
GlyHis: 1.608 ± 0.04
GlyIle: 4.648 ± 0.053
GlyLys: 3.305 ± 0.046
GlyLeu: 7.312 ± 0.069
GlyMet: 1.599 ± 0.03
GlyAsn: 2.85 ± 0.076
GlyPro: 2.19 ± 0.037
GlyGln: 3.879 ± 0.049
GlyArg: 3.24 ± 0.046
GlySer: 4.262 ± 0.059
GlyThr: 3.916 ± 0.072
GlyVal: 4.499 ± 0.05
GlyTrp: 1.209 ± 0.025
GlyTyr: 2.308 ± 0.038
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.4 ± 0.027
HisCys: 0.362 ± 0.013
HisAsp: 0.926 ± 0.024
HisGlu: 1.017 ± 0.021
HisPhe: 0.939 ± 0.022
HisGly: 1.262 ± 0.023
HisHis: 0.866 ± 0.025
HisIle: 1.187 ± 0.024
HisLys: 0.774 ± 0.02
HisLeu: 2.838 ± 0.044
HisMet: 0.347 ± 0.013
HisAsn: 0.719 ± 0.019
HisPro: 1.696 ± 0.03
HisGln: 1.692 ± 0.035
HisArg: 1.425 ± 0.027
HisSer: 1.328 ± 0.025
HisThr: 1.133 ± 0.029
HisVal: 1.038 ± 0.024
HisTrp: 0.458 ± 0.014
HisTyr: 0.768 ± 0.018
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.103 ± 0.056
IleCys: 0.753 ± 0.02
IleAsp: 3.595 ± 0.049
IleGlu: 3.905 ± 0.042
IlePhe: 2.306 ± 0.04
IleGly: 4.199 ± 0.05
IleHis: 1.451 ± 0.029
IleIle: 2.888 ± 0.042
IleLys: 2.447 ± 0.042
IleLeu: 6.054 ± 0.058
IleMet: 0.846 ± 0.022
IleAsn: 2.399 ± 0.041
IlePro: 3.323 ± 0.047
IleGln: 3.288 ± 0.041
IleArg: 3.052 ± 0.038
IleSer: 4.202 ± 0.047
IleThr: 3.468 ± 0.051
IleVal: 3.689 ± 0.043
IleTrp: 0.832 ± 0.021
IleTyr: 1.697 ± 0.031
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.019 ± 0.051
LysCys: 0.294 ± 0.012
LysAsp: 2.116 ± 0.033
LysGlu: 2.198 ± 0.036
LysPhe: 1.411 ± 0.027
LysGly: 2.717 ± 0.04
LysHis: 0.858 ± 0.02
LysIle: 2.659 ± 0.044
LysLys: 2.081 ± 0.039
LysLeu: 4.56 ± 0.058
LysMet: 0.766 ± 0.02
LysAsn: 1.566 ± 0.029
LysPro: 2.402 ± 0.037
LysGln: 2.544 ± 0.036
LysArg: 2.435 ± 0.039
LysSer: 2.722 ± 0.037
LysThr: 2.906 ± 0.036
LysVal: 3.019 ± 0.042
LysTrp: 0.481 ± 0.017
LysTyr: 1.073 ± 0.024
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.78 ± 0.082
LeuCys: 1.155 ± 0.024
LeuAsp: 5.798 ± 0.058
LeuGlu: 6.939 ± 0.064
LeuPhe: 3.818 ± 0.05
LeuGly: 7.731 ± 0.068
LeuHis: 2.145 ± 0.034
LeuIle: 6.201 ± 0.066
LeuLys: 5.542 ± 0.059
LeuLeu: 10.981 ± 0.099
LeuMet: 2.415 ± 0.035
LeuAsn: 4.498 ± 0.051
LeuPro: 5.69 ± 0.054
LeuGln: 6.19 ± 0.068
LeuArg: 5.499 ± 0.052
LeuSer: 8.137 ± 0.074
LeuThr: 6.543 ± 0.055
LeuVal: 6.976 ± 0.063
LeuTrp: 1.577 ± 0.036
LeuTyr: 2.718 ± 0.037
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.035 ± 0.03
MetCys: 0.137 ± 0.008
MetAsp: 0.992 ± 0.022
MetGlu: 1.067 ± 0.023
MetPhe: 0.581 ± 0.014
MetGly: 1.602 ± 0.027
MetHis: 0.356 ± 0.013
MetIle: 1.162 ± 0.025
MetLys: 0.902 ± 0.021
MetLeu: 1.939 ± 0.036
MetMet: 0.523 ± 0.016
MetAsn: 0.794 ± 0.019
MetPro: 1.102 ± 0.023
MetGln: 1.062 ± 0.022
MetArg: 0.99 ± 0.022
MetSer: 1.416 ± 0.03
MetThr: 1.423 ± 0.025
MetVal: 1.508 ± 0.03
MetTrp: 0.163 ± 0.008
MetTyr: 0.35 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.853 ± 0.044
AsnCys: 0.397 ± 0.016
AsnAsp: 1.788 ± 0.06
AsnGlu: 1.572 ± 0.026
AsnPhe: 1.509 ± 0.028
AsnGly: 2.473 ± 0.063
AsnHis: 0.92 ± 0.018
AsnIle: 2.236 ± 0.04
AsnLys: 1.304 ± 0.028
AsnLeu: 4.642 ± 0.062
AsnMet: 0.592 ± 0.017
AsnAsn: 1.459 ± 0.04
AsnPro: 2.705 ± 0.036
AsnGln: 2.428 ± 0.031
AsnArg: 2.286 ± 0.032
AsnSer: 2.443 ± 0.05
AsnThr: 2.183 ± 0.05
AsnVal: 2.037 ± 0.04
AsnTrp: 0.633 ± 0.018
AsnTyr: 1.091 ± 0.023
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.697 ± 0.061
ProCys: 0.442 ± 0.017
ProAsp: 3.267 ± 0.062
ProGlu: 3.821 ± 0.056
ProPhe: 1.903 ± 0.034
ProGly: 3.205 ± 0.047
ProHis: 1.12 ± 0.024
ProIle: 3.177 ± 0.041
ProLys: 2.425 ± 0.039
ProLeu: 4.99 ± 0.052
ProMet: 0.978 ± 0.019
ProAsn: 2.02 ± 0.034
ProPro: 2.596 ± 0.043
ProGln: 3.0 ± 0.044
ProArg: 1.95 ± 0.03
ProSer: 3.55 ± 0.047
ProThr: 3.26 ± 0.051
ProVal: 3.266 ± 0.04
ProTrp: 0.669 ± 0.02
ProTyr: 1.317 ± 0.024
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.788 ± 0.078
GlnCys: 0.511 ± 0.017
GlnAsp: 2.733 ± 0.039
GlnGlu: 3.272 ± 0.048
GlnPhe: 2.13 ± 0.034
GlnGly: 3.907 ± 0.048
GlnHis: 1.286 ± 0.028
GlnIle: 3.477 ± 0.043
GlnLys: 2.492 ± 0.033
GlnLeu: 6.656 ± 0.075
GlnMet: 1.168 ± 0.023
GlnAsn: 1.897 ± 0.031
GlnPro: 3.201 ± 0.041
GlnGln: 4.356 ± 0.066
GlnArg: 3.448 ± 0.04
GlnSer: 3.884 ± 0.045
GlnThr: 3.793 ± 0.049
GlnVal: 4.26 ± 0.048
GlnTrp: 0.875 ± 0.023
GlnTyr: 1.418 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.421 ± 0.045
ArgCys: 0.638 ± 0.018
ArgAsp: 2.495 ± 0.034
ArgGlu: 2.752 ± 0.043
ArgPhe: 2.294 ± 0.033
ArgGly: 2.873 ± 0.042
ArgHis: 1.192 ± 0.027
ArgIle: 3.233 ± 0.039
ArgLys: 2.306 ± 0.039
ArgLeu: 6.132 ± 0.058
ArgMet: 1.14 ± 0.024
ArgAsn: 1.889 ± 0.026
ArgPro: 2.292 ± 0.034
ArgGln: 3.594 ± 0.051
ArgArg: 3.131 ± 0.048
ArgSer: 3.435 ± 0.045
ArgThr: 2.669 ± 0.04
ArgVal: 3.231 ± 0.042
ArgTrp: 0.952 ± 0.023
ArgTyr: 1.891 ± 0.029
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.925 ± 0.045
SerCys: 0.671 ± 0.02
SerAsp: 3.452 ± 0.045
SerGlu: 3.771 ± 0.044
SerPhe: 2.516 ± 0.032
SerGly: 4.929 ± 0.065
SerHis: 1.483 ± 0.03
SerIle: 3.912 ± 0.049
SerLys: 2.725 ± 0.038
SerLeu: 7.492 ± 0.062
SerMet: 1.407 ± 0.027
SerAsn: 2.49 ± 0.049
SerPro: 3.851 ± 0.057
SerGln: 3.888 ± 0.045
SerArg: 3.224 ± 0.042
SerSer: 4.794 ± 0.068
SerThr: 3.828 ± 0.057
SerVal: 4.114 ± 0.055
SerTrp: 0.954 ± 0.025
SerTyr: 1.696 ± 0.032
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.006 ± 0.058
ThrCys: 0.553 ± 0.016
ThrAsp: 3.011 ± 0.037
ThrGlu: 3.127 ± 0.042
ThrPhe: 2.244 ± 0.041
ThrGly: 4.107 ± 0.066
ThrHis: 1.302 ± 0.025
ThrIle: 3.446 ± 0.05
ThrLys: 2.017 ± 0.032
ThrLeu: 7.089 ± 0.07
ThrMet: 0.904 ± 0.022
ThrAsn: 2.006 ± 0.047
ThrPro: 3.557 ± 0.058
ThrGln: 3.381 ± 0.047
ThrArg: 2.383 ± 0.036
ThrSer: 3.608 ± 0.047
ThrThr: 3.537 ± 0.069
ThrVal: 4.18 ± 0.052
ThrTrp: 0.799 ± 0.021
ThrTyr: 1.776 ± 0.025
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.605 ± 0.051
ValCys: 0.732 ± 0.019
ValAsp: 3.72 ± 0.047
ValGlu: 4.071 ± 0.044
ValPhe: 2.487 ± 0.034
ValGly: 4.589 ± 0.054
ValHis: 1.217 ± 0.026
ValIle: 3.998 ± 0.043
ValLys: 2.726 ± 0.035
ValLeu: 6.954 ± 0.055
ValMet: 1.538 ± 0.029
ValAsn: 2.469 ± 0.041
ValPro: 2.809 ± 0.039
ValGln: 3.172 ± 0.037
ValArg: 3.079 ± 0.048
ValSer: 4.279 ± 0.054
ValThr: 3.73 ± 0.043
ValVal: 4.654 ± 0.052
ValTrp: 0.976 ± 0.023
ValTyr: 1.701 ± 0.03
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.952 ± 0.023
TrpCys: 0.183 ± 0.009
TrpAsp: 0.768 ± 0.024
TrpGlu: 0.764 ± 0.018
TrpPhe: 0.617 ± 0.019
TrpGly: 1.049 ± 0.021
TrpHis: 0.424 ± 0.014
TrpIle: 0.902 ± 0.019
TrpLys: 0.581 ± 0.015
TrpLeu: 1.914 ± 0.038
TrpMet: 0.397 ± 0.012
TrpAsn: 0.528 ± 0.018
TrpPro: 0.544 ± 0.015
TrpGln: 1.328 ± 0.03
TrpArg: 0.882 ± 0.021
TrpSer: 0.996 ± 0.026
TrpThr: 0.71 ± 0.018
TrpVal: 1.06 ± 0.023
TrpTrp: 0.248 ± 0.01
TrpTyr: 0.392 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.044 ± 0.032
TyrCys: 0.435 ± 0.014
TyrAsp: 1.469 ± 0.028
TyrGlu: 1.576 ± 0.029
TyrPhe: 1.333 ± 0.024
TyrGly: 2.085 ± 0.033
TyrHis: 0.644 ± 0.015
TyrIle: 1.415 ± 0.027
TyrLys: 0.961 ± 0.023
TyrLeu: 3.395 ± 0.046
TyrMet: 0.423 ± 0.014
TyrAsn: 0.874 ± 0.024
TyrPro: 1.516 ± 0.028
TyrGln: 1.96 ± 0.033
TyrArg: 2.051 ± 0.034
TyrSer: 1.852 ± 0.031
TyrThr: 1.403 ± 0.026
TyrVal: 1.559 ± 0.025
TyrTrp: 0.54 ± 0.017
TyrTyr: 0.887 ± 0.021
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8172 proteins (2253963 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
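
Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic for each resampled set. The sketch below illustrates the idea for one dipeptide. It is a minimal sketch under stated assumptions: the function names, the toy sequences, the fixed seed, and the use of the standard deviation of the replicates as the reported error are illustrative choices, not details confirmed by this page.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) whole proteins with replacement, so the
    resampling unit is the protein, not the individual residue or pair.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences in one-letter code (made up, not from this proteome)
toy = ["MKLLA", "MALL", "MLLKLL", "MAKA"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual pairs keeps neighbouring residues from the same protein together, which is presumably why the note stresses the protein level.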

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski