Amino acid dipepetide frequency for Catenovulum agarivorans DS-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.493AlaAla: 7.493 ± 0.106
0.988AlaCys: 0.988 ± 0.029
5.454AlaAsp: 5.454 ± 0.069
5.963AlaGlu: 5.963 ± 0.075
3.352AlaPhe: 3.352 ± 0.052
5.728AlaGly: 5.728 ± 0.081
1.737AlaHis: 1.737 ± 0.035
5.778AlaIle: 5.778 ± 0.07
5.331AlaLys: 5.331 ± 0.076
8.741AlaLeu: 8.741 ± 0.104
2.023AlaMet: 2.023 ± 0.045
4.533AlaAsn: 4.533 ± 0.068
2.635AlaPro: 2.635 ± 0.056
4.727AlaGln: 4.727 ± 0.07
3.264AlaArg: 3.264 ± 0.055
5.32AlaSer: 5.32 ± 0.065
4.394AlaThr: 4.394 ± 0.064
5.622AlaVal: 5.622 ± 0.075
0.965AlaTrp: 0.965 ± 0.027
2.656AlaTyr: 2.656 ± 0.047
0.0AlaXaa: 0.0 ± 0.0
Cys
0.801CysAla: 0.801 ± 0.022
0.164CysCys: 0.164 ± 0.011
0.515CysAsp: 0.515 ± 0.019
0.533CysGlu: 0.533 ± 0.019
0.45CysPhe: 0.45 ± 0.019
0.789CysGly: 0.789 ± 0.027
0.301CysHis: 0.301 ± 0.019
0.605CysIle: 0.605 ± 0.023
0.461CysLys: 0.461 ± 0.017
1.012CysLeu: 1.012 ± 0.027
0.184CysMet: 0.184 ± 0.012
0.422CysAsn: 0.422 ± 0.021
0.392CysPro: 0.392 ± 0.02
0.558CysGln: 0.558 ± 0.021
0.416CysArg: 0.416 ± 0.02
0.651CysSer: 0.651 ± 0.022
0.438CysThr: 0.438 ± 0.019
0.612CysVal: 0.612 ± 0.019
0.117CysTrp: 0.117 ± 0.01
0.348CysTyr: 0.348 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
4.398AspAla: 4.398 ± 0.058
0.544AspCys: 0.544 ± 0.02
3.26AspAsp: 3.26 ± 0.075
3.666AspGlu: 3.666 ± 0.065
2.79AspPhe: 2.79 ± 0.044
4.003AspGly: 4.003 ± 0.078
0.981AspHis: 0.981 ± 0.032
4.333AspIle: 4.333 ± 0.064
3.755AspLys: 3.755 ± 0.056
5.347AspLeu: 5.347 ± 0.071
1.387AspMet: 1.387 ± 0.035
3.052AspAsn: 3.052 ± 0.054
2.009AspPro: 2.009 ± 0.051
2.04AspGln: 2.04 ± 0.037
1.848AspArg: 1.848 ± 0.039
3.621AspSer: 3.621 ± 0.065
2.828AspThr: 2.828 ± 0.055
3.808AspVal: 3.808 ± 0.065
0.966AspTrp: 0.966 ± 0.025
2.302AspTyr: 2.302 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
4.406GluAla: 4.406 ± 0.062
0.445GluCys: 0.445 ± 0.019
2.682GluAsp: 2.682 ± 0.053
3.007GluGlu: 3.007 ± 0.058
2.721GluPhe: 2.721 ± 0.048
2.884GluGly: 2.884 ± 0.056
1.636GluHis: 1.636 ± 0.041
3.845GluIle: 3.845 ± 0.057
3.535GluLys: 3.535 ± 0.051
6.945GluLeu: 6.945 ± 0.074
1.409GluMet: 1.409 ± 0.036
2.944GluAsn: 2.944 ± 0.052
1.907GluPro: 1.907 ± 0.063
4.785GluGln: 4.785 ± 0.078
2.635GluArg: 2.635 ± 0.053
3.39GluSer: 3.39 ± 0.053
2.87GluThr: 2.87 ± 0.051
3.9GluVal: 3.9 ± 0.062
0.71GluTrp: 0.71 ± 0.027
2.033GluTyr: 2.033 ± 0.043
0.0GluXaa: 0.0 ± 0.0
Phe
4.049PheAla: 4.049 ± 0.05
0.474PheCys: 0.474 ± 0.019
3.146PheAsp: 3.146 ± 0.052
2.751PheGlu: 2.751 ± 0.04
1.66PhePhe: 1.66 ± 0.048
2.892PheGly: 2.892 ± 0.048
0.726PheHis: 0.726 ± 0.026
2.842PheIle: 2.842 ± 0.049
2.406PheLys: 2.406 ± 0.05
3.123PheLeu: 3.123 ± 0.054
0.902PheMet: 0.902 ± 0.025
2.519PheAsn: 2.519 ± 0.041
1.33PhePro: 1.33 ± 0.035
1.268PheGln: 1.268 ± 0.027
1.42PheArg: 1.42 ± 0.032
3.478PheSer: 3.478 ± 0.054
2.444PheThr: 2.444 ± 0.042
2.995PheVal: 2.995 ± 0.049
0.568PheTrp: 0.568 ± 0.021
1.542PheTyr: 1.542 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
4.651GlyAla: 4.651 ± 0.071
0.815GlyCys: 0.815 ± 0.025
3.631GlyAsp: 3.631 ± 0.066
3.983GlyGlu: 3.983 ± 0.062
3.2GlyPhe: 3.2 ± 0.056
4.326GlyGly: 4.326 ± 0.074
1.445GlyHis: 1.445 ± 0.035
4.288GlyIle: 4.288 ± 0.066
3.889GlyLys: 3.889 ± 0.058
6.32GlyLeu: 6.32 ± 0.079
1.527GlyMet: 1.527 ± 0.033
2.781GlyAsn: 2.781 ± 0.063
1.506GlyPro: 1.506 ± 0.04
3.079GlyGln: 3.079 ± 0.049
2.678GlyArg: 2.678 ± 0.054
3.781GlySer: 3.781 ± 0.061
3.17GlyThr: 3.17 ± 0.063
4.514GlyVal: 4.514 ± 0.063
0.968GlyTrp: 0.968 ± 0.029
2.608GlyTyr: 2.608 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
1.692HisAla: 1.692 ± 0.043
0.286HisCys: 0.286 ± 0.013
1.084HisAsp: 1.084 ± 0.031
1.043HisGlu: 1.043 ± 0.029
1.077HisPhe: 1.077 ± 0.032
1.393HisGly: 1.393 ± 0.039
0.582HisHis: 0.582 ± 0.023
1.535HisIle: 1.535 ± 0.032
1.408HisLys: 1.408 ± 0.037
2.141HisLeu: 2.141 ± 0.041
0.512HisMet: 0.512 ± 0.021
1.145HisAsn: 1.145 ± 0.037
1.111HisPro: 1.111 ± 0.03
1.345HisGln: 1.345 ± 0.034
0.846HisArg: 0.846 ± 0.024
1.461HisSer: 1.461 ± 0.033
1.091HisThr: 1.091 ± 0.03
1.22HisVal: 1.22 ± 0.03
0.403HisTrp: 0.403 ± 0.018
0.923HisTyr: 0.923 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
6.101IleAla: 6.101 ± 0.086
0.704IleCys: 0.704 ± 0.024
4.543IleAsp: 4.543 ± 0.058
5.049IleGlu: 5.049 ± 0.055
2.288IlePhe: 2.288 ± 0.046
4.352IleGly: 4.352 ± 0.069
1.325IleHis: 1.325 ± 0.035
3.454IleIle: 3.454 ± 0.059
3.938IleLys: 3.938 ± 0.048
5.119IleLeu: 5.119 ± 0.073
1.137IleMet: 1.137 ± 0.033
3.532IleAsn: 3.532 ± 0.055
2.346IlePro: 2.346 ± 0.045
2.793IleGln: 2.793 ± 0.048
2.678IleArg: 2.678 ± 0.052
4.52IleSer: 4.52 ± 0.068
3.593IleThr: 3.593 ± 0.054
4.126IleVal: 4.126 ± 0.062
0.719IleTrp: 0.719 ± 0.023
1.953IleTyr: 1.953 ± 0.042
0.0IleXaa: 0.0 ± 0.0
Lys
4.863LysAla: 4.863 ± 0.069
0.369LysCys: 0.369 ± 0.021
2.721LysAsp: 2.721 ± 0.046
2.694LysGlu: 2.694 ± 0.05
2.156LysPhe: 2.156 ± 0.042
3.094LysGly: 3.094 ± 0.05
1.548LysHis: 1.548 ± 0.032
3.49LysIle: 3.49 ± 0.048
3.051LysLys: 3.051 ± 0.065
6.442LysLeu: 6.442 ± 0.082
1.277LysMet: 1.277 ± 0.03
2.72LysAsn: 2.72 ± 0.049
2.554LysPro: 2.554 ± 0.045
4.034LysGln: 4.034 ± 0.069
2.597LysArg: 2.597 ± 0.051
3.443LysSer: 3.443 ± 0.057
3.083LysThr: 3.083 ± 0.051
3.977LysVal: 3.977 ± 0.064
0.698LysTrp: 0.698 ± 0.024
1.766LysTyr: 1.766 ± 0.035
0.0LysXaa: 0.0 ± 0.0
Leu
10.31LeuAla: 10.31 ± 0.106
0.865LeuCys: 0.865 ± 0.027
5.758LeuAsp: 5.758 ± 0.071
5.428LeuGlu: 5.428 ± 0.074
4.014LeuPhe: 4.014 ± 0.067
5.836LeuGly: 5.836 ± 0.071
1.919LeuHis: 1.919 ± 0.041
6.089LeuIle: 6.089 ± 0.089
5.586LeuLys: 5.586 ± 0.066
9.567LeuLeu: 9.567 ± 0.118
2.256LeuMet: 2.256 ± 0.041
5.272LeuAsn: 5.272 ± 0.056
4.404LeuPro: 4.404 ± 0.056
4.357LeuGln: 4.357 ± 0.059
3.595LeuArg: 3.595 ± 0.057
7.352LeuSer: 7.352 ± 0.082
6.518LeuThr: 6.518 ± 0.077
6.809LeuVal: 6.809 ± 0.079
0.957LeuTrp: 0.957 ± 0.03
2.883LeuTyr: 2.883 ± 0.049
0.0LeuXaa: 0.0 ± 0.0
Met
2.054MetAla: 2.054 ± 0.037
0.174MetCys: 0.174 ± 0.012
1.057MetAsp: 1.057 ± 0.029
0.957MetGlu: 0.957 ± 0.03
0.838MetPhe: 0.838 ± 0.026
1.372MetGly: 1.372 ± 0.035
0.515MetHis: 0.515 ± 0.02
1.111MetIle: 1.111 ± 0.03
1.15MetLys: 1.15 ± 0.032
2.464MetLeu: 2.464 ± 0.044
0.526MetMet: 0.526 ± 0.02
0.995MetAsn: 0.995 ± 0.027
1.172MetPro: 1.172 ± 0.033
1.427MetGln: 1.427 ± 0.03
1.005MetArg: 1.005 ± 0.024
1.649MetSer: 1.649 ± 0.031
1.273MetThr: 1.273 ± 0.03
1.456MetVal: 1.456 ± 0.034
0.227MetTrp: 0.227 ± 0.015
0.562MetTyr: 0.562 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.865AsnAla: 3.865 ± 0.053
0.516AsnCys: 0.516 ± 0.021
2.477AsnAsp: 2.477 ± 0.055
2.467AsnGlu: 2.467 ± 0.044
2.067AsnPhe: 2.067 ± 0.045
3.414AsnGly: 3.414 ± 0.067
1.112AsnHis: 1.112 ± 0.027
3.357AsnIle: 3.357 ± 0.055
3.233AsnLys: 3.233 ± 0.052
4.968AsnLeu: 4.968 ± 0.066
1.066AsnMet: 1.066 ± 0.026
2.872AsnAsn: 2.872 ± 0.064
2.266AsnPro: 2.266 ± 0.044
3.224AsnGln: 3.224 ± 0.059
2.022AsnArg: 2.022 ± 0.039
3.186AsnSer: 3.186 ± 0.066
2.676AsnThr: 2.676 ± 0.055
2.779AsnVal: 2.779 ± 0.043
0.862AsnTrp: 0.862 ± 0.025
1.877AsnTyr: 1.877 ± 0.039
0.0AsnXaa: 0.0 ± 0.0
Pro
3.159ProAla: 3.159 ± 0.062
0.293ProCys: 0.293 ± 0.013
2.333ProAsp: 2.333 ± 0.044
2.95ProGlu: 2.95 ± 0.073
1.58ProPhe: 1.58 ± 0.034
1.942ProGly: 1.942 ± 0.044
0.873ProHis: 0.873 ± 0.025
2.485ProIle: 2.485 ± 0.046
1.933ProLys: 1.933 ± 0.042
3.412ProLeu: 3.412 ± 0.051
0.823ProMet: 0.823 ± 0.026
2.099ProAsn: 2.099 ± 0.038
1.054ProPro: 1.054 ± 0.028
1.857ProGln: 1.857 ± 0.038
1.151ProArg: 1.151 ± 0.032
2.426ProSer: 2.426 ± 0.049
2.211ProThr: 2.211 ± 0.046
2.838ProVal: 2.838 ± 0.039
0.512ProTrp: 0.512 ± 0.023
1.329ProTyr: 1.329 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
5.658GlnAla: 5.658 ± 0.08
0.382GlnCys: 0.382 ± 0.017
2.391GlnAsp: 2.391 ± 0.039
2.065GlnGlu: 2.065 ± 0.042
2.159GlnPhe: 2.159 ± 0.044
3.188GlnGly: 3.188 ± 0.049
1.503GlnHis: 1.503 ± 0.036
3.496GlnIle: 3.496 ± 0.049
2.519GlnLys: 2.519 ± 0.048
6.286GlnLeu: 6.286 ± 0.091
1.122GlnMet: 1.122 ± 0.032
2.579GlnAsn: 2.579 ± 0.049
2.091GlnPro: 2.091 ± 0.036
5.068GlnGln: 5.068 ± 0.103
2.04GlnArg: 2.04 ± 0.039
3.379GlnSer: 3.379 ± 0.054
3.236GlnThr: 3.236 ± 0.059
3.802GlnVal: 3.802 ± 0.061
0.694GlnTrp: 0.694 ± 0.027
1.774GlnTyr: 1.774 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
3.032ArgAla: 3.032 ± 0.046
0.342ArgCys: 0.342 ± 0.015
2.083ArgAsp: 2.083 ± 0.04
2.29ArgGlu: 2.29 ± 0.043
1.994ArgPhe: 1.994 ± 0.04
2.234ArgGly: 2.234 ± 0.043
0.913ArgHis: 0.913 ± 0.025
2.766ArgIle: 2.766 ± 0.046
2.266ArgLys: 2.266 ± 0.042
4.206ArgLeu: 4.206 ± 0.062
0.973ArgMet: 0.973 ± 0.025
1.86ArgAsn: 1.86 ± 0.042
1.483ArgPro: 1.483 ± 0.034
2.086ArgGln: 2.086 ± 0.047
1.844ArgArg: 1.844 ± 0.04
2.346ArgSer: 2.346 ± 0.048
1.886ArgThr: 1.886 ± 0.039
2.773ArgVal: 2.773 ± 0.048
0.552ArgTrp: 0.552 ± 0.021
1.624ArgTyr: 1.624 ± 0.033
0.0ArgXaa: 0.0 ± 0.0
Ser
5.892SerAla: 5.892 ± 0.078
0.668SerCys: 0.668 ± 0.023
3.665SerAsp: 3.665 ± 0.065
3.842SerGlu: 3.842 ± 0.057
2.978SerPhe: 2.978 ± 0.05
4.629SerGly: 4.629 ± 0.073
1.398SerHis: 1.398 ± 0.034
4.383SerIle: 4.383 ± 0.067
3.505SerLys: 3.505 ± 0.054
6.622SerLeu: 6.622 ± 0.072
1.431SerMet: 1.431 ± 0.03
3.078SerAsn: 3.078 ± 0.06
2.251SerPro: 2.251 ± 0.047
3.493SerGln: 3.493 ± 0.055
2.537SerArg: 2.537 ± 0.044
4.336SerSer: 4.336 ± 0.078
3.359SerThr: 3.359 ± 0.06
4.637SerVal: 4.637 ± 0.063
0.919SerTrp: 0.919 ± 0.03
2.228SerTyr: 2.228 ± 0.042
0.0SerXaa: 0.0 ± 0.0
Thr
4.811ThrAla: 4.811 ± 0.067
0.457ThrCys: 0.457 ± 0.018
3.449ThrAsp: 3.449 ± 0.068
3.213ThrGlu: 3.213 ± 0.044
2.165ThrPhe: 2.165 ± 0.044
4.114ThrGly: 4.114 ± 0.062
1.254ThrHis: 1.254 ± 0.028
3.447ThrIle: 3.447 ± 0.057
2.463ThrLys: 2.463 ± 0.049
5.478ThrLeu: 5.478 ± 0.078
1.006ThrMet: 1.006 ± 0.025
2.592ThrAsn: 2.592 ± 0.05
2.506ThrPro: 2.506 ± 0.041
2.872ThrGln: 2.872 ± 0.049
1.99ThrArg: 1.99 ± 0.041
3.624ThrSer: 3.624 ± 0.074
3.107ThrThr: 3.107 ± 0.059
3.629ThrVal: 3.629 ± 0.065
0.66ThrTrp: 0.66 ± 0.023
1.652ThrTyr: 1.652 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
6.038ValAla: 6.038 ± 0.078
0.742ValCys: 0.742 ± 0.026
4.278ValAsp: 4.278 ± 0.064
4.856ValGlu: 4.856 ± 0.067
2.771ValPhe: 2.771 ± 0.045
4.135ValGly: 4.135 ± 0.07
1.269ValHis: 1.269 ± 0.031
4.321ValIle: 4.321 ± 0.064
3.866ValLys: 3.866 ± 0.061
6.184ValLeu: 6.184 ± 0.085
1.477ValMet: 1.477 ± 0.038
3.335ValAsn: 3.335 ± 0.046
2.28ValPro: 2.28 ± 0.046
2.495ValGln: 2.495 ± 0.048
2.629ValArg: 2.629 ± 0.047
4.799ValSer: 4.799 ± 0.074
3.975ValThr: 3.975 ± 0.069
5.076ValVal: 5.076 ± 0.074
0.763ValTrp: 0.763 ± 0.024
2.133ValTyr: 2.133 ± 0.041
0.0ValXaa: 0.0 ± 0.0
Trp
0.845TrpAla: 0.845 ± 0.026
0.148TrpCys: 0.148 ± 0.011
0.644TrpAsp: 0.644 ± 0.024
0.417TrpGlu: 0.417 ± 0.018
0.657TrpPhe: 0.657 ± 0.021
0.762TrpGly: 0.762 ± 0.029
0.413TrpHis: 0.413 ± 0.016
0.667TrpIle: 0.667 ± 0.024
0.506TrpLys: 0.506 ± 0.023
1.782TrpLeu: 1.782 ± 0.045
0.303TrpMet: 0.303 ± 0.013
0.49TrpAsn: 0.49 ± 0.022
0.523TrpPro: 0.523 ± 0.022
1.26TrpGln: 1.26 ± 0.033
0.688TrpArg: 0.688 ± 0.025
0.804TrpSer: 0.804 ± 0.025
0.571TrpThr: 0.571 ± 0.024
0.876TrpVal: 0.876 ± 0.024
0.201TrpTrp: 0.201 ± 0.013
0.479TrpTyr: 0.479 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.588TyrAla: 2.588 ± 0.046
0.394TyrCys: 0.394 ± 0.017
1.768TyrAsp: 1.768 ± 0.04
1.516TyrGlu: 1.516 ± 0.035
1.616TyrPhe: 1.616 ± 0.03
2.073TyrGly: 2.073 ± 0.043
0.849TyrHis: 0.849 ± 0.03
1.978TyrIle: 1.978 ± 0.038
1.776TyrLys: 1.776 ± 0.039
3.618TyrLeu: 3.618 ± 0.056
0.64TyrMet: 0.64 ± 0.02
1.487TyrAsn: 1.487 ± 0.043
1.438TyrPro: 1.438 ± 0.033
2.663TyrGln: 2.663 ± 0.055
1.668TyrArg: 1.668 ± 0.035
2.285TyrSer: 2.285 ± 0.049
1.768TyrThr: 1.768 ± 0.037
1.972TyrVal: 1.972 ± 0.042
0.572TyrTrp: 0.572 ± 0.02
1.294TyrTyr: 1.294 ± 0.04
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3835 proteins (1380918 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski