Amino acid dipeptide frequency for Sphingobium sp. TKS

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
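As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs in a list of protein sequences and compares each frequency with the uniform expectation of 1/400. It is not the database's own code: the function names, the one-letter input alphabet, the overlapping-window counting, and the mapping of every non-standard letter to X (Xaa) are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are counted within each protein, never across protein borders.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # would be shown in red
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # would be shown in blue
    return "expected"


if __name__ == "__main__":
    toy = ["MAALAG", "MKLAAR"]  # hypothetical toy sequences
    for pair, freq in sorted(dipeptide_permille(toy).items()):
        print(pair, round(freq, 1), classify(freq))
```

With the values from the table, `classify(17.892)` (AlaAla) gives "over" and `classify(0.115)` (CysCys) gives "under".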

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
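The unit conversion can be written out as a minimal sketch. The helper names are hypothetical and exist only for this example; the input value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0


ala_ala = 17.892  # AlaAla frequency in per mille, taken from the table
print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
print(permille_to_percent(ala_ala))   # the same value in percent
```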

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.892 ± 0.164
AlaCys: 1.107 ± 0.029
AlaAsp: 7.441 ± 0.069
AlaGlu: 7.347 ± 0.076
AlaPhe: 4.254 ± 0.05
AlaGly: 11.051 ± 0.164
AlaHis: 2.5 ± 0.039
AlaIle: 6.756 ± 0.062
AlaLys: 3.96 ± 0.062
AlaLeu: 13.896 ± 0.122
AlaMet: 3.945 ± 0.05
AlaAsn: 3.068 ± 0.05
AlaPro: 5.955 ± 0.074
AlaGln: 4.721 ± 0.073
AlaArg: 9.601 ± 0.09
AlaSer: 6.83 ± 0.089
AlaThr: 6.2 ± 0.077
AlaVal: 8.388 ± 0.077
AlaTrp: 1.584 ± 0.029
AlaTyr: 2.581 ± 0.037
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.072 ± 0.029
CysCys: 0.115 ± 0.008
CysAsp: 0.529 ± 0.018
CysGlu: 0.45 ± 0.016
CysPhe: 0.299 ± 0.014
CysGly: 0.926 ± 0.025
CysHis: 0.235 ± 0.016
CysIle: 0.385 ± 0.016
CysLys: 0.183 ± 0.011
CysLeu: 0.76 ± 0.021
CysMet: 0.168 ± 0.011
CysAsn: 0.19 ± 0.009
CysPro: 0.49 ± 0.025
CysGln: 0.225 ± 0.012
CysArg: 0.664 ± 0.021
CysSer: 0.481 ± 0.019
CysThr: 0.425 ± 0.016
CysVal: 0.575 ± 0.018
CysTrp: 0.148 ± 0.008
CysTyr: 0.165 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.385 ± 0.07
AspCys: 0.494 ± 0.019
AspAsp: 3.289 ± 0.048
AspGlu: 3.404 ± 0.055
AspPhe: 2.131 ± 0.036
AspGly: 5.502 ± 0.076
AspHis: 1.456 ± 0.038
AspIle: 3.04 ± 0.046
AspLys: 1.748 ± 0.034
AspLeu: 5.862 ± 0.064
AspMet: 1.527 ± 0.033
AspAsn: 1.348 ± 0.032
AspPro: 3.676 ± 0.059
AspGln: 1.911 ± 0.039
AspArg: 5.087 ± 0.063
AspSer: 2.477 ± 0.04
AspThr: 2.382 ± 0.043
AspVal: 3.916 ± 0.054
AspTrp: 1.083 ± 0.027
AspTyr: 1.692 ± 0.034
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.699 ± 0.082
GluCys: 0.354 ± 0.014
GluAsp: 2.754 ± 0.043
GluGlu: 3.148 ± 0.056
GluPhe: 1.465 ± 0.035
GluGly: 4.55 ± 0.052
GluHis: 1.176 ± 0.028
GluIle: 3.122 ± 0.05
GluLys: 2.128 ± 0.032
GluLeu: 5.005 ± 0.061
GluMet: 1.432 ± 0.029
GluAsn: 1.392 ± 0.028
GluPro: 2.493 ± 0.042
GluGln: 2.219 ± 0.036
GluArg: 5.195 ± 0.065
GluSer: 2.334 ± 0.034
GluThr: 3.043 ± 0.043
GluVal: 3.431 ± 0.055
GluTrp: 0.758 ± 0.023
GluTyr: 0.961 ± 0.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.722 ± 0.067
PheCys: 0.342 ± 0.014
PheAsp: 2.624 ± 0.04
PheGlu: 1.96 ± 0.038
PhePhe: 1.241 ± 0.03
PheGly: 3.467 ± 0.045
PheHis: 0.773 ± 0.022
PheIle: 1.489 ± 0.029
PheLys: 0.811 ± 0.026
PheLeu: 3.179 ± 0.048
PheMet: 0.753 ± 0.022
PheAsn: 0.947 ± 0.025
PhePro: 1.512 ± 0.028
PheGln: 0.936 ± 0.022
PheArg: 2.276 ± 0.036
PheSer: 2.148 ± 0.044
PheThr: 1.992 ± 0.034
PheVal: 2.376 ± 0.038
PheTrp: 0.516 ± 0.019
PheTyr: 0.878 ± 0.022
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.736 ± 0.15
GlyCys: 0.831 ± 0.023
GlyAsp: 4.832 ± 0.077
GlyGlu: 4.739 ± 0.05
GlyPhe: 3.575 ± 0.062
GlyGly: 8.309 ± 0.456
GlyHis: 1.931 ± 0.032
GlyIle: 4.519 ± 0.049
GlyLys: 3.299 ± 0.058
GlyLeu: 8.726 ± 0.082
GlyMet: 2.43 ± 0.038
GlyAsn: 2.283 ± 0.062
GlyPro: 3.419 ± 0.047
GlyGln: 3.193 ± 0.043
GlyArg: 6.659 ± 0.066
GlySer: 4.947 ± 0.118
GlyThr: 4.595 ± 0.077
GlyVal: 6.057 ± 0.072
GlyTrp: 1.563 ± 0.033
GlyTyr: 2.301 ± 0.043
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.571 ± 0.038
HisCys: 0.248 ± 0.012
HisAsp: 1.326 ± 0.029
HisGlu: 1.055 ± 0.026
HisPhe: 0.882 ± 0.024
HisGly: 2.083 ± 0.034
HisHis: 0.665 ± 0.022
HisIle: 1.036 ± 0.025
HisLys: 0.501 ± 0.017
HisLeu: 2.088 ± 0.042
HisMet: 0.526 ± 0.017
HisAsn: 0.462 ± 0.017
HisPro: 1.329 ± 0.033
HisGln: 0.603 ± 0.02
HisArg: 1.6 ± 0.034
HisSer: 1.056 ± 0.027
HisThr: 0.635 ± 0.02
HisVal: 1.67 ± 0.037
HisTrp: 0.39 ± 0.016
HisTyr: 0.652 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.751 ± 0.071
IleCys: 0.479 ± 0.016
IleAsp: 3.949 ± 0.049
IleGlu: 3.486 ± 0.053
IlePhe: 1.7 ± 0.036
IleGly: 5.051 ± 0.072
IleHis: 0.998 ± 0.026
IleIle: 2.317 ± 0.039
IleLys: 1.243 ± 0.028
IleLeu: 4.37 ± 0.059
IleMet: 1.024 ± 0.02
IleAsn: 1.325 ± 0.031
IlePro: 2.308 ± 0.036
IleGln: 1.28 ± 0.026
IleArg: 3.343 ± 0.045
IleSer: 2.897 ± 0.044
IleThr: 2.528 ± 0.046
IleVal: 4.112 ± 0.046
IleTrp: 0.621 ± 0.018
IleTyr: 1.033 ± 0.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.366 ± 0.063
LysCys: 0.158 ± 0.009
LysAsp: 1.665 ± 0.035
LysGlu: 1.354 ± 0.035
LysPhe: 0.791 ± 0.022
LysGly: 2.763 ± 0.044
LysHis: 0.527 ± 0.017
LysIle: 1.556 ± 0.033
LysLys: 1.058 ± 0.03
LysLeu: 3.182 ± 0.047
LysMet: 0.732 ± 0.022
LysAsn: 0.715 ± 0.021
LysPro: 1.953 ± 0.033
LysGln: 0.938 ± 0.025
LysArg: 2.255 ± 0.041
LysSer: 1.636 ± 0.034
LysThr: 1.737 ± 0.036
LysVal: 2.198 ± 0.041
LysTrp: 0.385 ± 0.015
LysTyr: 0.562 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.678 ± 0.12
LeuCys: 0.949 ± 0.027
LeuAsp: 6.108 ± 0.07
LeuGlu: 5.041 ± 0.055
LeuPhe: 3.719 ± 0.05
LeuGly: 8.258 ± 0.081
LeuHis: 2.028 ± 0.035
LeuIle: 5.04 ± 0.052
LeuLys: 3.168 ± 0.04
LeuLeu: 9.942 ± 0.108
LeuMet: 2.268 ± 0.04
LeuAsn: 2.507 ± 0.045
LeuPro: 5.645 ± 0.067
LeuGln: 2.587 ± 0.045
LeuArg: 7.231 ± 0.081
LeuSer: 6.488 ± 0.07
LeuThr: 5.563 ± 0.059
LeuVal: 6.776 ± 0.067
LeuTrp: 1.268 ± 0.033
LeuTyr: 1.975 ± 0.033
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.417 ± 0.052
MetCys: 0.157 ± 0.01
MetAsp: 1.196 ± 0.029
MetGlu: 1.184 ± 0.026
MetPhe: 0.664 ± 0.021
MetGly: 1.917 ± 0.037
MetHis: 0.452 ± 0.016
MetIle: 1.379 ± 0.026
MetLys: 0.934 ± 0.022
MetLeu: 2.775 ± 0.046
MetMet: 0.691 ± 0.024
MetAsn: 0.718 ± 0.025
MetPro: 1.491 ± 0.033
MetGln: 0.827 ± 0.022
MetArg: 1.886 ± 0.033
MetSer: 1.57 ± 0.031
MetThr: 1.832 ± 0.037
MetVal: 1.665 ± 0.032
MetTrp: 0.207 ± 0.012
MetTyr: 0.237 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.266 ± 0.057
AsnCys: 0.245 ± 0.012
AsnAsp: 1.457 ± 0.037
AsnGlu: 1.162 ± 0.03
AsnPhe: 0.892 ± 0.025
AsnGly: 2.56 ± 0.06
AsnHis: 0.487 ± 0.016
AsnIle: 1.329 ± 0.034
AsnLys: 0.659 ± 0.022
AsnLeu: 2.477 ± 0.044
AsnMet: 0.578 ± 0.018
AsnAsn: 0.676 ± 0.028
AsnPro: 1.702 ± 0.034
AsnGln: 0.781 ± 0.023
AsnArg: 1.889 ± 0.034
AsnSer: 1.316 ± 0.036
AsnThr: 0.917 ± 0.025
AsnVal: 1.886 ± 0.036
AsnTrp: 0.437 ± 0.015
AsnTyr: 0.681 ± 0.021
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.649 ± 0.072
ProCys: 0.348 ± 0.014
ProAsp: 3.71 ± 0.05
ProGlu: 3.393 ± 0.053
ProPhe: 1.882 ± 0.034
ProGly: 4.551 ± 0.05
ProHis: 1.092 ± 0.028
ProIle: 2.474 ± 0.038
ProLys: 1.552 ± 0.034
ProLeu: 4.86 ± 0.055
ProMet: 1.259 ± 0.03
ProAsn: 1.282 ± 0.03
ProPro: 2.691 ± 0.057
ProGln: 1.717 ± 0.032
ProArg: 3.036 ± 0.051
ProSer: 2.843 ± 0.035
ProThr: 2.551 ± 0.045
ProVal: 4.293 ± 0.06
ProTrp: 0.692 ± 0.024
ProTyr: 1.017 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.394 ± 0.052
GlnCys: 0.238 ± 0.012
GlnAsp: 1.562 ± 0.031
GlnGlu: 1.508 ± 0.03
GlnPhe: 1.078 ± 0.025
GlnGly: 2.702 ± 0.044
GlnHis: 0.639 ± 0.018
GlnIle: 1.915 ± 0.03
GlnLys: 0.999 ± 0.028
GlnLeu: 3.181 ± 0.045
GlnMet: 0.912 ± 0.023
GlnAsn: 0.827 ± 0.024
GlnPro: 1.84 ± 0.033
GlnGln: 1.24 ± 0.03
GlnArg: 2.639 ± 0.043
GlnSer: 1.974 ± 0.056
GlnThr: 1.653 ± 0.028
GlnVal: 2.206 ± 0.04
GlnTrp: 0.501 ± 0.017
GlnTyr: 0.617 ± 0.018
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.604 ± 0.084
ArgCys: 0.594 ± 0.019
ArgAsp: 4.369 ± 0.061
ArgGlu: 4.129 ± 0.061
ArgPhe: 3.076 ± 0.041
ArgGly: 4.974 ± 0.062
ArgHis: 2.009 ± 0.035
ArgIle: 4.394 ± 0.05
ArgLys: 2.335 ± 0.042
ArgLeu: 8.5 ± 0.088
ArgMet: 2.01 ± 0.043
ArgAsn: 1.989 ± 0.034
ArgPro: 3.817 ± 0.053
ArgGln: 2.815 ± 0.05
ArgArg: 6.123 ± 0.077
ArgSer: 4.103 ± 0.053
ArgThr: 3.572 ± 0.051
ArgVal: 4.55 ± 0.054
ArgTrp: 1.226 ± 0.031
ArgTyr: 1.918 ± 0.03
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.593 ± 0.087
SerCys: 0.49 ± 0.019
SerAsp: 3.276 ± 0.048
SerGlu: 2.729 ± 0.045
SerPhe: 2.281 ± 0.032
SerGly: 5.722 ± 0.092
SerHis: 1.087 ± 0.026
SerIle: 2.867 ± 0.056
SerLys: 1.451 ± 0.032
SerLeu: 5.468 ± 0.058
SerMet: 1.313 ± 0.029
SerAsn: 1.455 ± 0.034
SerPro: 2.863 ± 0.048
SerGln: 1.692 ± 0.032
SerArg: 3.85 ± 0.048
SerSer: 2.977 ± 0.05
SerThr: 2.658 ± 0.05
SerVal: 3.821 ± 0.077
SerTrp: 0.867 ± 0.021
SerTyr: 1.447 ± 0.034
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.193 ± 0.068
ThrCys: 0.404 ± 0.015
ThrAsp: 2.812 ± 0.051
ThrGlu: 2.25 ± 0.04
ThrPhe: 1.69 ± 0.037
ThrGly: 5.128 ± 0.084
ThrHis: 1.044 ± 0.025
ThrIle: 2.978 ± 0.048
ThrLys: 1.283 ± 0.03
ThrLeu: 5.815 ± 0.069
ThrMet: 1.152 ± 0.025
ThrAsn: 1.229 ± 0.035
ThrPro: 3.209 ± 0.044
ThrGln: 1.486 ± 0.029
ThrArg: 3.372 ± 0.045
ThrSer: 2.642 ± 0.051
ThrThr: 2.504 ± 0.052
ThrVal: 4.058 ± 0.059
ThrTrp: 0.571 ± 0.018
ThrTyr: 1.132 ± 0.03
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.018 ± 0.075
ValCys: 0.522 ± 0.019
ValAsp: 4.257 ± 0.053
ValGlu: 4.546 ± 0.054
ValPhe: 2.018 ± 0.034
ValGly: 5.403 ± 0.092
ValHis: 1.406 ± 0.031
ValIle: 3.653 ± 0.053
ValLys: 2.146 ± 0.038
ValLeu: 6.256 ± 0.056
ValMet: 1.651 ± 0.032
ValAsn: 1.925 ± 0.041
ValPro: 3.764 ± 0.051
ValGln: 2.169 ± 0.035
ValArg: 5.118 ± 0.059
ValSer: 4.038 ± 0.051
ValThr: 4.357 ± 0.08
ValVal: 4.84 ± 0.056
ValTrp: 0.752 ± 0.021
ValTyr: 1.356 ± 0.024
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.382 ± 0.031
TrpCys: 0.134 ± 0.008
TrpAsp: 0.71 ± 0.023
TrpGlu: 0.632 ± 0.017
TrpPhe: 0.516 ± 0.019
TrpGly: 0.932 ± 0.023
TrpHis: 0.354 ± 0.013
TrpIle: 0.712 ± 0.022
TrpLys: 0.491 ± 0.016
TrpLeu: 1.752 ± 0.04
TrpMet: 0.367 ± 0.015
TrpAsn: 0.462 ± 0.015
TrpPro: 0.714 ± 0.024
TrpGln: 0.567 ± 0.018
TrpArg: 1.333 ± 0.03
TrpSer: 0.905 ± 0.02
TrpThr: 0.868 ± 0.024
TrpVal: 0.805 ± 0.026
TrpTrp: 0.241 ± 0.011
TrpTyr: 0.273 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.691 ± 0.04
TyrCys: 0.276 ± 0.011
TyrAsp: 1.544 ± 0.033
TyrGlu: 1.141 ± 0.028
TyrPhe: 0.79 ± 0.023
TyrGly: 2.095 ± 0.04
TyrHis: 0.517 ± 0.014
TyrIle: 0.858 ± 0.022
TyrLys: 0.583 ± 0.02
TyrLeu: 2.147 ± 0.037
TyrMet: 0.458 ± 0.016
TyrAsn: 0.594 ± 0.021
TyrPro: 1.02 ± 0.024
TyrGln: 0.72 ± 0.019
TyrArg: 1.949 ± 0.04
TyrSer: 1.245 ± 0.027
TyrThr: 0.94 ± 0.03
TyrVal: 1.569 ± 0.033
TyrTrp: 0.341 ± 0.013
TyrTyr: 0.618 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5795 proteins (1774848 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
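A protein-level bootstrap of this kind can be sketched as follows. The page states only that bootstrapping was done 100 times at the protein level, so the details here are assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function name, its parameters, and the fixed seed are hypothetical.

```python
import random
import statistics


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, error) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement n_boot times; the error
    is taken as the standard deviation across replicates (an assumption).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible

    # Per protein: (occurrences of `pair`, total number of dipeptides).
    stats = []
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        hits = sum(1 for i in range(n_pairs) if seq[i:i + 2] == pair)
        stats.append((hits, n_pairs))

    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(stats, k=len(stats))  # resample proteins
        total = sum(n for _, n in sample)
        hits = sum(h for h, _ in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)

    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAALAG", "MKLAAR", "AAGGAA"]  # hypothetical toy sequences
    mean, err = bootstrap_error(toy, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, which is what "at the protein level" suggests.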

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski