Amino acid dipeptide frequency for Catabacter hongkongensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
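The counting behind such a table can be sketched as follows. This is a minimal illustration, not the database's own pipeline: the function name `dipeptide_frequencies`, the one-letter alphabet, the choice to count overlapping pairs within each protein, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues maps to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

toy_proteome = ["MAAAKL", "MKAAL"]  # hypothetical sequences, not real data
freqs = dipeptide_frequencies(toy_proteome)
overrepresented = sorted(dp for dp, f in freqs.items() if f > UNIFORM_PER_MILLE)
print(overrepresented)
```

With a real proteome, the same comparison against 2.5‰ would separate the over- and underrepresented dipeptides that the original page distinguishes by colour.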

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
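As a worked example of the unit conversion, the short sketch below takes the AlaAla value from the table and compares it with the uniform expectation of 0.25%. The helper name `per_mille_to_fraction` is hypothetical and used only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 9.087  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100
uniform_percent = 100 / 400  # 0.25%, the uniform expectation for 400 dipeptides
ratio = percent / uniform_percent

print(f"fraction={fraction:.6f}  percent={percent:.4f}%  ratio={ratio:.2f}x")
```

Under these assumptions, AlaAla occurs roughly 3.6 times more often than a uniform distribution over 400 dipeptides would predict.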

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.087 ± 0.127
AlaCys: 1.345 ± 0.037
AlaAsp: 5.075 ± 0.086
AlaGlu: 5.722 ± 0.096
AlaPhe: 3.405 ± 0.067
AlaGly: 7.381 ± 0.097
AlaHis: 1.331 ± 0.036
AlaIle: 5.639 ± 0.092
AlaLys: 5.534 ± 0.083
AlaLeu: 7.817 ± 0.12
AlaMet: 2.53 ± 0.049
AlaAsn: 2.945 ± 0.067
AlaPro: 2.665 ± 0.086
AlaGln: 3.655 ± 0.075
AlaArg: 3.714 ± 0.067
AlaSer: 4.331 ± 0.077
AlaThr: 3.714 ± 0.074
AlaVal: 7.088 ± 0.104
AlaTrp: 0.628 ± 0.029
AlaTyr: 2.711 ± 0.064
AlaXaa: 0.002 ± 0.001
Cys
CysAla: 1.416 ± 0.046
CysCys: 0.286 ± 0.018
CysAsp: 0.848 ± 0.035
CysGlu: 0.943 ± 0.035
CysPhe: 0.629 ± 0.024
CysGly: 1.683 ± 0.047
CysHis: 0.255 ± 0.017
CysIle: 1.022 ± 0.038
CysLys: 0.724 ± 0.033
CysLeu: 0.967 ± 0.032
CysMet: 0.449 ± 0.021
CysAsn: 0.486 ± 0.023
CysPro: 0.657 ± 0.032
CysGln: 0.263 ± 0.017
CysArg: 0.673 ± 0.032
CysSer: 0.848 ± 0.034
CysThr: 0.673 ± 0.026
CysVal: 1.189 ± 0.04
CysTrp: 0.097 ± 0.009
CysTyr: 0.445 ± 0.021
CysXaa: 0.001 ± 0.001
Asp
AspAla: 5.127 ± 0.068
AspCys: 0.767 ± 0.03
AspAsp: 2.938 ± 0.065
AspGlu: 4.621 ± 0.083
AspPhe: 2.754 ± 0.06
AspGly: 4.921 ± 0.122
AspHis: 0.83 ± 0.033
AspIle: 4.778 ± 0.076
AspLys: 3.279 ± 0.063
AspLeu: 4.419 ± 0.063
AspMet: 2.011 ± 0.05
AspAsn: 2.034 ± 0.045
AspPro: 1.869 ± 0.053
AspGln: 1.388 ± 0.041
AspArg: 2.272 ± 0.054
AspSer: 2.757 ± 0.058
AspThr: 3.077 ± 0.061
AspVal: 4.32 ± 0.067
AspTrp: 0.567 ± 0.027
AspTyr: 2.431 ± 0.062
AspXaa: 0.001 ± 0.001
Glu
GluAla: 5.645 ± 0.099
GluCys: 0.82 ± 0.029
GluAsp: 3.518 ± 0.077
GluGlu: 5.76 ± 0.088
GluPhe: 2.526 ± 0.058
GluGly: 4.431 ± 0.089
GluHis: 1.182 ± 0.039
GluIle: 5.443 ± 0.08
GluLys: 5.851 ± 0.084
GluLeu: 6.361 ± 0.085
GluMet: 2.415 ± 0.05
GluAsn: 3.711 ± 0.073
GluPro: 2.214 ± 0.058
GluGln: 3.214 ± 0.073
GluArg: 3.372 ± 0.082
GluSer: 3.074 ± 0.064
GluThr: 3.761 ± 0.073
GluVal: 4.146 ± 0.073
GluTrp: 0.593 ± 0.027
GluTyr: 2.98 ± 0.066
GluXaa: 0.002 ± 0.001
Phe
PheAla: 3.562 ± 0.067
PheCys: 0.715 ± 0.032
PheAsp: 2.755 ± 0.055
PheGlu: 2.797 ± 0.072
PhePhe: 1.892 ± 0.056
PheGly: 3.19 ± 0.066
PheHis: 0.734 ± 0.031
PheIle: 2.661 ± 0.069
PheLys: 1.702 ± 0.037
PheLeu: 3.72 ± 0.077
PheMet: 1.157 ± 0.042
PheAsn: 1.391 ± 0.041
PhePro: 1.402 ± 0.039
PheGln: 1.074 ± 0.033
PheArg: 1.6 ± 0.044
PheSer: 2.888 ± 0.057
PheThr: 2.428 ± 0.093
PheVal: 2.953 ± 0.065
PheTrp: 0.357 ± 0.022
PheTyr: 1.486 ± 0.045
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.446 ± 0.107
GlyCys: 1.291 ± 0.042
GlyAsp: 3.899 ± 0.083
GlyGlu: 5.168 ± 0.085
GlyPhe: 3.238 ± 0.061
GlyGly: 6.186 ± 0.131
GlyHis: 1.215 ± 0.037
GlyIle: 6.579 ± 0.091
GlyLys: 5.578 ± 0.073
GlyLeu: 6.279 ± 0.084
GlyMet: 2.706 ± 0.057
GlyAsn: 3.256 ± 0.071
GlyPro: 1.42 ± 0.044
GlyGln: 2.319 ± 0.054
GlyArg: 3.434 ± 0.06
GlySer: 4.143 ± 0.091
GlyThr: 4.67 ± 0.097
GlyVal: 5.8 ± 0.101
GlyTrp: 0.641 ± 0.026
GlyTyr: 3.124 ± 0.062
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.264 ± 0.041
HisCys: 0.288 ± 0.019
HisAsp: 0.877 ± 0.032
HisGlu: 1.031 ± 0.032
HisPhe: 0.751 ± 0.03
HisGly: 1.283 ± 0.041
HisHis: 0.322 ± 0.019
HisIle: 1.292 ± 0.044
HisLys: 0.969 ± 0.034
HisLeu: 1.324 ± 0.038
HisMet: 0.533 ± 0.023
HisAsn: 0.632 ± 0.025
HisPro: 0.821 ± 0.033
HisGln: 0.422 ± 0.02
HisArg: 0.716 ± 0.029
HisSer: 0.927 ± 0.032
HisThr: 0.963 ± 0.03
HisVal: 0.971 ± 0.031
HisTrp: 0.139 ± 0.011
HisTyr: 0.622 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.713 ± 0.097
IleCys: 1.135 ± 0.041
IleAsp: 4.606 ± 0.082
IleGlu: 4.633 ± 0.082
IlePhe: 2.816 ± 0.063
IleGly: 5.428 ± 0.083
IleHis: 1.08 ± 0.036
IleIle: 5.068 ± 0.107
IleLys: 4.062 ± 0.069
IleLeu: 6.496 ± 0.107
IleMet: 2.12 ± 0.047
IleAsn: 2.795 ± 0.056
IlePro: 3.097 ± 0.058
IleGln: 1.74 ± 0.051
IleArg: 3.391 ± 0.066
IleSer: 4.76 ± 0.079
IleThr: 4.288 ± 0.071
IleVal: 5.272 ± 0.077
IleTrp: 0.533 ± 0.024
IleTyr: 2.332 ± 0.058
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.138 ± 0.073
LysCys: 0.714 ± 0.03
LysAsp: 3.649 ± 0.068
LysGlu: 5.371 ± 0.082
LysPhe: 1.721 ± 0.043
LysGly: 4.19 ± 0.067
LysHis: 1.01 ± 0.033
LysIle: 4.564 ± 0.084
LysLys: 5.894 ± 0.084
LysLeu: 5.19 ± 0.08
LysMet: 2.263 ± 0.048
LysAsn: 3.454 ± 0.064
LysPro: 2.175 ± 0.056
LysGln: 2.539 ± 0.058
LysArg: 3.3 ± 0.07
LysSer: 3.151 ± 0.058
LysThr: 3.824 ± 0.061
LysVal: 3.691 ± 0.073
LysTrp: 0.552 ± 0.019
LysTyr: 2.363 ± 0.055
LysXaa: 0.003 ± 0.002
Leu
LeuAla: 7.365 ± 0.1
LeuCys: 1.571 ± 0.043
LeuAsp: 4.783 ± 0.076
LeuGlu: 5.637 ± 0.088
LeuPhe: 3.686 ± 0.075
LeuGly: 6.119 ± 0.102
LeuHis: 1.488 ± 0.045
LeuIle: 5.716 ± 0.098
LeuLys: 5.736 ± 0.077
LeuLeu: 7.98 ± 0.143
LeuMet: 2.686 ± 0.065
LeuAsn: 3.446 ± 0.076
LeuPro: 3.546 ± 0.065
LeuGln: 2.738 ± 0.061
LeuArg: 4.16 ± 0.081
LeuSer: 6.005 ± 0.097
LeuThr: 4.81 ± 0.074
LeuVal: 5.604 ± 0.085
LeuTrp: 0.682 ± 0.028
LeuTyr: 3.018 ± 0.07
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.551 ± 0.058
MetCys: 0.425 ± 0.023
MetAsp: 2.036 ± 0.05
MetGlu: 2.337 ± 0.054
MetPhe: 0.964 ± 0.033
MetGly: 2.36 ± 0.049
MetHis: 0.448 ± 0.022
MetIle: 2.174 ± 0.053
MetLys: 2.389 ± 0.051
MetLeu: 2.913 ± 0.055
MetMet: 1.025 ± 0.04
MetAsn: 1.45 ± 0.044
MetPro: 1.335 ± 0.042
MetGln: 1.298 ± 0.033
MetArg: 1.511 ± 0.042
MetSer: 1.747 ± 0.051
MetThr: 1.739 ± 0.045
MetVal: 1.959 ± 0.054
MetTrp: 0.204 ± 0.015
MetTyr: 0.801 ± 0.032
MetXaa: 0.001 ± 0.001
Asn
AsnAla: 3.483 ± 0.063
AsnCys: 0.521 ± 0.025
AsnAsp: 2.103 ± 0.049
AsnGlu: 2.692 ± 0.055
AsnPhe: 1.467 ± 0.039
AsnGly: 3.679 ± 0.094
AsnHis: 0.638 ± 0.027
AsnIle: 3.323 ± 0.059
AsnLys: 2.241 ± 0.046
AsnLeu: 3.392 ± 0.064
AsnMet: 1.334 ± 0.038
AsnAsn: 1.712 ± 0.055
AsnPro: 1.989 ± 0.044
AsnGln: 1.253 ± 0.042
AsnArg: 1.777 ± 0.047
AsnSer: 2.098 ± 0.061
AsnThr: 2.29 ± 0.058
AsnVal: 2.949 ± 0.061
AsnTrp: 0.339 ± 0.02
AsnTyr: 1.485 ± 0.052
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.184 ± 0.076
ProCys: 0.476 ± 0.023
ProAsp: 2.62 ± 0.063
ProGlu: 3.29 ± 0.072
ProPhe: 1.568 ± 0.046
ProGly: 2.56 ± 0.061
ProHis: 0.657 ± 0.028
ProIle: 2.042 ± 0.047
ProLys: 1.908 ± 0.043
ProLeu: 2.882 ± 0.059
ProMet: 0.957 ± 0.029
ProAsn: 1.187 ± 0.04
ProPro: 1.049 ± 0.038
ProGln: 1.629 ± 0.039
ProArg: 1.124 ± 0.033
ProSer: 1.91 ± 0.047
ProThr: 1.857 ± 0.054
ProVal: 3.316 ± 0.071
ProTrp: 0.24 ± 0.017
ProTyr: 1.411 ± 0.041
ProXaa: 0.001 ± 0.001
Gln
GlnAla: 3.043 ± 0.072
GlnCys: 0.318 ± 0.019
GlnAsp: 1.825 ± 0.042
GlnGlu: 2.776 ± 0.072
GlnPhe: 1.147 ± 0.03
GlnGly: 2.294 ± 0.057
GlnHis: 0.425 ± 0.022
GlnIle: 2.482 ± 0.05
GlnLys: 2.869 ± 0.06
GlnLeu: 2.679 ± 0.057
GlnMet: 1.185 ± 0.034
GlnAsn: 1.738 ± 0.05
GlnPro: 1.26 ± 0.038
GlnGln: 1.62 ± 0.057
GlnArg: 1.521 ± 0.046
GlnSer: 1.751 ± 0.046
GlnThr: 1.804 ± 0.05
GlnVal: 2.044 ± 0.044
GlnTrp: 0.303 ± 0.02
GlnTyr: 1.312 ± 0.042
GlnXaa: 0.001 ± 0.001
Arg
ArgAla: 3.281 ± 0.059
ArgCys: 0.548 ± 0.029
ArgAsp: 2.402 ± 0.053
ArgGlu: 3.995 ± 0.071
ArgPhe: 1.95 ± 0.047
ArgGly: 2.837 ± 0.054
ArgHis: 0.776 ± 0.034
ArgIle: 3.481 ± 0.073
ArgLys: 3.129 ± 0.063
ArgLeu: 4.054 ± 0.075
ArgMet: 1.565 ± 0.047
ArgAsn: 1.854 ± 0.042
ArgPro: 1.376 ± 0.04
ArgGln: 1.759 ± 0.049
ArgArg: 2.372 ± 0.067
ArgSer: 2.126 ± 0.054
ArgThr: 2.194 ± 0.048
ArgVal: 2.872 ± 0.055
ArgTrp: 0.363 ± 0.019
ArgTyr: 1.752 ± 0.044
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 4.933 ± 0.081
SerCys: 0.76 ± 0.027
SerAsp: 3.005 ± 0.057
SerGlu: 3.334 ± 0.066
SerPhe: 2.541 ± 0.054
SerGly: 5.549 ± 0.086
SerHis: 0.944 ± 0.036
SerIle: 3.951 ± 0.063
SerLys: 2.994 ± 0.067
SerLeu: 4.676 ± 0.073
SerMet: 1.718 ± 0.049
SerAsn: 2.0 ± 0.047
SerPro: 1.987 ± 0.05
SerGln: 1.796 ± 0.046
SerArg: 2.452 ± 0.058
SerSer: 3.26 ± 0.08
SerThr: 2.736 ± 0.067
SerVal: 4.435 ± 0.072
SerTrp: 0.428 ± 0.022
SerTyr: 2.021 ± 0.051
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 4.865 ± 0.113
ThrCys: 0.598 ± 0.027
ThrAsp: 3.304 ± 0.081
ThrGlu: 3.383 ± 0.073
ThrPhe: 2.173 ± 0.053
ThrGly: 5.103 ± 0.095
ThrHis: 0.897 ± 0.036
ThrIle: 3.773 ± 0.066
ThrLys: 3.073 ± 0.063
ThrLeu: 5.02 ± 0.082
ThrMet: 1.482 ± 0.044
ThrAsn: 1.973 ± 0.064
ThrPro: 2.612 ± 0.061
ThrGln: 2.001 ± 0.05
ThrArg: 1.996 ± 0.053
ThrSer: 2.704 ± 0.07
ThrThr: 2.666 ± 0.073
ThrVal: 4.561 ± 0.123
ThrTrp: 0.408 ± 0.024
ThrTyr: 1.691 ± 0.056
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.699 ± 0.079
ValCys: 1.272 ± 0.046
ValAsp: 3.916 ± 0.07
ValGlu: 4.455 ± 0.069
ValPhe: 3.2 ± 0.07
ValGly: 4.74 ± 0.078
ValHis: 1.068 ± 0.033
ValIle: 5.379 ± 0.092
ValLys: 4.406 ± 0.067
ValLeu: 6.669 ± 0.106
ValMet: 2.253 ± 0.043
ValAsn: 2.911 ± 0.065
ValPro: 2.783 ± 0.059
ValGln: 2.147 ± 0.052
ValArg: 3.163 ± 0.06
ValSer: 4.566 ± 0.074
ValThr: 4.286 ± 0.138
ValVal: 5.473 ± 0.092
ValTrp: 0.628 ± 0.028
ValTyr: 2.469 ± 0.053
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 0.563 ± 0.023
TrpCys: 0.122 ± 0.012
TrpAsp: 0.532 ± 0.023
TrpGlu: 0.562 ± 0.029
TrpPhe: 0.385 ± 0.019
TrpGly: 0.587 ± 0.026
TrpHis: 0.151 ± 0.011
TrpIle: 0.583 ± 0.028
TrpLys: 0.591 ± 0.023
TrpLeu: 0.832 ± 0.026
TrpMet: 0.269 ± 0.016
TrpAsn: 0.418 ± 0.024
TrpPro: 0.189 ± 0.016
TrpGln: 0.351 ± 0.018
TrpArg: 0.325 ± 0.02
TrpSer: 0.408 ± 0.02
TrpThr: 0.4 ± 0.024
TrpVal: 0.455 ± 0.024
TrpTrp: 0.091 ± 0.01
TrpTyr: 0.298 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.954 ± 0.068
TyrCys: 0.58 ± 0.023
TyrAsp: 2.47 ± 0.052
TyrGlu: 2.512 ± 0.068
TyrPhe: 1.652 ± 0.043
TyrGly: 2.87 ± 0.069
TyrHis: 0.696 ± 0.028
TyrIle: 2.416 ± 0.053
TyrLys: 1.824 ± 0.054
TyrLeu: 3.103 ± 0.07
TyrMet: 1.008 ± 0.033
TyrAsn: 1.373 ± 0.043
TyrPro: 1.423 ± 0.043
TyrGln: 1.192 ± 0.039
TyrArg: 1.812 ± 0.052
TyrSer: 2.064 ± 0.047
TyrThr: 2.102 ± 0.073
TyrVal: 2.386 ± 0.056
TyrTrp: 0.319 ± 0.019
TyrTyr: 1.485 ± 0.048
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.002 ± 0.002
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.001 ± 0.001
XaaGly: 0.002 ± 0.002
XaaHis: 0.001 ± 0.001
XaaIle: 0.0 ± 0.0
XaaLys: 0.004 ± 0.002
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.001 ± 0.001
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.001 ± 0.001
XaaThr: 0.0 ± 0.0
XaaVal: 0.001 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.002 ± 0.001
XaaXaa: 0.001 ± 0.001
Statistics based on 3148 proteins (940206 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
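A protein-level bootstrap of this kind can be sketched as below. The source states only that 100 resamples were drawn at the protein level; reporting the standard deviation of the resampled estimates as the error, the function names, the fixed seed, and the toy sequences are assumptions made for this illustration.

```python
import random
from statistics import mean, stdev


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Each resample has as many proteins as the original proteome.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return mean(estimates), stdev(estimates)


toy_proteome = ["MAAAKL", "MKAAL", "MGGSAA", "MLLKV"]  # hypothetical data
avg, err = bootstrap_error(toy_proteome, "AA")
print(f"AA: {avg:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the error reflects variation between proteins.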

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski