Amino acid dipeptide frequency for Tyzzerella sp. An114

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
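
As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_permille`, the use of one-letter codes, and the normalization by the total number of dipeptides are illustrative assumptions, not necessarily the procedure used to build this table.

```python
from collections import Counter

def dipeptide_permille(sequences):
    """Return overlapping dipeptide frequencies in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real proteome data.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The order of residues matters here: "AK" and "KA" are counted as different dipeptides, which matches the table, where for example AlaCys and CysAla have separate entries.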

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
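
The expected value can be checked with a few lines of arithmetic. The sketch below derives the uniform expectation for 20 standard amino acids and compares three values taken from the table against it. The `classify` helper and the plain greater-than/less-than comparison are assumptions made for illustration; the page does not state whether the colouring also takes the error estimate into account.

```python
N_STANDARD = 20

# 20 x 20 = 400 ordered pairs, so a uniform share is 1/400 = 0.25% = 2.5 per mille.
n_dipeptides = N_STANDARD ** 2
expected_permille = 1000 / n_dipeptides

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (hypothetical helper)."""
    if observed_permille > expected:
        return "over"   # would be marked red
    if observed_permille < expected:
        return "under"  # would be marked blue
    return "as expected"

# Values copied from the table, in per mille.
examples = {"AlaAla": 4.055, "AlaCys": 0.769, "IleIle": 8.917}
labels = {name: classify(value) for name, value in examples.items()}
print(n_dipeptides, expected_permille, labels)
```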

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
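
A short worked conversion may help here. The sketch below turns the AlaAla value from the table into a fraction and a percentage, and then into an approximate count. The count assumes that dipeptides are overlapping pairs within each protein, so that the total number of dipeptides equals the number of amino acids minus the number of proteins (both figures are the ones quoted under the table); this counting convention is an assumption, not something stated on the page.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert a per mille value to a percentage."""
    return value / 10

ala_ala = 4.055  # AlaAla, per mille, from the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)

# Assumed convention: one protein of length L contributes L - 1 dipeptides.
n_amino_acids = 804866
n_proteins = 2549
n_dipeptides = n_amino_acids - n_proteins
approx_count = fraction * n_dipeptides

print(fraction, percent, round(approx_count))
```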

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.055 ± 0.105
AlaCys: 0.769 ± 0.031
AlaAsp: 3.48 ± 0.068
AlaGlu: 4.566 ± 0.087
AlaPhe: 2.761 ± 0.053
AlaGly: 4.493 ± 0.112
AlaHis: 0.737 ± 0.029
AlaIle: 5.144 ± 0.093
AlaLys: 4.783 ± 0.08
AlaLeu: 5.514 ± 0.088
AlaMet: 1.8 ± 0.051
AlaAsn: 2.418 ± 0.063
AlaPro: 1.552 ± 0.049
AlaGln: 1.551 ± 0.042
AlaArg: 1.895 ± 0.062
AlaSer: 3.251 ± 0.068
AlaThr: 2.896 ± 0.069
AlaVal: 5.677 ± 0.097
AlaTrp: 0.352 ± 0.024
AlaTyr: 2.173 ± 0.052
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.789 ± 0.036
CysCys: 0.262 ± 0.019
CysAsp: 0.957 ± 0.035
CysGlu: 0.944 ± 0.037
CysPhe: 0.584 ± 0.026
CysGly: 1.326 ± 0.053
CysHis: 0.24 ± 0.016
CysIle: 1.252 ± 0.042
CysLys: 0.964 ± 0.039
CysLeu: 0.927 ± 0.034
CysMet: 0.318 ± 0.02
CysAsn: 0.732 ± 0.03
CysPro: 0.507 ± 0.029
CysGln: 0.275 ± 0.015
CysArg: 0.412 ± 0.022
CysSer: 0.799 ± 0.038
CysThr: 0.655 ± 0.031
CysVal: 0.906 ± 0.031
CysTrp: 0.084 ± 0.012
CysTyr: 0.478 ± 0.032
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.994 ± 0.059
AspCys: 0.822 ± 0.029
AspAsp: 3.634 ± 0.098
AspGlu: 5.331 ± 0.095
AspPhe: 3.286 ± 0.074
AspGly: 3.637 ± 0.086
AspHis: 0.533 ± 0.032
AspIle: 7.155 ± 0.11
AspLys: 4.864 ± 0.081
AspLeu: 3.635 ± 0.075
AspMet: 1.958 ± 0.055
AspAsn: 3.393 ± 0.065
AspPro: 1.254 ± 0.044
AspGln: 0.625 ± 0.029
AspArg: 1.982 ± 0.049
AspSer: 2.978 ± 0.071
AspThr: 3.434 ± 0.072
AspVal: 4.131 ± 0.073
AspTrp: 0.478 ± 0.028
AspTyr: 2.937 ± 0.069
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.424 ± 0.092
GluCys: 0.827 ± 0.035
GluAsp: 3.914 ± 0.076
GluGlu: 6.22 ± 0.112
GluPhe: 3.111 ± 0.066
GluGly: 4.68 ± 0.085
GluHis: 0.909 ± 0.037
GluIle: 7.196 ± 0.099
GluLys: 8.44 ± 0.124
GluLeu: 6.008 ± 0.088
GluMet: 2.161 ± 0.053
GluAsn: 6.699 ± 0.119
GluPro: 1.715 ± 0.051
GluGln: 1.918 ± 0.059
GluArg: 2.864 ± 0.069
GluSer: 3.489 ± 0.072
GluThr: 3.853 ± 0.071
GluVal: 4.249 ± 0.082
GluTrp: 0.575 ± 0.03
GluTyr: 3.552 ± 0.075
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.8 ± 0.058
PheCys: 0.736 ± 0.029
PheAsp: 3.225 ± 0.076
PheGlu: 3.442 ± 0.073
PhePhe: 2.101 ± 0.057
PheGly: 3.187 ± 0.062
PheHis: 0.554 ± 0.03
PheIle: 4.58 ± 0.102
PheLys: 3.399 ± 0.078
PheLeu: 3.332 ± 0.078
PheMet: 1.455 ± 0.041
PheAsn: 2.537 ± 0.06
PhePro: 1.244 ± 0.042
PheGln: 0.936 ± 0.035
PheArg: 1.337 ± 0.041
PheSer: 3.286 ± 0.08
PheThr: 2.449 ± 0.063
PheVal: 3.274 ± 0.075
PheTrp: 0.286 ± 0.021
PheTyr: 1.862 ± 0.048
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.275 ± 0.092
GlyCys: 1.028 ± 0.045
GlyAsp: 3.537 ± 0.076
GlyGlu: 4.668 ± 0.085
GlyPhe: 3.227 ± 0.072
GlyGly: 4.801 ± 0.11
GlyHis: 1.066 ± 0.037
GlyIle: 6.93 ± 0.109
GlyLys: 5.755 ± 0.093
GlyLeu: 5.051 ± 0.087
GlyMet: 2.01 ± 0.052
GlyAsn: 3.442 ± 0.071
GlyPro: 1.103 ± 0.041
GlyGln: 1.544 ± 0.041
GlyArg: 2.32 ± 0.053
GlySer: 3.903 ± 0.077
GlyThr: 4.034 ± 0.083
GlyVal: 4.934 ± 0.086
GlyTrp: 0.544 ± 0.027
GlyTyr: 3.054 ± 0.064
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.652 ± 0.029
HisCys: 0.252 ± 0.018
HisAsp: 0.698 ± 0.032
HisGlu: 0.815 ± 0.033
HisPhe: 0.662 ± 0.027
HisGly: 0.865 ± 0.035
HisHis: 0.301 ± 0.031
HisIle: 1.451 ± 0.041
HisLys: 0.98 ± 0.042
HisLeu: 0.902 ± 0.031
HisMet: 0.373 ± 0.016
HisAsn: 0.763 ± 0.033
HisPro: 0.585 ± 0.032
HisGln: 0.289 ± 0.02
HisArg: 0.532 ± 0.028
HisSer: 0.868 ± 0.031
HisThr: 0.777 ± 0.036
HisVal: 0.696 ± 0.029
HisTrp: 0.109 ± 0.012
HisTyr: 0.601 ± 0.026
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.729 ± 0.106
IleCys: 1.439 ± 0.043
IleAsp: 6.007 ± 0.099
IleGlu: 7.303 ± 0.099
IlePhe: 4.477 ± 0.09
IleGly: 5.592 ± 0.1
IleHis: 1.179 ± 0.037
IleIle: 8.917 ± 0.147
IleLys: 7.95 ± 0.12
IleLeu: 7.897 ± 0.12
IleMet: 2.605 ± 0.065
IleAsn: 5.574 ± 0.094
IlePro: 3.443 ± 0.069
IleGln: 2.041 ± 0.046
IleArg: 2.941 ± 0.067
IleSer: 6.742 ± 0.098
IleThr: 5.238 ± 0.085
IleVal: 6.143 ± 0.096
IleTrp: 0.574 ± 0.031
IleTyr: 3.679 ± 0.068
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.884 ± 0.098
LysCys: 0.886 ± 0.038
LysAsp: 5.051 ± 0.084
LysGlu: 7.512 ± 0.105
LysPhe: 3.043 ± 0.061
LysGly: 5.157 ± 0.074
LysHis: 1.044 ± 0.04
LysIle: 7.411 ± 0.112
LysLys: 7.616 ± 0.107
LysLeu: 5.97 ± 0.089
LysMet: 2.47 ± 0.062
LysAsn: 6.483 ± 0.124
LysPro: 2.135 ± 0.058
LysGln: 1.851 ± 0.051
LysArg: 3.146 ± 0.069
LysSer: 4.974 ± 0.105
LysThr: 4.623 ± 0.075
LysVal: 4.724 ± 0.09
LysTrp: 0.57 ± 0.025
LysTyr: 4.12 ± 0.078
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.593 ± 0.085
LeuCys: 1.157 ± 0.04
LeuAsp: 4.342 ± 0.083
LeuGlu: 5.375 ± 0.086
LeuPhe: 3.586 ± 0.076
LeuGly: 5.272 ± 0.079
LeuHis: 1.019 ± 0.037
LeuIle: 6.312 ± 0.107
LeuLys: 7.236 ± 0.111
LeuLeu: 6.071 ± 0.108
LeuMet: 2.306 ± 0.055
LeuAsn: 4.421 ± 0.08
LeuPro: 2.676 ± 0.064
LeuGln: 1.824 ± 0.046
LeuArg: 2.916 ± 0.064
LeuSer: 6.057 ± 0.094
LeuThr: 4.026 ± 0.078
LeuVal: 4.398 ± 0.081
LeuTrp: 0.595 ± 0.029
LeuTyr: 3.078 ± 0.068
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.158 ± 0.051
MetCys: 0.358 ± 0.019
MetAsp: 1.59 ± 0.039
MetGlu: 2.274 ± 0.062
MetPhe: 1.122 ± 0.039
MetGly: 2.331 ± 0.062
MetHis: 0.326 ± 0.017
MetIle: 2.22 ± 0.05
MetLys: 2.497 ± 0.055
MetLeu: 2.22 ± 0.057
MetMet: 0.69 ± 0.034
MetAsn: 1.722 ± 0.045
MetPro: 1.032 ± 0.038
MetGln: 0.647 ± 0.026
MetArg: 1.041 ± 0.032
MetSer: 1.879 ± 0.051
MetThr: 1.563 ± 0.038
MetVal: 1.712 ± 0.043
MetTrp: 0.174 ± 0.015
MetTyr: 1.069 ± 0.035
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.038 ± 0.065
AsnCys: 0.837 ± 0.036
AsnAsp: 3.301 ± 0.072
AsnGlu: 4.173 ± 0.082
AsnPhe: 2.639 ± 0.063
AsnGly: 4.039 ± 0.082
AsnHis: 0.767 ± 0.031
AsnIle: 7.388 ± 0.114
AsnLys: 4.483 ± 0.101
AsnLeu: 4.186 ± 0.07
AsnMet: 1.824 ± 0.047
AsnAsn: 3.776 ± 0.098
AsnPro: 2.264 ± 0.057
AsnGln: 1.18 ± 0.037
AsnArg: 1.88 ± 0.047
AsnSer: 3.965 ± 0.098
AsnThr: 3.367 ± 0.073
AsnVal: 3.852 ± 0.071
AsnTrp: 0.415 ± 0.025
AsnTyr: 2.373 ± 0.053
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.696 ± 0.055
ProCys: 0.375 ± 0.019
ProAsp: 1.885 ± 0.06
ProGlu: 2.863 ± 0.067
ProPhe: 1.578 ± 0.049
ProGly: 1.565 ± 0.045
ProHis: 0.45 ± 0.025
ProIle: 2.369 ± 0.066
ProLys: 2.157 ± 0.052
ProLeu: 2.33 ± 0.053
ProMet: 0.733 ± 0.029
ProAsn: 1.434 ± 0.05
ProPro: 0.721 ± 0.033
ProGln: 0.816 ± 0.032
ProArg: 0.774 ± 0.036
ProSer: 1.936 ± 0.053
ProThr: 1.378 ± 0.053
ProVal: 2.712 ± 0.06
ProTrp: 0.21 ± 0.016
ProTyr: 1.272 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.385 ± 0.034
GlnCys: 0.258 ± 0.017
GlnAsp: 0.995 ± 0.032
GlnGlu: 1.553 ± 0.052
GlnPhe: 0.9 ± 0.033
GlnGly: 1.442 ± 0.046
GlnHis: 0.324 ± 0.022
GlnIle: 2.098 ± 0.047
GlnLys: 2.051 ± 0.053
GlnLeu: 1.951 ± 0.049
GlnMet: 0.62 ± 0.027
GlnAsn: 1.483 ± 0.042
GlnPro: 0.595 ± 0.03
GlnGln: 0.683 ± 0.036
GlnArg: 0.979 ± 0.041
GlnSer: 1.394 ± 0.044
GlnThr: 1.184 ± 0.039
GlnVal: 1.259 ± 0.04
GlnTrp: 0.195 ± 0.016
GlnTyr: 1.031 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.984 ± 0.045
ArgCys: 0.422 ± 0.025
ArgAsp: 1.88 ± 0.053
ArgGlu: 2.922 ± 0.067
ArgPhe: 1.557 ± 0.049
ArgGly: 2.113 ± 0.056
ArgHis: 0.493 ± 0.025
ArgIle: 3.189 ± 0.067
ArgLys: 3.032 ± 0.072
ArgLeu: 2.891 ± 0.072
ArgMet: 0.988 ± 0.036
ArgAsn: 2.103 ± 0.048
ArgPro: 1.003 ± 0.038
ArgGln: 0.989 ± 0.039
ArgArg: 1.456 ± 0.053
ArgSer: 1.597 ± 0.047
ArgThr: 1.703 ± 0.043
ArgVal: 2.133 ± 0.059
ArgTrp: 0.235 ± 0.018
ArgTyr: 1.482 ± 0.047
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.74 ± 0.082
SerCys: 0.74 ± 0.035
SerAsp: 3.966 ± 0.083
SerGlu: 4.673 ± 0.078
SerPhe: 3.046 ± 0.067
SerGly: 4.745 ± 0.079
SerHis: 0.885 ± 0.034
SerIle: 5.792 ± 0.104
SerLys: 4.842 ± 0.084
SerLeu: 5.059 ± 0.092
SerMet: 1.681 ± 0.045
SerAsn: 3.297 ± 0.077
SerPro: 1.731 ± 0.047
SerGln: 1.639 ± 0.049
SerArg: 2.12 ± 0.051
SerSer: 4.067 ± 0.093
SerThr: 3.005 ± 0.072
SerVal: 4.606 ± 0.089
SerTrp: 0.414 ± 0.027
SerTyr: 2.51 ± 0.059
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.899 ± 0.078
ThrCys: 0.453 ± 0.025
ThrAsp: 3.356 ± 0.07
ThrGlu: 4.177 ± 0.082
ThrPhe: 2.34 ± 0.057
ThrGly: 4.473 ± 0.089
ThrHis: 0.713 ± 0.029
ThrIle: 4.633 ± 0.076
ThrLys: 3.603 ± 0.067
ThrLeu: 4.304 ± 0.071
ThrMet: 1.377 ± 0.04
ThrAsn: 2.572 ± 0.071
ThrPro: 2.03 ± 0.053
ThrGln: 1.169 ± 0.037
ThrArg: 1.613 ± 0.047
ThrSer: 3.073 ± 0.072
ThrThr: 2.941 ± 0.075
ThrVal: 4.863 ± 0.082
ThrTrp: 0.344 ± 0.023
ThrTyr: 1.963 ± 0.047
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.158 ± 0.089
ValCys: 1.042 ± 0.035
ValAsp: 3.96 ± 0.073
ValGlu: 4.484 ± 0.078
ValPhe: 3.546 ± 0.076
ValGly: 4.083 ± 0.083
ValHis: 0.897 ± 0.032
ValIle: 6.447 ± 0.099
ValLys: 5.201 ± 0.084
ValLeu: 5.68 ± 0.095
ValMet: 1.938 ± 0.046
ValAsn: 3.541 ± 0.072
ValPro: 2.333 ± 0.053
ValGln: 1.373 ± 0.039
ValArg: 2.148 ± 0.056
ValSer: 5.079 ± 0.083
ValThr: 3.952 ± 0.097
ValVal: 4.882 ± 0.091
ValTrp: 0.446 ± 0.021
ValTyr: 2.827 ± 0.065
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.401 ± 0.026
TrpCys: 0.087 ± 0.011
TrpAsp: 0.466 ± 0.023
TrpGlu: 0.472 ± 0.024
TrpPhe: 0.337 ± 0.022
TrpGly: 0.559 ± 0.027
TrpHis: 0.132 ± 0.013
TrpIle: 0.589 ± 0.031
TrpLys: 0.529 ± 0.029
TrpLeu: 0.629 ± 0.03
TrpMet: 0.134 ± 0.013
TrpAsn: 0.502 ± 0.027
TrpPro: 0.116 ± 0.012
TrpGln: 0.239 ± 0.018
TrpArg: 0.244 ± 0.018
TrpSer: 0.414 ± 0.024
TrpThr: 0.339 ± 0.025
TrpVal: 0.39 ± 0.021
TrpTrp: 0.076 ± 0.011
TrpTyr: 0.333 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.209 ± 0.059
TyrCys: 0.658 ± 0.026
TyrAsp: 2.819 ± 0.071
TyrGlu: 3.039 ± 0.066
TyrPhe: 2.121 ± 0.057
TyrGly: 2.866 ± 0.06
TyrHis: 0.605 ± 0.028
TyrIle: 4.395 ± 0.093
TyrLys: 3.285 ± 0.075
TyrLeu: 2.701 ± 0.057
TyrMet: 1.183 ± 0.045
TyrAsn: 2.854 ± 0.064
TyrPro: 1.216 ± 0.045
TyrGln: 0.793 ± 0.033
TyrArg: 1.575 ± 0.046
TyrSer: 2.809 ± 0.058
TyrThr: 2.477 ± 0.056
TyrVal: 2.479 ± 0.057
TyrTrp: 0.309 ± 0.02
TyrTyr: 2.01 ± 0.052
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2549 proteins (804866 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
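
To illustrate what a protein-level bootstrap could look like, the sketch below resamples whole proteins with replacement 100 times, recomputes the per mille frequency of one dipeptide in each resample, and reports the standard deviation across resamples. Only the number of replicates and the protein-level resampling come from the note above; the function name `bootstrap_sd`, the use of the standard deviation as the reported error, one-letter codes, and the fixed seed are assumptions for illustration.

```python
import random
from collections import Counter
from statistics import stdev

def bootstrap_sd(sequences, pair, n_boot=100, seed=0):
    """Estimate the spread of one dipeptide frequency (per mille) by
    resampling whole proteins with replacement (hypothetical helper)."""
    rng = random.Random(seed)
    # Count dipeptides once per protein so each resample only sums counters.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in sequences
    ]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Toy input, not real proteome data.
toy = ["KKKK", "AAAA", "AKAK", "KKAA"]
print(bootstrap_sd(toy, "KK"))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins in the proteome.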

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski