Amino acid dipeptide frequency for Idiomarina xiamenensis 10-D-4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
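The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names, the one-letter alphabet constant, and the choice to count overlapping residue pairs within each protein are assumptions, as is the comparison against the uniform 2.5‰ expectation.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code (assumed alphabet; "X" would be Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1000 / 400 = 2.5 per mille, i.e. 0.25 %.
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(STANDARD, repeat=2)}


def over_or_under(freq_per_mille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "equal"
```

With this sketch, `over_or_under(11.606)` classifies the AlaAla value from the table as overrepresented, while `over_or_under(0.192)` classifies CysCys as underrepresented.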

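All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. For example, the AlaAla value of 11.606‰ corresponds to a fraction of 0.011606.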

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 11.606 ± 0.16
AlaCys: 1.084 ± 0.037
AlaAsp: 6.686 ± 0.092
AlaGlu: 6.848 ± 0.106
AlaPhe: 3.322 ± 0.066
AlaGly: 7.123 ± 0.106
AlaHis: 1.806 ± 0.052
AlaIle: 6.116 ± 0.087
AlaLys: 4.005 ± 0.077
AlaLeu: 11.68 ± 0.122
AlaMet: 2.754 ± 0.061
AlaAsn: 3.753 ± 0.076
AlaPro: 3.334 ± 0.062
AlaGln: 5.828 ± 0.09
AlaArg: 5.204 ± 0.084
AlaSer: 6.096 ± 0.105
AlaThr: 4.766 ± 0.074
AlaVal: 7.066 ± 0.095
AlaTrp: 1.232 ± 0.041
AlaTyr: 2.723 ± 0.06
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.862 ± 0.033
CysCys: 0.192 ± 0.014
CysAsp: 0.537 ± 0.026
CysGlu: 0.51 ± 0.025
CysPhe: 0.384 ± 0.022
CysGly: 0.774 ± 0.032
CysHis: 0.314 ± 0.02
CysIle: 0.41 ± 0.021
CysLys: 0.254 ± 0.016
CysLeu: 0.955 ± 0.034
CysMet: 0.183 ± 0.013
CysAsn: 0.243 ± 0.019
CysPro: 0.402 ± 0.022
CysGln: 0.689 ± 0.032
CysArg: 0.576 ± 0.027
CysSer: 0.605 ± 0.028
CysThr: 0.317 ± 0.021
CysVal: 0.551 ± 0.029
CysTrp: 0.155 ± 0.013
CysTyr: 0.329 ± 0.02
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.671 ± 0.09
AspCys: 0.558 ± 0.026
AspAsp: 4.186 ± 0.084
AspGlu: 3.852 ± 0.077
AspPhe: 2.372 ± 0.057
AspGly: 4.404 ± 0.084
AspHis: 1.082 ± 0.037
AspIle: 3.893 ± 0.065
AspLys: 2.711 ± 0.056
AspLeu: 4.841 ± 0.079
AspMet: 1.522 ± 0.043
AspAsn: 2.76 ± 0.056
AspPro: 1.994 ± 0.056
AspGln: 2.276 ± 0.047
AspArg: 2.497 ± 0.049
AspSer: 3.748 ± 0.078
AspThr: 2.52 ± 0.051
AspVal: 4.148 ± 0.075
AspTrp: 1.101 ± 0.038
AspTyr: 2.388 ± 0.054
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.249 ± 0.084
GluCys: 0.319 ± 0.021
GluAsp: 2.436 ± 0.063
GluGlu: 2.584 ± 0.075
GluPhe: 2.065 ± 0.055
GluGly: 2.873 ± 0.054
GluHis: 1.605 ± 0.044
GluIle: 2.711 ± 0.066
GluLys: 2.409 ± 0.064
GluLeu: 6.612 ± 0.099
GluMet: 1.239 ± 0.038
GluAsn: 1.887 ± 0.052
GluPro: 2.089 ± 0.048
GluGln: 5.698 ± 0.102
GluArg: 4.226 ± 0.08
GluSer: 2.758 ± 0.053
GluThr: 2.466 ± 0.052
GluVal: 3.86 ± 0.063
GluTrp: 0.605 ± 0.025
GluTyr: 1.465 ± 0.046
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.931 ± 0.08
PheCys: 0.412 ± 0.022
PheAsp: 2.549 ± 0.059
PheGlu: 2.012 ± 0.053
PhePhe: 1.536 ± 0.041
PheGly: 2.685 ± 0.065
PheHis: 0.741 ± 0.03
PheIle: 2.329 ± 0.056
PheLys: 1.386 ± 0.045
PheLeu: 2.854 ± 0.072
PheMet: 0.864 ± 0.035
PheAsn: 1.634 ± 0.044
PhePro: 1.213 ± 0.035
PheGln: 1.346 ± 0.043
PheArg: 1.775 ± 0.042
PheSer: 3.038 ± 0.07
PheThr: 1.895 ± 0.042
PheVal: 2.498 ± 0.06
PheTrp: 0.52 ± 0.023
PheTyr: 1.309 ± 0.042
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.728 ± 0.093
GlyCys: 0.748 ± 0.035
GlyAsp: 4.095 ± 0.075
GlyGlu: 4.285 ± 0.077
GlyPhe: 2.97 ± 0.069
GlyGly: 4.834 ± 0.093
GlyHis: 1.616 ± 0.051
GlyIle: 4.005 ± 0.087
GlyLys: 3.009 ± 0.072
GlyLeu: 6.954 ± 0.108
GlyMet: 1.916 ± 0.052
GlyAsn: 2.362 ± 0.061
GlyPro: 1.694 ± 0.048
GlyGln: 3.627 ± 0.07
GlyArg: 3.796 ± 0.07
GlySer: 4.071 ± 0.084
GlyThr: 2.86 ± 0.07
GlyVal: 5.174 ± 0.075
GlyTrp: 1.037 ± 0.033
GlyTyr: 2.584 ± 0.057
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.9 ± 0.048
HisCys: 0.347 ± 0.019
HisAsp: 1.287 ± 0.042
HisGlu: 1.14 ± 0.043
HisPhe: 0.98 ± 0.039
HisGly: 1.739 ± 0.041
HisHis: 0.671 ± 0.028
HisIle: 1.263 ± 0.041
HisLys: 0.717 ± 0.027
HisLeu: 2.163 ± 0.051
HisMet: 0.466 ± 0.023
HisAsn: 0.709 ± 0.028
HisPro: 1.165 ± 0.035
HisGln: 1.502 ± 0.049
HisArg: 1.263 ± 0.045
HisSer: 1.413 ± 0.04
HisThr: 0.893 ± 0.031
HisVal: 1.308 ± 0.037
HisTrp: 0.471 ± 0.022
HisTyr: 1.02 ± 0.035
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.391 ± 0.092
IleCys: 0.55 ± 0.025
IleAsp: 4.268 ± 0.067
IleGlu: 3.77 ± 0.069
IlePhe: 1.745 ± 0.048
IleGly: 4.215 ± 0.081
IleHis: 1.039 ± 0.035
IleIle: 2.848 ± 0.061
IleLys: 2.268 ± 0.047
IleLeu: 4.207 ± 0.08
IleMet: 1.098 ± 0.038
IleAsn: 2.452 ± 0.056
IlePro: 2.121 ± 0.049
IleGln: 2.014 ± 0.049
IleArg: 2.816 ± 0.059
IleSer: 3.602 ± 0.068
IleThr: 2.785 ± 0.066
IleVal: 3.553 ± 0.079
IleTrp: 0.56 ± 0.027
IleTyr: 1.522 ± 0.041
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.065 ± 0.083
LysCys: 0.183 ± 0.015
LysAsp: 1.734 ± 0.051
LysGlu: 1.714 ± 0.049
LysPhe: 1.052 ± 0.035
LysGly: 2.219 ± 0.056
LysHis: 1.0 ± 0.033
LysIle: 1.695 ± 0.046
LysLys: 1.782 ± 0.051
LysLeu: 3.923 ± 0.077
LysMet: 0.83 ± 0.033
LysAsn: 1.263 ± 0.043
LysPro: 2.052 ± 0.048
LysGln: 2.771 ± 0.063
LysArg: 2.834 ± 0.064
LysSer: 2.138 ± 0.058
LysThr: 1.992 ± 0.048
LysVal: 2.693 ± 0.054
LysTrp: 0.397 ± 0.02
LysTyr: 0.884 ± 0.038
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 12.829 ± 0.155
LeuCys: 0.992 ± 0.036
LeuAsp: 5.629 ± 0.08
LeuGlu: 4.813 ± 0.09
LeuPhe: 3.771 ± 0.077
LeuGly: 6.709 ± 0.112
LeuHis: 2.214 ± 0.059
LeuIle: 5.238 ± 0.082
LeuLys: 4.051 ± 0.069
LeuLeu: 13.312 ± 0.198
LeuMet: 2.553 ± 0.052
LeuAsn: 4.019 ± 0.068
LeuPro: 5.277 ± 0.072
LeuGln: 6.736 ± 0.124
LeuArg: 6.092 ± 0.094
LeuSer: 8.068 ± 0.099
LeuThr: 6.59 ± 0.097
LeuVal: 7.024 ± 0.114
LeuTrp: 1.308 ± 0.041
LeuTyr: 2.671 ± 0.057
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.694 ± 0.059
MetCys: 0.135 ± 0.012
MetAsp: 1.019 ± 0.032
MetGlu: 0.907 ± 0.038
MetPhe: 0.666 ± 0.028
MetGly: 1.487 ± 0.039
MetHis: 0.472 ± 0.022
MetIle: 1.07 ± 0.038
MetLys: 1.042 ± 0.034
MetLeu: 2.656 ± 0.059
MetMet: 0.654 ± 0.03
MetAsn: 0.885 ± 0.03
MetPro: 1.221 ± 0.039
MetGln: 1.472 ± 0.039
MetArg: 1.402 ± 0.043
MetSer: 1.86 ± 0.049
MetThr: 1.506 ± 0.041
MetVal: 1.685 ± 0.049
MetTrp: 0.204 ± 0.017
MetTyr: 0.479 ± 0.021
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.524 ± 0.073
AsnCys: 0.323 ± 0.022
AsnAsp: 2.496 ± 0.057
AsnGlu: 1.917 ± 0.046
AsnPhe: 1.288 ± 0.041
AsnGly: 2.859 ± 0.077
AsnHis: 0.781 ± 0.024
AsnIle: 1.992 ± 0.053
AsnLys: 1.317 ± 0.046
AsnLeu: 3.379 ± 0.062
AsnMet: 0.796 ± 0.028
AsnAsn: 1.596 ± 0.05
AsnPro: 1.721 ± 0.046
AsnGln: 1.921 ± 0.053
AsnArg: 2.035 ± 0.049
AsnSer: 2.263 ± 0.057
AsnThr: 1.66 ± 0.039
AsnVal: 2.312 ± 0.056
AsnTrp: 0.57 ± 0.027
AsnTyr: 1.188 ± 0.043
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.096 ± 0.078
ProCys: 0.28 ± 0.015
ProAsp: 2.475 ± 0.053
ProGlu: 2.775 ± 0.061
ProPhe: 1.53 ± 0.042
ProGly: 2.524 ± 0.059
ProHis: 0.794 ± 0.031
ProIle: 2.105 ± 0.056
ProLys: 1.439 ± 0.045
ProLeu: 4.856 ± 0.079
ProMet: 0.95 ± 0.032
ProAsn: 1.425 ± 0.041
ProPro: 1.333 ± 0.042
ProGln: 2.548 ± 0.056
ProArg: 1.821 ± 0.049
ProSer: 2.467 ± 0.06
ProThr: 1.998 ± 0.047
ProVal: 2.914 ± 0.059
ProTrp: 0.567 ± 0.026
ProTyr: 1.228 ± 0.039
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 7.077 ± 0.121
GlnCys: 0.421 ± 0.024
GlnAsp: 2.22 ± 0.053
GlnGlu: 2.155 ± 0.051
GlnPhe: 2.016 ± 0.041
GlnGly: 3.725 ± 0.071
GlnHis: 1.941 ± 0.057
GlnIle: 2.433 ± 0.043
GlnLys: 1.465 ± 0.047
GlnLeu: 8.683 ± 0.149
GlnMet: 1.144 ± 0.036
GlnAsn: 1.3 ± 0.04
GlnPro: 3.062 ± 0.071
GlnGln: 8.983 ± 0.218
GlnArg: 5.391 ± 0.104
GlnSer: 3.163 ± 0.061
GlnThr: 2.631 ± 0.058
GlnVal: 4.276 ± 0.085
GlnTrp: 1.054 ± 0.043
GlnTyr: 1.558 ± 0.045
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.815 ± 0.08
ArgCys: 0.577 ± 0.024
ArgAsp: 3.329 ± 0.069
ArgGlu: 3.599 ± 0.073
ArgPhe: 2.583 ± 0.059
ArgGly: 3.323 ± 0.06
ArgHis: 1.466 ± 0.045
ArgIle: 3.46 ± 0.055
ArgLys: 1.937 ± 0.048
ArgLeu: 7.117 ± 0.116
ArgMet: 1.391 ± 0.036
ArgAsn: 1.813 ± 0.037
ArgPro: 2.022 ± 0.048
ArgGln: 4.43 ± 0.091
ArgArg: 3.67 ± 0.073
ArgSer: 2.958 ± 0.068
ArgThr: 2.139 ± 0.049
ArgVal: 4.065 ± 0.065
ArgTrp: 1.152 ± 0.036
ArgTyr: 2.336 ± 0.066
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.576 ± 0.095
SerCys: 0.597 ± 0.029
SerAsp: 4.034 ± 0.079
SerGlu: 3.635 ± 0.064
SerPhe: 2.336 ± 0.054
SerGly: 4.878 ± 0.073
SerHis: 1.461 ± 0.04
SerIle: 3.206 ± 0.06
SerLys: 2.081 ± 0.05
SerLeu: 6.832 ± 0.11
SerMet: 1.503 ± 0.037
SerAsn: 2.252 ± 0.056
SerPro: 2.392 ± 0.059
SerGln: 3.325 ± 0.065
SerArg: 3.43 ± 0.067
SerSer: 4.11 ± 0.09
SerThr: 2.882 ± 0.062
SerVal: 4.339 ± 0.078
SerTrp: 0.94 ± 0.033
SerTyr: 2.056 ± 0.064
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.054 ± 0.082
ThrCys: 0.375 ± 0.022
ThrAsp: 2.931 ± 0.062
ThrGlu: 2.751 ± 0.062
ThrPhe: 1.606 ± 0.047
ThrGly: 3.42 ± 0.074
ThrHis: 0.993 ± 0.035
ThrIle: 2.777 ± 0.056
ThrLys: 1.29 ± 0.045
ThrLeu: 6.065 ± 0.088
ThrMet: 0.91 ± 0.033
ThrAsn: 1.47 ± 0.038
ThrPro: 2.46 ± 0.053
ThrGln: 2.51 ± 0.058
ThrArg: 2.544 ± 0.058
ThrSer: 2.821 ± 0.053
ThrThr: 2.541 ± 0.058
ThrVal: 3.675 ± 0.076
ThrTrp: 0.548 ± 0.025
ThrTyr: 1.276 ± 0.045
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.463 ± 0.107
ValCys: 0.611 ± 0.027
ValAsp: 4.393 ± 0.075
ValGlu: 4.059 ± 0.067
ValPhe: 2.45 ± 0.051
ValGly: 4.66 ± 0.081
ValHis: 1.224 ± 0.035
ValIle: 4.369 ± 0.073
ValLys: 2.97 ± 0.054
ValLeu: 7.0 ± 0.098
ValMet: 1.851 ± 0.049
ValAsn: 2.825 ± 0.063
ValPro: 2.607 ± 0.052
ValGln: 2.565 ± 0.052
ValArg: 3.422 ± 0.056
ValSer: 5.016 ± 0.084
ValThr: 3.889 ± 0.063
ValVal: 5.511 ± 0.098
ValTrp: 0.721 ± 0.03
ValTyr: 1.742 ± 0.043
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.812 ± 0.035
TrpCys: 0.151 ± 0.013
TrpAsp: 0.529 ± 0.026
TrpGlu: 0.392 ± 0.022
TrpPhe: 0.632 ± 0.027
TrpGly: 0.707 ± 0.029
TrpHis: 0.434 ± 0.021
TrpIle: 0.544 ± 0.026
TrpLys: 0.247 ± 0.018
TrpLeu: 2.5 ± 0.066
TrpMet: 0.315 ± 0.02
TrpAsn: 0.318 ± 0.018
TrpPro: 0.615 ± 0.025
TrpGln: 1.885 ± 0.063
TrpArg: 1.159 ± 0.041
TrpSer: 0.802 ± 0.029
TrpThr: 0.42 ± 0.025
TrpVal: 0.89 ± 0.036
TrpTrp: 0.263 ± 0.018
TrpTyr: 0.369 ± 0.022
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.689 ± 0.06
TyrCys: 0.385 ± 0.024
TyrAsp: 1.687 ± 0.054
TyrGlu: 1.438 ± 0.049
TyrPhe: 1.221 ± 0.036
TyrGly: 2.211 ± 0.062
TyrHis: 0.762 ± 0.029
TyrIle: 1.359 ± 0.039
TyrLys: 0.839 ± 0.031
TyrLeu: 3.305 ± 0.065
TyrMet: 0.577 ± 0.029
TyrAsn: 0.975 ± 0.039
TyrPro: 1.353 ± 0.047
TyrGln: 2.573 ± 0.058
TyrArg: 2.232 ± 0.058
TyrSer: 1.869 ± 0.049
TyrThr: 1.267 ± 0.041
TyrVal: 1.808 ± 0.049
TyrTrp: 0.578 ± 0.024
TyrTyr: 0.987 ± 0.034
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2673 proteins (878315 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
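As a rough illustration of what protein-level bootstrapping means here, the sketch below resamples whole proteins with replacement and recomputes one dipeptide frequency for each resample. It is not the Proteome-pI implementation: the function names, the fixed seed, the one-letter sequence input, and the use of the standard deviation of the resampled estimates as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, spread) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together (hypothetical reading of the note).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across resamples.
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins instead of treating every dipeptide as an independent observation.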

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski