Amino acid dipeptide frequency for Prosthecochloris sp. GSB1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
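As an illustration of the idea, here is a minimal sketch of how dipeptide frequencies could be counted. It assumes one-letter amino acid codes, overlapping windows of two residues, and pairs that never span two proteins; the function name and the toy sequences are hypothetical and are not taken from the Proteome-pI pipeline.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs are counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not from this proteome).
freqs = dipeptide_permille(["ALAL", "LLA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Note that the pair is ordered: AL and LA are counted separately, just as AlaLeu and LeuAla are separate cells in the table.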

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
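The uniform expectation can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function names are hypothetical, and whether the page colours cells against the 400-dipeptide baseline (2.5‰) or the 441-dipeptide one is an assumption, since the text does not say.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_permille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"   # shown in red on the page
    if observed_permille < expected:
        return "under"  # shown in blue on the page
    return "as expected"


exp20 = expected_permille(20)  # 20 standard amino acids: 1000 / 400
exp21 = expected_permille(21)  # with Xaa as a 21st residue: 1000 / 441

# Two values from the table: AlaAla = 9.27, CysCys = 0.204 (both in per mille).
print(exp20, round(exp21, 3))
print(classify(9.27, exp20), classify(0.204, exp20))
```

This baseline ignores the single amino acid composition of the proteome, so abundant residues such as Leu and Ala tend to appear overrepresented in most of their pairs.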

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is the frequency in ‰ ± its estimated error.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 9.27 ± 0.154 | 1.195 ± 0.045 | 4.669 ± 0.086 | 6.462 ± 0.101 | 3.997 ± 0.08 | 8.4 ± 0.135 | 1.477 ± 0.045 | 4.964 ± 0.098 | 3.53 ± 0.071 | 9.595 ± 0.132 | 2.525 ± 0.063 | 2.293 ± 0.063 | 3.095 ± 0.075 | 1.909 ± 0.048 | 5.61 ± 0.089 | 5.667 ± 0.094 | 3.639 ± 0.075 | 7.29 ± 0.115 | 0.949 ± 0.038 | 2.264 ± 0.057 | 0.0 ± 0.0 |
| Cys | 0.884 ± 0.035 | 0.204 ± 0.019 | 0.689 ± 0.034 | 0.625 ± 0.035 | 0.521 ± 0.026 | 1.252 ± 0.041 | 0.296 ± 0.026 | 0.663 ± 0.03 | 0.392 ± 0.021 | 0.903 ± 0.031 | 0.274 ± 0.02 | 0.409 ± 0.023 | 0.636 ± 0.033 | 0.258 ± 0.019 | 0.926 ± 0.038 | 0.87 ± 0.035 | 0.511 ± 0.026 | 0.721 ± 0.036 | 0.103 ± 0.011 | 0.324 ± 0.022 | 0.0 ± 0.0 |
| Asp | 4.879 ± 0.087 | 0.584 ± 0.033 | 3.05 ± 0.077 | 4.08 ± 0.092 | 2.689 ± 0.058 | 4.341 ± 0.09 | 0.988 ± 0.04 | 4.28 ± 0.075 | 2.062 ± 0.063 | 5.043 ± 0.09 | 1.469 ± 0.045 | 1.594 ± 0.048 | 2.693 ± 0.063 | 1.201 ± 0.045 | 3.948 ± 0.085 | 3.052 ± 0.075 | 2.519 ± 0.063 | 3.785 ± 0.081 | 0.623 ± 0.031 | 1.962 ± 0.053 | 0.0 ± 0.0 |
| Glu | 6.244 ± 0.108 | 0.628 ± 0.03 | 2.94 ± 0.068 | 5.279 ± 0.097 | 2.417 ± 0.056 | 4.629 ± 0.092 | 1.597 ± 0.054 | 4.493 ± 0.092 | 5.406 ± 0.103 | 6.841 ± 0.112 | 1.876 ± 0.053 | 2.959 ± 0.07 | 2.649 ± 0.064 | 2.421 ± 0.068 | 5.128 ± 0.097 | 3.91 ± 0.084 | 4.088 ± 0.081 | 4.147 ± 0.086 | 0.721 ± 0.035 | 1.901 ± 0.053 | 0.0 ± 0.0 |
| Phe | 3.709 ± 0.079 | 0.585 ± 0.029 | 2.858 ± 0.058 | 2.594 ± 0.057 | 2.386 ± 0.078 | 3.774 ± 0.08 | 0.878 ± 0.034 | 2.475 ± 0.051 | 1.234 ± 0.049 | 4.526 ± 0.09 | 1.089 ± 0.04 | 1.413 ± 0.045 | 1.834 ± 0.054 | 1.053 ± 0.042 | 3.331 ± 0.073 | 3.806 ± 0.083 | 2.147 ± 0.057 | 2.984 ± 0.064 | 0.542 ± 0.025 | 1.379 ± 0.052 | 0.0 ± 0.0 |
| Gly | 6.526 ± 0.108 | 1.084 ± 0.042 | 4.098 ± 0.084 | 5.584 ± 0.096 | 3.967 ± 0.084 | 6.484 ± 0.12 | 1.595 ± 0.056 | 5.554 ± 0.109 | 4.812 ± 0.077 | 7.544 ± 0.099 | 2.44 ± 0.059 | 2.66 ± 0.062 | 2.325 ± 0.062 | 1.986 ± 0.057 | 5.353 ± 0.094 | 5.025 ± 0.099 | 3.984 ± 0.073 | 5.702 ± 0.105 | 0.923 ± 0.035 | 2.745 ± 0.071 | 0.0 ± 0.0 |
| His | 1.808 ± 0.051 | 0.279 ± 0.019 | 1.245 ± 0.043 | 1.395 ± 0.053 | 0.944 ± 0.032 | 1.711 ± 0.049 | 0.556 ± 0.031 | 1.226 ± 0.046 | 0.681 ± 0.032 | 1.984 ± 0.057 | 0.422 ± 0.023 | 0.655 ± 0.033 | 1.234 ± 0.043 | 0.511 ± 0.026 | 1.328 ± 0.045 | 1.141 ± 0.041 | 0.974 ± 0.039 | 1.402 ± 0.048 | 0.228 ± 0.022 | 0.738 ± 0.029 | 0.0 ± 0.0 |
| Ile | 6.142 ± 0.111 | 0.661 ± 0.032 | 4.062 ± 0.075 | 4.597 ± 0.078 | 2.273 ± 0.057 | 5.012 ± 0.094 | 1.162 ± 0.035 | 3.235 ± 0.08 | 1.793 ± 0.052 | 5.272 ± 0.095 | 1.308 ± 0.041 | 1.837 ± 0.05 | 2.707 ± 0.059 | 1.298 ± 0.043 | 4.25 ± 0.088 | 3.949 ± 0.072 | 2.997 ± 0.058 | 5.042 ± 0.097 | 0.463 ± 0.027 | 1.477 ± 0.042 | 0.0 ± 0.0 |
| Lys | 4.327 ± 0.087 | 0.309 ± 0.022 | 2.208 ± 0.062 | 3.356 ± 0.08 | 1.327 ± 0.042 | 3.549 ± 0.07 | 0.956 ± 0.036 | 3.064 ± 0.072 | 3.867 ± 0.09 | 4.304 ± 0.079 | 1.142 ± 0.042 | 2.204 ± 0.057 | 2.282 ± 0.059 | 1.58 ± 0.051 | 2.899 ± 0.068 | 2.868 ± 0.069 | 3.052 ± 0.07 | 3.016 ± 0.079 | 0.404 ± 0.024 | 1.284 ± 0.038 | 0.0 ± 0.0 |
| Leu | 9.279 ± 0.134 | 1.156 ± 0.04 | 6.239 ± 0.094 | 6.958 ± 0.109 | 4.907 ± 0.104 | 7.297 ± 0.105 | 2.158 ± 0.056 | 4.557 ± 0.096 | 4.875 ± 0.086 | 10.987 ± 0.177 | 2.246 ± 0.06 | 2.942 ± 0.067 | 4.707 ± 0.09 | 3.046 ± 0.073 | 6.731 ± 0.11 | 6.894 ± 0.104 | 4.505 ± 0.081 | 7.467 ± 0.105 | 0.921 ± 0.033 | 2.72 ± 0.058 | 0.0 ± 0.0 |
| Met | 2.137 ± 0.051 | 0.176 ± 0.016 | 1.167 ± 0.038 | 1.597 ± 0.047 | 0.921 ± 0.033 | 1.663 ± 0.052 | 0.578 ± 0.033 | 1.497 ± 0.044 | 1.857 ± 0.047 | 2.878 ± 0.066 | 0.68 ± 0.033 | 1.166 ± 0.041 | 1.348 ± 0.046 | 0.971 ± 0.039 | 1.687 ± 0.046 | 1.576 ± 0.049 | 1.462 ± 0.045 | 1.676 ± 0.055 | 0.161 ± 0.015 | 0.445 ± 0.025 | 0.0 ± 0.0 |
| Asn | 3.111 ± 0.062 | 0.421 ± 0.025 | 1.79 ± 0.058 | 1.886 ± 0.053 | 1.272 ± 0.046 | 2.753 ± 0.061 | 0.65 ± 0.027 | 2.418 ± 0.064 | 1.046 ± 0.046 | 3.185 ± 0.078 | 0.768 ± 0.035 | 1.077 ± 0.041 | 2.086 ± 0.055 | 0.763 ± 0.034 | 2.608 ± 0.063 | 1.656 ± 0.051 | 1.641 ± 0.055 | 2.392 ± 0.061 | 0.411 ± 0.024 | 1.085 ± 0.044 | 0.0 ± 0.0 |
| Pro | 3.96 ± 0.083 | 0.463 ± 0.024 | 3.059 ± 0.07 | 4.586 ± 0.091 | 2.086 ± 0.057 | 3.976 ± 0.077 | 0.828 ± 0.031 | 1.519 ± 0.05 | 1.579 ± 0.053 | 4.372 ± 0.09 | 0.962 ± 0.038 | 1.026 ± 0.038 | 1.737 ± 0.055 | 1.059 ± 0.044 | 2.027 ± 0.055 | 2.77 ± 0.07 | 1.523 ± 0.049 | 4.166 ± 0.085 | 0.495 ± 0.031 | 1.223 ± 0.04 | 0.0 ± 0.0 |
| Gln | 2.661 ± 0.056 | 0.247 ± 0.02 | 1.359 ± 0.044 | 1.916 ± 0.056 | 0.955 ± 0.038 | 1.898 ± 0.043 | 0.575 ± 0.029 | 1.495 ± 0.041 | 1.733 ± 0.055 | 2.543 ± 0.066 | 0.73 ± 0.033 | 1.006 ± 0.041 | 1.165 ± 0.037 | 1.145 ± 0.048 | 1.75 ± 0.056 | 1.47 ± 0.048 | 1.337 ± 0.046 | 1.795 ± 0.054 | 0.309 ± 0.022 | 0.791 ± 0.034 | 0.0 ± 0.0 |
| Arg | 4.64 ± 0.088 | 0.667 ± 0.028 | 3.587 ± 0.075 | 5.575 ± 0.098 | 3.384 ± 0.073 | 3.96 ± 0.074 | 1.577 ± 0.048 | 4.53 ± 0.077 | 4.444 ± 0.078 | 6.794 ± 0.109 | 1.869 ± 0.06 | 2.828 ± 0.069 | 2.532 ± 0.057 | 2.343 ± 0.055 | 4.49 ± 0.109 | 4.108 ± 0.094 | 3.282 ± 0.071 | 3.917 ± 0.079 | 0.682 ± 0.034 | 2.358 ± 0.062 | 0.0 ± 0.0 |
| Ser | 5.507 ± 0.079 | 0.873 ± 0.036 | 3.071 ± 0.065 | 3.778 ± 0.079 | 3.035 ± 0.078 | 6.523 ± 0.088 | 1.224 ± 0.043 | 3.791 ± 0.085 | 2.361 ± 0.063 | 6.663 ± 0.097 | 1.843 ± 0.054 | 1.741 ± 0.055 | 2.914 ± 0.068 | 1.431 ± 0.046 | 4.526 ± 0.089 | 4.241 ± 0.096 | 2.857 ± 0.072 | 4.687 ± 0.088 | 0.777 ± 0.037 | 1.595 ± 0.051 | 0.0 ± 0.0 |
| Thr | 4.621 ± 0.089 | 0.502 ± 0.03 | 2.471 ± 0.063 | 2.8 ± 0.077 | 2.047 ± 0.052 | 5.253 ± 0.095 | 0.888 ± 0.032 | 3.177 ± 0.066 | 1.706 ± 0.049 | 5.175 ± 0.085 | 1.16 ± 0.044 | 1.305 ± 0.043 | 2.45 ± 0.064 | 1.013 ± 0.039 | 2.679 ± 0.058 | 2.725 ± 0.06 | 2.442 ± 0.063 | 4.523 ± 0.079 | 0.527 ± 0.027 | 1.391 ± 0.052 | 0.0 ± 0.0 |
| Val | 6.066 ± 0.103 | 0.923 ± 0.039 | 3.98 ± 0.076 | 4.847 ± 0.09 | 3.599 ± 0.083 | 4.682 ± 0.099 | 1.531 ± 0.047 | 4.49 ± 0.081 | 2.893 ± 0.071 | 7.611 ± 0.117 | 1.895 ± 0.059 | 2.451 ± 0.055 | 3.211 ± 0.072 | 1.811 ± 0.059 | 5.042 ± 0.084 | 5.219 ± 0.092 | 4.024 ± 0.084 | 5.522 ± 0.096 | 0.737 ± 0.036 | 2.084 ± 0.052 | 0.0 ± 0.0 |
| Trp | 0.73 ± 0.034 | 0.113 ± 0.013 | 0.534 ± 0.028 | 0.585 ± 0.03 | 0.528 ± 0.028 | 0.705 ± 0.034 | 0.263 ± 0.018 | 0.643 ± 0.03 | 0.7 ± 0.03 | 1.233 ± 0.048 | 0.336 ± 0.021 | 0.441 ± 0.026 | 0.386 ± 0.025 | 0.36 ± 0.024 | 0.713 ± 0.032 | 0.61 ± 0.028 | 0.497 ± 0.027 | 0.602 ± 0.033 | 0.158 ± 0.016 | 0.342 ± 0.021 | 0.0 ± 0.0 |
| Tyr | 2.3 ± 0.056 | 0.392 ± 0.024 | 1.765 ± 0.051 | 1.772 ± 0.052 | 1.345 ± 0.042 | 2.528 ± 0.064 | 0.684 ± 0.031 | 1.467 ± 0.047 | 1.028 ± 0.035 | 3.136 ± 0.066 | 0.607 ± 0.028 | 1.016 ± 0.036 | 1.423 ± 0.042 | 0.72 ± 0.029 | 2.543 ± 0.063 | 1.882 ± 0.06 | 1.37 ± 0.045 | 1.784 ± 0.055 | 0.345 ± 0.021 | 1.019 ± 0.046 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2269 proteins (719611 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
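Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. A minimal sketch of that idea follows; it assumes that the reported ± value is the standard deviation across the resampled proteomes, which the page does not state. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample proteins with replacement and summarise the spread of the frequency."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # One bootstrap replicate: a proteome of the same size drawn with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return mean(estimates), stdev(estimates)


# Toy input (hypothetical sequences, not from this proteome).
toy = ["ALAL", "LLA", "AAAL", "LALA"]
mean_al, err_al = bootstrap_error(toy, "AL")
print(round(mean_al, 3), "±", round(err_al, 3))
```

Resampling whole proteins keeps the residue order inside each protein intact, so the spread reflects variation between proteins rather than between individual positions.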

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski