Amino acid dipeptide frequency for Microlunatus sagamiharensis

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.
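A minimal sketch of how such dipeptide frequencies could be computed is shown here. It assumes overlapping windows of two residues, counted within each protein and never across protein boundaries, and it uses one-letter residue codes instead of the three-letter codes of the table. The function name and the toy sequences are hypothetical, and the exact counting procedure used for this page is not stated, so this is an illustration rather than the database's own method.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for an iterable of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Slide an overlapping window of length 2 along each protein;
        # pairs never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # 1000 * fraction gives per mille, the unit used in the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAG", "AAL"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

By construction the frequencies of all observed dipeptides sum to 1000 ‰.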

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 1/400 = 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
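As a small worked example of the two points above, the sketch below converts per mille values to fractions and compares them with the uniform expectation of 1/400 (2.5 ‰). The three values are copied from the table; the function names are hypothetical, and treating 2.5 ‰ as the exact threshold for over- or underrepresentation is an assumption, since the page does not say how the colouring is decided.

```python
# Uniform expectation: 1/400 of all dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille = 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table, in per mille.
table_values = {"AlaAla": 20.389, "AlaLeu": 14.627, "CysCys": 0.081}
for name, value in table_values.items():
    print(name, per_mille_to_fraction(value), classify(value))
```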

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.389 ± 0.181
AlaCys: 0.948 ± 0.03
AlaAsp: 8.53 ± 0.1
AlaGlu: 7.942 ± 0.113
AlaPhe: 3.509 ± 0.067
AlaGly: 13.298 ± 0.132
AlaHis: 2.343 ± 0.049
AlaIle: 3.363 ± 0.065
AlaLys: 2.331 ± 0.059
AlaLeu: 14.627 ± 0.136
AlaMet: 2.455 ± 0.05
AlaAsn: 1.999 ± 0.052
AlaPro: 7.092 ± 0.125
AlaGln: 3.74 ± 0.048
AlaArg: 10.104 ± 0.124
AlaSer: 7.196 ± 0.114
AlaThr: 7.799 ± 0.205
AlaVal: 13.029 ± 0.138
AlaTrp: 2.01 ± 0.04
AlaTyr: 3.027 ± 0.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.79 ± 0.023
CysCys: 0.081 ± 0.008
CysAsp: 0.39 ± 0.018
CysGlu: 0.314 ± 0.014
CysPhe: 0.209 ± 0.013
CysGly: 0.777 ± 0.023
CysHis: 0.143 ± 0.011
CysIle: 0.151 ± 0.011
CysLys: 0.07 ± 0.007
CysLeu: 0.639 ± 0.022
CysMet: 0.084 ± 0.008
CysAsn: 0.105 ± 0.011
CysPro: 0.454 ± 0.041
CysGln: 0.139 ± 0.009
CysArg: 0.483 ± 0.02
CysSer: 0.389 ± 0.018
CysThr: 0.462 ± 0.047
CysVal: 0.57 ± 0.021
CysTrp: 0.112 ± 0.009
CysTyr: 0.126 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.726 ± 0.098
AspCys: 0.305 ± 0.016
AspAsp: 4.456 ± 0.077
AspGlu: 4.123 ± 0.058
AspPhe: 1.482 ± 0.035
AspGly: 6.326 ± 0.089
AspHis: 1.412 ± 0.04
AspIle: 1.363 ± 0.035
AspLys: 0.932 ± 0.033
AspLeu: 7.392 ± 0.073
AspMet: 0.622 ± 0.023
AspAsn: 0.86 ± 0.026
AspPro: 4.645 ± 0.069
AspGln: 1.797 ± 0.044
AspArg: 4.855 ± 0.076
AspSer: 2.247 ± 0.037
AspThr: 2.727 ± 0.045
AspVal: 6.32 ± 0.069
AspTrp: 0.879 ± 0.028
AspTyr: 1.12 ± 0.034
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.087 ± 0.1
GluCys: 0.226 ± 0.013
GluAsp: 2.85 ± 0.057
GluGlu: 2.902 ± 0.056
GluPhe: 1.097 ± 0.03
GluGly: 4.289 ± 0.065
GluHis: 1.533 ± 0.032
GluIle: 1.698 ± 0.038
GluLys: 0.875 ± 0.03
GluLeu: 5.985 ± 0.081
GluMet: 0.688 ± 0.025
GluAsn: 0.757 ± 0.026
GluPro: 3.416 ± 0.065
GluGln: 2.314 ± 0.049
GluArg: 5.292 ± 0.078
GluSer: 2.233 ± 0.041
GluThr: 2.763 ± 0.049
GluVal: 5.693 ± 0.089
GluTrp: 0.593 ± 0.022
GluTyr: 0.74 ± 0.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.791 ± 0.061
PheCys: 0.24 ± 0.014
PheAsp: 2.064 ± 0.047
PheGlu: 1.26 ± 0.034
PhePhe: 0.951 ± 0.035
PheGly: 3.076 ± 0.064
PheHis: 0.52 ± 0.023
PheIle: 0.626 ± 0.024
PheLys: 0.427 ± 0.022
PheLeu: 2.531 ± 0.057
PheMet: 0.335 ± 0.017
PheAsn: 0.538 ± 0.023
PhePro: 1.216 ± 0.031
PheGln: 0.58 ± 0.023
PheArg: 1.698 ± 0.036
PheSer: 1.597 ± 0.036
PheThr: 1.88 ± 0.048
PheVal: 2.797 ± 0.053
PheTrp: 0.394 ± 0.02
PheTyr: 0.596 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.102 ± 0.115
GlyCys: 0.703 ± 0.024
GlyAsp: 5.123 ± 0.07
GlyGlu: 4.686 ± 0.068
GlyPhe: 2.988 ± 0.051
GlyGly: 8.496 ± 0.119
GlyHis: 1.965 ± 0.039
GlyIle: 3.091 ± 0.054
GlyLys: 1.872 ± 0.05
GlyLeu: 10.502 ± 0.116
GlyMet: 1.736 ± 0.043
GlyAsn: 1.562 ± 0.06
GlyPro: 5.251 ± 0.069
GlyGln: 2.785 ± 0.057
GlyArg: 7.935 ± 0.095
GlySer: 6.166 ± 0.092
GlyThr: 6.445 ± 0.111
GlyVal: 8.652 ± 0.085
GlyTrp: 1.92 ± 0.041
GlyTyr: 2.167 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.431 ± 0.045
HisCys: 0.135 ± 0.01
HisAsp: 1.402 ± 0.035
HisGlu: 1.07 ± 0.025
HisPhe: 0.511 ± 0.02
HisGly: 2.013 ± 0.043
HisHis: 0.591 ± 0.022
HisIle: 0.372 ± 0.018
HisLys: 0.233 ± 0.014
HisLeu: 2.326 ± 0.052
HisMet: 0.224 ± 0.012
HisAsn: 0.305 ± 0.017
HisPro: 1.58 ± 0.043
HisGln: 0.551 ± 0.021
HisArg: 1.705 ± 0.036
HisSer: 0.771 ± 0.023
HisThr: 0.882 ± 0.027
HisVal: 1.941 ± 0.042
HisTrp: 0.297 ± 0.014
HisTyr: 0.368 ± 0.017
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.953 ± 0.06
IleCys: 0.207 ± 0.013
IleAsp: 2.012 ± 0.04
IleGlu: 1.579 ± 0.039
IlePhe: 0.754 ± 0.03
IleGly: 3.064 ± 0.067
IleHis: 0.417 ± 0.017
IleIle: 0.884 ± 0.032
IleLys: 0.637 ± 0.021
IleLeu: 2.042 ± 0.051
IleMet: 0.41 ± 0.02
IleAsn: 0.659 ± 0.024
IlePro: 1.421 ± 0.037
IleGln: 0.555 ± 0.02
IleArg: 1.622 ± 0.033
IleSer: 1.673 ± 0.037
IleThr: 1.959 ± 0.043
IleVal: 2.487 ± 0.053
IleTrp: 0.297 ± 0.015
IleTyr: 0.559 ± 0.023
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.322 ± 0.055
LysCys: 0.061 ± 0.007
LysAsp: 0.984 ± 0.032
LysGlu: 0.721 ± 0.024
LysPhe: 0.411 ± 0.019
LysGly: 1.489 ± 0.041
LysHis: 0.348 ± 0.017
LysIle: 0.735 ± 0.026
LysLys: 0.634 ± 0.034
LysLeu: 1.407 ± 0.042
LysMet: 0.303 ± 0.015
LysAsn: 0.446 ± 0.023
LysPro: 1.099 ± 0.031
LysGln: 0.601 ± 0.021
LysArg: 1.147 ± 0.032
LysSer: 0.973 ± 0.026
LysThr: 1.205 ± 0.041
LysVal: 1.834 ± 0.043
LysTrp: 0.163 ± 0.011
LysTyr: 0.338 ± 0.015
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.749 ± 0.152
LeuCys: 0.634 ± 0.023
LeuAsp: 7.414 ± 0.08
LeuGlu: 5.203 ± 0.073
LeuPhe: 2.455 ± 0.06
LeuGly: 10.176 ± 0.115
LeuHis: 2.008 ± 0.041
LeuIle: 2.416 ± 0.06
LeuLys: 1.475 ± 0.036
LeuLeu: 11.55 ± 0.181
LeuMet: 1.369 ± 0.039
LeuAsn: 1.605 ± 0.038
LeuPro: 5.944 ± 0.076
LeuGln: 2.458 ± 0.046
LeuArg: 8.134 ± 0.107
LeuSer: 5.339 ± 0.069
LeuThr: 6.484 ± 0.081
LeuVal: 12.13 ± 0.143
LeuTrp: 1.223 ± 0.035
LeuTyr: 1.632 ± 0.038
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.993 ± 0.044
MetCys: 0.09 ± 0.009
MetAsp: 0.736 ± 0.024
MetGlu: 0.549 ± 0.021
MetPhe: 0.4 ± 0.017
MetGly: 1.141 ± 0.033
MetHis: 0.278 ± 0.015
MetIle: 0.583 ± 0.025
MetLys: 0.303 ± 0.015
MetLeu: 1.51 ± 0.036
MetMet: 0.234 ± 0.014
MetAsn: 0.338 ± 0.017
MetPro: 1.056 ± 0.033
MetGln: 0.395 ± 0.018
MetArg: 1.169 ± 0.03
MetSer: 1.387 ± 0.033
MetThr: 1.526 ± 0.035
MetVal: 1.353 ± 0.037
MetTrp: 0.162 ± 0.011
MetTyr: 0.222 ± 0.015
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.159 ± 0.045
AsnCys: 0.104 ± 0.008
AsnAsp: 0.912 ± 0.029
AsnGlu: 0.707 ± 0.023
AsnPhe: 0.511 ± 0.018
AsnGly: 1.647 ± 0.038
AsnHis: 0.338 ± 0.015
AsnIle: 0.53 ± 0.019
AsnLys: 0.354 ± 0.02
AsnLeu: 1.773 ± 0.038
AsnMet: 0.216 ± 0.014
AsnAsn: 0.418 ± 0.021
AsnPro: 1.354 ± 0.034
AsnGln: 0.474 ± 0.022
AsnArg: 1.21 ± 0.029
AsnSer: 0.732 ± 0.026
AsnThr: 1.005 ± 0.072
AsnVal: 1.533 ± 0.043
AsnTrp: 0.221 ± 0.013
AsnTyr: 0.392 ± 0.017
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.232 ± 0.113
ProCys: 0.259 ± 0.017
ProAsp: 4.587 ± 0.065
ProGlu: 3.856 ± 0.063
ProPhe: 1.514 ± 0.04
ProGly: 6.539 ± 0.091
ProHis: 1.082 ± 0.031
ProIle: 1.159 ± 0.032
ProLys: 0.939 ± 0.029
ProLeu: 5.143 ± 0.08
ProMet: 0.867 ± 0.027
ProAsn: 0.871 ± 0.028
ProPro: 2.945 ± 0.054
ProGln: 1.609 ± 0.04
ProArg: 3.828 ± 0.058
ProSer: 3.449 ± 0.055
ProThr: 3.916 ± 0.071
ProVal: 5.775 ± 0.069
ProTrp: 0.902 ± 0.031
ProTyr: 1.293 ± 0.037
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.891 ± 0.074
GlnCys: 0.123 ± 0.009
GlnAsp: 1.307 ± 0.034
GlnGlu: 1.174 ± 0.028
GlnPhe: 0.644 ± 0.025
GlnGly: 2.369 ± 0.05
GlnHis: 0.637 ± 0.022
GlnIle: 0.933 ± 0.027
GlnLys: 0.51 ± 0.022
GlnLeu: 2.985 ± 0.052
GlnMet: 0.406 ± 0.019
GlnAsn: 0.453 ± 0.018
GlnPro: 1.743 ± 0.046
GlnGln: 1.197 ± 0.041
GlnArg: 2.409 ± 0.045
GlnSer: 1.254 ± 0.033
GlnThr: 1.673 ± 0.045
GlnVal: 3.089 ± 0.055
GlnTrp: 0.361 ± 0.017
GlnTyr: 0.493 ± 0.021
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.48 ± 0.095
ArgCys: 0.52 ± 0.022
ArgAsp: 4.265 ± 0.067
ArgGlu: 4.106 ± 0.064
ArgPhe: 2.317 ± 0.048
ArgGly: 5.99 ± 0.085
ArgHis: 1.64 ± 0.035
ArgIle: 2.291 ± 0.046
ArgLys: 1.183 ± 0.03
ArgLeu: 8.686 ± 0.112
ArgMet: 1.556 ± 0.034
ArgAsn: 1.14 ± 0.033
ArgPro: 4.817 ± 0.074
ArgGln: 2.205 ± 0.042
ArgArg: 7.936 ± 0.115
ArgSer: 4.547 ± 0.071
ArgThr: 5.232 ± 0.076
ArgVal: 6.905 ± 0.088
ArgTrp: 1.514 ± 0.037
ArgTyr: 1.578 ± 0.036
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.09 ± 0.109
SerCys: 0.351 ± 0.018
SerAsp: 3.024 ± 0.045
SerGlu: 2.563 ± 0.04
SerPhe: 1.682 ± 0.04
SerGly: 6.084 ± 0.087
SerHis: 0.907 ± 0.026
SerIle: 1.708 ± 0.04
SerLys: 1.037 ± 0.031
SerLeu: 5.066 ± 0.076
SerMet: 1.182 ± 0.037
SerAsn: 0.945 ± 0.031
SerPro: 3.25 ± 0.057
SerGln: 1.326 ± 0.035
SerArg: 3.888 ± 0.056
SerSer: 3.758 ± 0.078
SerThr: 4.065 ± 0.079
SerVal: 4.948 ± 0.087
SerTrp: 0.926 ± 0.025
SerTyr: 1.203 ± 0.032
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.142 ± 0.225
ThrCys: 0.456 ± 0.072
ThrAsp: 3.689 ± 0.056
ThrGlu: 2.819 ± 0.046
ThrPhe: 1.93 ± 0.037
ThrGly: 6.437 ± 0.119
ThrHis: 1.05 ± 0.028
ThrIle: 1.966 ± 0.047
ThrLys: 1.219 ± 0.037
ThrLeu: 6.118 ± 0.065
ThrMet: 0.948 ± 0.027
ThrAsn: 1.149 ± 0.052
ThrPro: 4.097 ± 0.063
ThrGln: 1.597 ± 0.053
ThrArg: 4.115 ± 0.06
ThrSer: 4.063 ± 0.095
ThrThr: 4.885 ± 0.167
ThrVal: 6.187 ± 0.146
ThrTrp: 1.065 ± 0.029
ThrTyr: 1.558 ± 0.065
ThrXaa: 0.0 ± 0.0
Val
ValAla: 13.731 ± 0.129
ValCys: 0.745 ± 0.03
ValAsp: 6.592 ± 0.077
ValGlu: 5.835 ± 0.083
ValPhe: 2.62 ± 0.054
ValGly: 8.731 ± 0.094
ValHis: 1.839 ± 0.038
ValIle: 2.551 ± 0.055
ValLys: 1.605 ± 0.036
ValLeu: 11.392 ± 0.141
ValMet: 1.357 ± 0.037
ValAsn: 1.686 ± 0.036
ValPro: 5.546 ± 0.073
ValGln: 2.343 ± 0.047
ValArg: 7.508 ± 0.085
ValSer: 5.167 ± 0.086
ValThr: 6.388 ± 0.214
ValVal: 12.284 ± 0.14
ValTrp: 1.263 ± 0.032
ValTyr: 1.602 ± 0.037
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.702 ± 0.04
TrpCys: 0.137 ± 0.009
TrpAsp: 0.843 ± 0.028
TrpGlu: 0.627 ± 0.022
TrpPhe: 0.535 ± 0.022
TrpGly: 1.056 ± 0.028
TrpHis: 0.323 ± 0.014
TrpIle: 0.409 ± 0.017
TrpLys: 0.265 ± 0.016
TrpLeu: 1.668 ± 0.046
TrpMet: 0.238 ± 0.015
TrpAsn: 0.353 ± 0.016
TrpPro: 0.803 ± 0.025
TrpGln: 0.513 ± 0.023
TrpArg: 1.353 ± 0.036
TrpSer: 1.063 ± 0.026
TrpThr: 1.096 ± 0.028
TrpVal: 1.355 ± 0.034
TrpTrp: 0.389 ± 0.019
TrpTyr: 0.291 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.969 ± 0.044
TyrCys: 0.165 ± 0.011
TyrAsp: 1.411 ± 0.042
TyrGlu: 0.996 ± 0.03
TyrPhe: 0.595 ± 0.023
TyrGly: 2.149 ± 0.047
TyrHis: 0.331 ± 0.015
TyrIle: 0.364 ± 0.016
TyrLys: 0.319 ± 0.018
TyrLeu: 2.054 ± 0.042
TyrMet: 0.195 ± 0.013
TyrAsn: 0.401 ± 0.021
TyrPro: 1.038 ± 0.032
TyrGln: 0.475 ± 0.02
TyrArg: 1.514 ± 0.036
TyrSer: 1.0 ± 0.033
TyrThr: 1.099 ± 0.05
TyrVal: 1.903 ± 0.042
TyrTrp: 0.327 ± 0.018
TyrTyr: 0.434 ± 0.021
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3981 proteins (1314726 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
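Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is a sketch under stated assumptions only. Using the sample standard deviation as the error, the overlapping-window counting, the function names, and the toy sequences are all hypothetical choices, as the page does not describe the exact procedure.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over a list of protein sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
proteome = ["MAAG", "AAL", "GGAA", "LLKK"]
print(pair_per_mille(proteome, "AA"), bootstrap_error(proteome, "AA"))
```

Resampling proteins rather than individual residues keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.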

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski