Amino acid dipeptide frequency for Aliikangiella marina

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
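As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping residue pairs and reports them in per mille. It is an illustration, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter alphabet, the mapping of any non-standard residue to `X` (Xaa), and the choice to count pairs only within a protein are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Overlapping pairs are counted within each protein only, never across
    protein boundaries. Residues outside STANDARD are mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
UNIFORM_EXPECTATION = 1000.0 / 400

# Hypothetical toy sequences, used only to exercise the function.
toy = ["MALLKA", "MKALLA"]
freqs = dipeptide_per_mille(toy)
over = sorted(d for d, f in freqs.items() if f > UNIFORM_EXPECTATION)
print(over)
```

On a real proteome the input would be the protein sequences of the organism; the toy list here only shows the mechanics.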

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
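The unit conversion can be sketched in a few lines of Python. The two sample values (LeuLeu and CysCys) are taken from the table; the helper names and the use of the uniform 2.5‰ expectation as the over/under threshold are assumptions made for illustration, since the page does not state the exact rule behind the colouring.

```python
# Uniform expectation over 400 dipeptides, in per mille (0.25% = 2.5 per mille).
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the assumed expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


leu_leu = 9.878  # LeuLeu, per mille, from the table
cys_cys = 0.128  # CysCys, per mille, from the table

print(per_mille_to_fraction(leu_leu), classify(leu_leu))
print(per_mille_to_fraction(cys_cys), classify(cys_cys))
```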

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.669 ± 0.106
AlaCys: 0.789 ± 0.026
AlaAsp: 4.41 ± 0.063
AlaGlu: 5.571 ± 0.076
AlaPhe: 3.339 ± 0.05
AlaGly: 5.267 ± 0.073
AlaHis: 1.438 ± 0.03
AlaIle: 5.926 ± 0.068
AlaLys: 5.031 ± 0.076
AlaLeu: 8.034 ± 0.089
AlaMet: 2.042 ± 0.045
AlaAsn: 3.919 ± 0.059
AlaPro: 2.582 ± 0.041
AlaGln: 3.336 ± 0.046
AlaArg: 3.378 ± 0.062
AlaSer: 5.363 ± 0.074
AlaThr: 4.162 ± 0.061
AlaVal: 5.094 ± 0.071
AlaTrp: 0.869 ± 0.022
AlaTyr: 2.367 ± 0.042
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.641 ± 0.023
CysCys: 0.128 ± 0.01
CysAsp: 0.575 ± 0.02
CysGlu: 0.616 ± 0.022
CysPhe: 0.456 ± 0.019
CysGly: 0.831 ± 0.027
CysHis: 0.305 ± 0.016
CysIle: 0.524 ± 0.021
CysLys: 0.41 ± 0.018
CysLeu: 0.98 ± 0.028
CysMet: 0.17 ± 0.01
CysAsn: 0.351 ± 0.017
CysPro: 0.418 ± 0.02
CysGln: 0.422 ± 0.017
CysArg: 0.469 ± 0.017
CysSer: 0.635 ± 0.022
CysThr: 0.37 ± 0.016
CysVal: 0.587 ± 0.021
CysTrp: 0.116 ± 0.01
CysTyr: 0.3 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.058 ± 0.056
AspCys: 0.582 ± 0.02
AspAsp: 3.289 ± 0.061
AspGlu: 4.055 ± 0.068
AspPhe: 2.919 ± 0.047
AspGly: 3.854 ± 0.069
AspHis: 0.995 ± 0.025
AspIle: 3.985 ± 0.055
AspLys: 3.617 ± 0.053
AspLeu: 5.499 ± 0.068
AspMet: 1.261 ± 0.031
AspAsn: 2.838 ± 0.053
AspPro: 2.236 ± 0.052
AspGln: 2.119 ± 0.039
AspArg: 2.31 ± 0.037
AspSer: 4.13 ± 0.06
AspThr: 2.624 ± 0.05
AspVal: 3.412 ± 0.049
AspTrp: 0.978 ± 0.029
AspTyr: 2.23 ± 0.044
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.98 ± 0.064
GluCys: 0.477 ± 0.018
GluAsp: 3.151 ± 0.048
GluGlu: 4.13 ± 0.071
GluPhe: 2.969 ± 0.048
GluGly: 3.335 ± 0.058
GluHis: 1.316 ± 0.029
GluIle: 4.588 ± 0.06
GluLys: 4.708 ± 0.065
GluLeu: 7.156 ± 0.089
GluMet: 1.562 ± 0.033
GluAsn: 3.382 ± 0.048
GluPro: 2.064 ± 0.038
GluGln: 3.395 ± 0.053
GluArg: 3.238 ± 0.054
GluSer: 4.972 ± 0.059
GluThr: 3.46 ± 0.042
GluVal: 4.252 ± 0.06
GluTrp: 0.765 ± 0.022
GluTyr: 2.028 ± 0.04
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.494 ± 0.055
PheCys: 0.492 ± 0.019
PheAsp: 3.184 ± 0.051
PheGlu: 3.222 ± 0.056
PhePhe: 1.922 ± 0.039
PheGly: 3.024 ± 0.055
PheHis: 0.844 ± 0.023
PheIle: 2.942 ± 0.048
PheLys: 2.562 ± 0.04
PheLeu: 3.81 ± 0.058
PheMet: 0.969 ± 0.028
PheAsn: 2.411 ± 0.045
PhePro: 1.477 ± 0.028
PheGln: 1.506 ± 0.033
PheArg: 1.706 ± 0.038
PheSer: 3.697 ± 0.051
PheThr: 2.258 ± 0.046
PheVal: 2.954 ± 0.046
PheTrp: 0.582 ± 0.021
PheTyr: 1.582 ± 0.034
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.675 ± 0.077
GlyCys: 0.753 ± 0.021
GlyAsp: 3.764 ± 0.067
GlyGlu: 4.418 ± 0.053
GlyPhe: 3.364 ± 0.061
GlyGly: 4.594 ± 0.088
GlyHis: 1.353 ± 0.035
GlyIle: 4.458 ± 0.069
GlyLys: 3.828 ± 0.061
GlyLeu: 6.412 ± 0.079
GlyMet: 1.637 ± 0.037
GlyAsn: 2.816 ± 0.062
GlyPro: 1.636 ± 0.035
GlyGln: 2.448 ± 0.045
GlyArg: 2.834 ± 0.049
GlySer: 4.051 ± 0.069
GlyThr: 3.152 ± 0.06
GlyVal: 4.605 ± 0.07
GlyTrp: 0.924 ± 0.025
GlyTyr: 2.386 ± 0.042
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.316 ± 0.028
HisCys: 0.291 ± 0.013
HisAsp: 0.967 ± 0.025
HisGlu: 1.107 ± 0.028
HisPhe: 1.062 ± 0.028
HisGly: 1.254 ± 0.029
HisHis: 0.618 ± 0.026
HisIle: 1.234 ± 0.03
HisLys: 1.017 ± 0.028
HisLeu: 2.106 ± 0.044
HisMet: 0.437 ± 0.017
HisAsn: 0.814 ± 0.024
HisPro: 0.992 ± 0.028
HisGln: 1.182 ± 0.034
HisArg: 1.014 ± 0.028
HisSer: 1.368 ± 0.034
HisThr: 0.876 ± 0.027
HisVal: 1.076 ± 0.027
HisTrp: 0.342 ± 0.015
HisTyr: 0.8 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.212 ± 0.069
IleCys: 0.644 ± 0.023
IleAsp: 4.836 ± 0.058
IleGlu: 5.333 ± 0.06
IlePhe: 2.605 ± 0.045
IleGly: 4.672 ± 0.062
IleHis: 1.238 ± 0.03
IleIle: 4.114 ± 0.068
IleLys: 4.107 ± 0.062
IleLeu: 5.636 ± 0.06
IleMet: 1.171 ± 0.027
IleAsn: 3.784 ± 0.061
IlePro: 2.8 ± 0.049
IleGln: 2.56 ± 0.04
IleArg: 2.969 ± 0.041
IleSer: 4.885 ± 0.053
IleThr: 3.532 ± 0.052
IleVal: 4.333 ± 0.058
IleTrp: 0.75 ± 0.024
IleTyr: 1.904 ± 0.038
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.865 ± 0.066
LysCys: 0.363 ± 0.019
LysAsp: 3.067 ± 0.047
LysGlu: 3.548 ± 0.062
LysPhe: 1.994 ± 0.037
LysGly: 3.189 ± 0.045
LysHis: 1.251 ± 0.033
LysIle: 3.999 ± 0.062
LysLys: 3.7 ± 0.066
LysLeu: 6.151 ± 0.072
LysMet: 1.312 ± 0.031
LysAsn: 2.935 ± 0.055
LysPro: 2.492 ± 0.045
LysGln: 3.06 ± 0.046
LysArg: 2.971 ± 0.05
LysSer: 4.037 ± 0.064
LysThr: 3.351 ± 0.052
LysVal: 4.093 ± 0.061
LysTrp: 0.562 ± 0.021
LysTyr: 1.596 ± 0.033
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.945 ± 0.086
LeuCys: 0.887 ± 0.026
LeuAsp: 5.647 ± 0.068
LeuGlu: 6.423 ± 0.081
LeuPhe: 4.397 ± 0.069
LeuGly: 6.126 ± 0.064
LeuHis: 1.632 ± 0.029
LeuIle: 6.831 ± 0.082
LeuLys: 6.253 ± 0.078
LeuLeu: 9.878 ± 0.117
LeuMet: 2.441 ± 0.041
LeuAsn: 5.05 ± 0.068
LeuPro: 4.254 ± 0.062
LeuGln: 3.584 ± 0.054
LeuArg: 4.059 ± 0.054
LeuSer: 8.001 ± 0.091
LeuThr: 5.744 ± 0.069
LeuVal: 6.99 ± 0.09
LeuTrp: 1.023 ± 0.029
LeuTyr: 2.569 ± 0.044
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.975 ± 0.045
MetCys: 0.173 ± 0.009
MetAsp: 1.1 ± 0.033
MetGlu: 1.171 ± 0.031
MetPhe: 0.848 ± 0.026
MetGly: 1.497 ± 0.038
MetHis: 0.397 ± 0.017
MetIle: 1.45 ± 0.033
MetLys: 1.407 ± 0.035
MetLeu: 2.325 ± 0.037
MetMet: 0.608 ± 0.024
MetAsn: 1.051 ± 0.028
MetPro: 1.015 ± 0.026
MetGln: 1.047 ± 0.027
MetArg: 1.137 ± 0.029
MetSer: 1.81 ± 0.037
MetThr: 1.362 ± 0.03
MetVal: 1.508 ± 0.036
MetTrp: 0.201 ± 0.012
MetTyr: 0.477 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.514 ± 0.052
AsnCys: 0.513 ± 0.019
AsnAsp: 2.678 ± 0.049
AsnGlu: 2.895 ± 0.048
AsnPhe: 2.043 ± 0.038
AsnGly: 3.072 ± 0.066
AsnHis: 1.083 ± 0.027
AsnIle: 3.245 ± 0.044
AsnLys: 2.712 ± 0.046
AsnLeu: 4.859 ± 0.071
AsnMet: 0.959 ± 0.027
AsnAsn: 2.597 ± 0.056
AsnPro: 2.358 ± 0.04
AsnGln: 3.014 ± 0.06
AsnArg: 2.456 ± 0.047
AsnSer: 3.24 ± 0.062
AsnThr: 2.383 ± 0.048
AsnVal: 2.653 ± 0.046
AsnTrp: 0.687 ± 0.026
AsnTyr: 1.702 ± 0.041
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.825 ± 0.048
ProCys: 0.267 ± 0.014
ProAsp: 2.268 ± 0.039
ProGlu: 3.04 ± 0.046
ProPhe: 1.758 ± 0.032
ProGly: 2.442 ± 0.048
ProHis: 0.741 ± 0.021
ProIle: 2.688 ± 0.042
ProLys: 2.213 ± 0.042
ProLeu: 3.767 ± 0.059
ProMet: 0.823 ± 0.027
ProAsn: 1.837 ± 0.042
ProPro: 1.295 ± 0.04
ProGln: 1.642 ± 0.033
ProArg: 1.352 ± 0.027
ProSer: 2.509 ± 0.044
ProThr: 2.028 ± 0.04
ProVal: 2.917 ± 0.042
ProTrp: 0.476 ± 0.018
ProTyr: 1.092 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.872 ± 0.057
GlnCys: 0.344 ± 0.016
GlnAsp: 2.031 ± 0.039
GlnGlu: 2.608 ± 0.047
GlnPhe: 1.948 ± 0.033
GlnGly: 2.556 ± 0.043
GlnHis: 0.853 ± 0.025
GlnIle: 3.038 ± 0.041
GlnLys: 2.506 ± 0.04
GlnLeu: 5.14 ± 0.074
GlnMet: 1.032 ± 0.027
GlnAsn: 1.952 ± 0.042
GlnPro: 1.656 ± 0.038
GlnGln: 2.671 ± 0.057
GlnArg: 2.122 ± 0.043
GlnSer: 3.418 ± 0.057
GlnThr: 2.212 ± 0.037
GlnVal: 3.155 ± 0.06
GlnTrp: 0.594 ± 0.022
GlnTyr: 1.333 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.179 ± 0.053
ArgCys: 0.393 ± 0.018
ArgAsp: 2.399 ± 0.038
ArgGlu: 2.991 ± 0.055
ArgPhe: 2.288 ± 0.041
ArgGly: 2.534 ± 0.051
ArgHis: 1.028 ± 0.025
ArgIle: 3.107 ± 0.051
ArgLys: 2.557 ± 0.05
ArgLeu: 5.063 ± 0.063
ArgMet: 1.1 ± 0.027
ArgAsn: 2.102 ± 0.039
ArgPro: 1.546 ± 0.032
ArgGln: 2.204 ± 0.042
ArgArg: 2.221 ± 0.049
ArgSer: 2.728 ± 0.041
ArgThr: 2.014 ± 0.042
ArgVal: 3.21 ± 0.049
ArgTrp: 0.667 ± 0.025
ArgTyr: 1.689 ± 0.033
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.397 ± 0.065
SerCys: 0.654 ± 0.023
SerAsp: 4.111 ± 0.05
SerGlu: 4.763 ± 0.063
SerPhe: 3.339 ± 0.053
SerGly: 5.133 ± 0.068
SerHis: 1.531 ± 0.036
SerIle: 4.98 ± 0.065
SerLys: 3.662 ± 0.059
SerLeu: 7.461 ± 0.088
SerMet: 1.611 ± 0.033
SerAsn: 3.321 ± 0.062
SerPro: 2.65 ± 0.046
SerGln: 3.506 ± 0.054
SerArg: 3.227 ± 0.044
SerSer: 4.94 ± 0.079
SerThr: 3.374 ± 0.05
SerVal: 4.649 ± 0.068
SerTrp: 0.912 ± 0.027
SerTyr: 2.168 ± 0.048
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.134 ± 0.066
ThrCys: 0.421 ± 0.019
ThrAsp: 2.974 ± 0.06
ThrGlu: 3.189 ± 0.046
ThrPhe: 2.152 ± 0.044
ThrGly: 3.738 ± 0.062
ThrHis: 1.14 ± 0.029
ThrIle: 3.658 ± 0.05
ThrLys: 2.482 ± 0.044
ThrLeu: 5.297 ± 0.065
ThrMet: 0.965 ± 0.027
ThrAsn: 2.423 ± 0.05
ThrPro: 2.444 ± 0.042
ThrGln: 2.488 ± 0.046
ThrArg: 2.319 ± 0.043
ThrSer: 3.471 ± 0.049
ThrThr: 2.867 ± 0.055
ThrVal: 3.482 ± 0.056
ThrTrp: 0.569 ± 0.02
ThrTyr: 1.504 ± 0.041
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.85 ± 0.058
ValCys: 0.657 ± 0.021
ValAsp: 4.266 ± 0.051
ValGlu: 4.588 ± 0.058
ValPhe: 2.88 ± 0.047
ValGly: 4.271 ± 0.066
ValHis: 1.073 ± 0.029
ValIle: 4.777 ± 0.064
ValLys: 3.814 ± 0.049
ValLeu: 6.07 ± 0.069
ValMet: 1.522 ± 0.034
ValAsn: 3.357 ± 0.058
ValPro: 2.33 ± 0.043
ValGln: 1.912 ± 0.037
ValArg: 2.608 ± 0.049
ValSer: 5.024 ± 0.068
ValThr: 3.907 ± 0.067
ValVal: 4.745 ± 0.063
ValTrp: 0.722 ± 0.025
ValTyr: 1.916 ± 0.036
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.723 ± 0.025
TrpCys: 0.119 ± 0.009
TrpAsp: 0.571 ± 0.021
TrpGlu: 0.586 ± 0.021
TrpPhe: 0.647 ± 0.02
TrpGly: 0.751 ± 0.025
TrpHis: 0.3 ± 0.014
TrpIle: 0.796 ± 0.027
TrpLys: 0.569 ± 0.021
TrpLeu: 1.591 ± 0.032
TrpMet: 0.336 ± 0.014
TrpAsn: 0.527 ± 0.018
TrpPro: 0.435 ± 0.019
TrpGln: 0.902 ± 0.027
TrpArg: 0.712 ± 0.022
TrpSer: 0.857 ± 0.029
TrpThr: 0.591 ± 0.019
TrpVal: 0.858 ± 0.027
TrpTrp: 0.185 ± 0.012
TrpTyr: 0.387 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.26 ± 0.04
TyrCys: 0.359 ± 0.015
TyrAsp: 1.701 ± 0.04
TyrGlu: 1.674 ± 0.032
TyrPhe: 1.709 ± 0.034
TyrGly: 1.969 ± 0.042
TyrHis: 0.737 ± 0.024
TyrIle: 1.743 ± 0.038
TyrLys: 1.382 ± 0.03
TyrLeu: 3.499 ± 0.047
TyrMet: 0.601 ± 0.018
TyrAsn: 1.198 ± 0.035
TyrPro: 1.299 ± 0.028
TyrGln: 2.045 ± 0.042
TyrArg: 1.918 ± 0.044
TyrSer: 2.251 ± 0.048
TyrThr: 1.45 ± 0.03
TyrVal: 1.719 ± 0.038
TyrTrp: 0.517 ± 0.018
TyrTyr: 1.087 ± 0.029
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4367 proteins (1527963 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
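A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates is reported as the error. This is an illustration under stated assumptions, not the Proteome-pI code: the function names, the fixed seed, and the use of the sample standard deviation as the "±" value are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, used only to exercise the function.
toy = ["MALLKA", "MKALLA", "MLLLLG", "MAGKAA"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects protein-to-protein variation.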

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski