Amino acid dipepetide frequency for Ectocarpus siliculosus (Brown alga) (Conferva siliculosa)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.131AlaAla: 21.131 ± 0.182
1.687AlaCys: 1.687 ± 0.017
5.275AlaAsp: 5.275 ± 0.031
7.323AlaGlu: 7.323 ± 0.056
2.954AlaPhe: 2.954 ± 0.024
11.261AlaGly: 11.261 ± 0.071
1.6AlaHis: 1.6 ± 0.015
3.068AlaIle: 3.068 ± 0.023
4.402AlaLys: 4.402 ± 0.037
8.781AlaLeu: 8.781 ± 0.051
2.354AlaMet: 2.354 ± 0.019
2.451AlaAsn: 2.451 ± 0.02
5.681AlaPro: 5.681 ± 0.054
2.951AlaGln: 2.951 ± 0.024
7.163AlaArg: 7.163 ± 0.046
8.775AlaSer: 8.775 ± 0.045
6.508AlaThr: 6.508 ± 0.041
8.08AlaVal: 8.08 ± 0.041
1.235AlaTrp: 1.235 ± 0.013
1.59AlaTyr: 1.59 ± 0.02
0.002AlaXaa: 0.002 ± 0.0
Cys
1.333CysAla: 1.333 ± 0.015
0.47CysCys: 0.47 ± 0.012
0.833CysAsp: 0.833 ± 0.014
0.812CysGlu: 0.812 ± 0.011
0.58CysPhe: 0.58 ± 0.009
1.553CysGly: 1.553 ± 0.018
0.336CysHis: 0.336 ± 0.007
0.499CysIle: 0.499 ± 0.008
0.521CysLys: 0.521 ± 0.008
1.458CysLeu: 1.458 ± 0.015
0.342CysMet: 0.342 ± 0.007
0.411CysAsn: 0.411 ± 0.008
0.905CysPro: 0.905 ± 0.02
0.446CysGln: 0.446 ± 0.008
1.157CysArg: 1.157 ± 0.015
1.392CysSer: 1.392 ± 0.019
0.851CysThr: 0.851 ± 0.014
1.09CysVal: 1.09 ± 0.014
0.242CysTrp: 0.242 ± 0.006
0.347CysTyr: 0.347 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
5.62AspAla: 5.62 ± 0.033
0.779AspCys: 0.779 ± 0.013
4.879AspAsp: 4.879 ± 0.044
4.378AspGlu: 4.378 ± 0.033
1.729AspPhe: 1.729 ± 0.018
6.787AspGly: 6.787 ± 0.041
1.059AspHis: 1.059 ± 0.014
1.897AspIle: 1.897 ± 0.017
2.129AspLys: 2.129 ± 0.018
4.37AspLeu: 4.37 ± 0.027
1.239AspMet: 1.239 ± 0.013
1.699AspAsn: 1.699 ± 0.013
2.838AspPro: 2.838 ± 0.026
1.495AspGln: 1.495 ± 0.015
3.363AspArg: 3.363 ± 0.027
3.898AspSer: 3.898 ± 0.025
2.697AspThr: 2.697 ± 0.023
4.097AspVal: 4.097 ± 0.048
0.685AspTrp: 0.685 ± 0.01
1.089AspTyr: 1.089 ± 0.013
0.0AspXaa: 0.0 ± 0.0
Glu
8.018GluAla: 8.018 ± 0.058
0.83GluCys: 0.83 ± 0.012
4.5GluAsp: 4.5 ± 0.028
8.103GluGlu: 8.103 ± 0.076
1.612GluPhe: 1.612 ± 0.015
6.82GluGly: 6.82 ± 0.034
1.3GluHis: 1.3 ± 0.016
2.069GluIle: 2.069 ± 0.018
3.095GluLys: 3.095 ± 0.027
5.135GluLeu: 5.135 ± 0.038
1.524GluMet: 1.524 ± 0.016
1.772GluAsn: 1.772 ± 0.017
2.507GluPro: 2.507 ± 0.031
2.459GluGln: 2.459 ± 0.034
4.889GluArg: 4.889 ± 0.04
4.017GluSer: 4.017 ± 0.026
3.344GluThr: 3.344 ± 0.022
4.494GluVal: 4.494 ± 0.029
0.796GluTrp: 0.796 ± 0.012
1.138GluTyr: 1.138 ± 0.013
0.0GluXaa: 0.0 ± 0.0
Phe
2.497PheAla: 2.497 ± 0.021
0.605PheCys: 0.605 ± 0.01
1.875PheAsp: 1.875 ± 0.022
1.783PheGlu: 1.783 ± 0.018
1.226PhePhe: 1.226 ± 0.015
2.625PheGly: 2.625 ± 0.024
0.658PheHis: 0.658 ± 0.01
0.899PheIle: 0.899 ± 0.012
1.092PheLys: 1.092 ± 0.012
2.772PheLeu: 2.772 ± 0.022
0.66PheMet: 0.66 ± 0.009
0.921PheAsn: 0.921 ± 0.013
1.411PhePro: 1.411 ± 0.015
0.928PheGln: 0.928 ± 0.012
1.905PheArg: 1.905 ± 0.015
2.396PheSer: 2.396 ± 0.022
1.443PheThr: 1.443 ± 0.017
2.255PheVal: 2.255 ± 0.022
0.409PheTrp: 0.409 ± 0.007
0.698PheTyr: 0.698 ± 0.011
0.0PheXaa: 0.0 ± 0.0
Gly
10.304GlyAla: 10.304 ± 0.06
1.513GlyCys: 1.513 ± 0.02
6.42GlyAsp: 6.42 ± 0.042
6.456GlyGlu: 6.456 ± 0.042
2.613GlyPhe: 2.613 ± 0.022
21.055GlyGly: 21.055 ± 0.205
1.853GlyHis: 1.853 ± 0.019
2.638GlyIle: 2.638 ± 0.018
4.089GlyLys: 4.089 ± 0.032
6.554GlyLeu: 6.554 ± 0.034
2.042GlyMet: 2.042 ± 0.018
2.832GlyAsn: 2.832 ± 0.023
3.688GlyPro: 3.688 ± 0.03
2.705GlyGln: 2.705 ± 0.021
7.307GlyArg: 7.307 ± 0.048
8.606GlySer: 8.606 ± 0.052
4.895GlyThr: 4.895 ± 0.031
6.913GlyVal: 6.913 ± 0.032
1.227GlyTrp: 1.227 ± 0.016
1.694GlyTyr: 1.694 ± 0.021
0.004GlyXaa: 0.004 ± 0.001
His
1.885HisAla: 1.885 ± 0.014
0.338HisCys: 0.338 ± 0.006
1.082HisAsp: 1.082 ± 0.011
1.118HisGlu: 1.118 ± 0.014
0.635HisPhe: 0.635 ± 0.008
1.937HisGly: 1.937 ± 0.017
0.754HisHis: 0.754 ± 0.013
0.597HisIle: 0.597 ± 0.009
0.711HisLys: 0.711 ± 0.01
1.802HisLeu: 1.802 ± 0.018
0.403HisMet: 0.403 ± 0.006
0.534HisAsn: 0.534 ± 0.008
1.299HisPro: 1.299 ± 0.015
0.824HisGln: 0.824 ± 0.011
1.606HisArg: 1.606 ± 0.014
1.303HisSer: 1.303 ± 0.012
0.92HisThr: 0.92 ± 0.012
1.378HisVal: 1.378 ± 0.013
0.251HisTrp: 0.251 ± 0.005
0.427HisTyr: 0.427 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
2.831IleAla: 2.831 ± 0.024
0.503IleCys: 0.503 ± 0.008
1.778IleAsp: 1.778 ± 0.019
1.844IleGlu: 1.844 ± 0.018
1.054IlePhe: 1.054 ± 0.013
2.341IleGly: 2.341 ± 0.018
0.594IleHis: 0.594 ± 0.008
1.063IleIle: 1.063 ± 0.015
1.351IleLys: 1.351 ± 0.016
2.508IleLeu: 2.508 ± 0.02
0.677IleMet: 0.677 ± 0.011
0.969IleAsn: 0.969 ± 0.013
1.635IlePro: 1.635 ± 0.016
0.952IleGln: 0.952 ± 0.013
1.887IleArg: 1.887 ± 0.018
2.23IleSer: 2.23 ± 0.018
1.604IleThr: 1.604 ± 0.016
2.164IleVal: 2.164 ± 0.018
0.31IleTrp: 0.31 ± 0.006
0.632IleTyr: 0.632 ± 0.009
0.001IleXaa: 0.001 ± 0.0
Lys
4.626LysAla: 4.626 ± 0.033
0.462LysCys: 0.462 ± 0.008
2.243LysAsp: 2.243 ± 0.019
3.186LysGlu: 3.186 ± 0.027
0.999LysPhe: 0.999 ± 0.013
3.598LysGly: 3.598 ± 0.028
0.901LysHis: 0.901 ± 0.011
1.384LysIle: 1.384 ± 0.016
3.002LysLys: 3.002 ± 0.035
3.187LysLeu: 3.187 ± 0.026
0.971LysMet: 0.971 ± 0.014
1.258LysAsn: 1.258 ± 0.013
1.901LysPro: 1.901 ± 0.019
1.54LysGln: 1.54 ± 0.017
3.349LysArg: 3.349 ± 0.028
2.552LysSer: 2.552 ± 0.019
2.259LysThr: 2.259 ± 0.019
2.677LysVal: 2.677 ± 0.021
0.492LysTrp: 0.492 ± 0.01
0.799LysTyr: 0.799 ± 0.012
0.0LysXaa: 0.0 ± 0.0
Leu
8.518LeuAla: 8.518 ± 0.05
1.433LeuCys: 1.433 ± 0.015
4.519LeuAsp: 4.519 ± 0.029
5.909LeuGlu: 5.909 ± 0.04
2.53LeuPhe: 2.53 ± 0.022
6.443LeuGly: 6.443 ± 0.036
1.896LeuHis: 1.896 ± 0.02
2.072LeuIle: 2.072 ± 0.022
3.42LeuLys: 3.42 ± 0.027
8.179LeuLeu: 8.179 ± 0.05
1.723LeuMet: 1.723 ± 0.015
1.966LeuAsn: 1.966 ± 0.02
4.609LeuPro: 4.609 ± 0.028
3.178LeuGln: 3.178 ± 0.023
6.248LeuArg: 6.248 ± 0.042
6.392LeuSer: 6.392 ± 0.036
3.873LeuThr: 3.873 ± 0.025
5.723LeuVal: 5.723 ± 0.032
1.047LeuTrp: 1.047 ± 0.012
1.582LeuTyr: 1.582 ± 0.017
0.002LeuXaa: 0.002 ± 0.0
Met
2.363MetAla: 2.363 ± 0.021
0.326MetCys: 0.326 ± 0.006
1.251MetAsp: 1.251 ± 0.014
1.533MetGlu: 1.533 ± 0.014
0.683MetPhe: 0.683 ± 0.009
1.7MetGly: 1.7 ± 0.018
0.436MetHis: 0.436 ± 0.008
0.664MetIle: 0.664 ± 0.011
0.945MetLys: 0.945 ± 0.012
1.877MetLeu: 1.877 ± 0.017
0.633MetMet: 0.633 ± 0.011
0.562MetAsn: 0.562 ± 0.009
1.116MetPro: 1.116 ± 0.014
0.757MetGln: 0.757 ± 0.01
1.387MetArg: 1.387 ± 0.014
1.652MetSer: 1.652 ± 0.015
1.152MetThr: 1.152 ± 0.012
1.586MetVal: 1.586 ± 0.014
0.262MetTrp: 0.262 ± 0.006
0.427MetTyr: 0.427 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
2.641AsnAla: 2.641 ± 0.02
0.369AsnCys: 0.369 ± 0.008
1.561AsnAsp: 1.561 ± 0.016
1.527AsnGlu: 1.527 ± 0.016
0.843AsnPhe: 0.843 ± 0.012
2.875AsnGly: 2.875 ± 0.025
0.585AsnHis: 0.585 ± 0.009
0.999AsnIle: 0.999 ± 0.014
1.228AsnLys: 1.228 ± 0.015
2.142AsnLeu: 2.142 ± 0.022
0.609AsnMet: 0.609 ± 0.011
1.245AsnAsn: 1.245 ± 0.018
1.554AsnPro: 1.554 ± 0.014
0.844AsnGln: 0.844 ± 0.011
1.71AsnArg: 1.71 ± 0.016
2.014AsnSer: 2.014 ± 0.018
1.504AsnThr: 1.504 ± 0.015
1.811AsnVal: 1.811 ± 0.018
0.295AsnTrp: 0.295 ± 0.006
0.528AsnTyr: 0.528 ± 0.009
0.001AsnXaa: 0.001 ± 0.0
Pro
6.393ProAla: 6.393 ± 0.05
0.695ProCys: 0.695 ± 0.011
2.635ProAsp: 2.635 ± 0.029
3.298ProGlu: 3.298 ± 0.022
1.419ProPhe: 1.419 ± 0.013
4.654ProGly: 4.654 ± 0.035
0.975ProHis: 0.975 ± 0.01
1.299ProIle: 1.299 ± 0.015
1.813ProLys: 1.813 ± 0.018
4.151ProLeu: 4.151 ± 0.03
0.937ProMet: 0.937 ± 0.011
1.239ProAsn: 1.239 ± 0.014
5.882ProPro: 5.882 ± 0.066
1.654ProGln: 1.654 ± 0.017
3.594ProArg: 3.594 ± 0.028
5.247ProSer: 5.247 ± 0.039
3.363ProThr: 3.363 ± 0.051
3.459ProVal: 3.459 ± 0.04
0.672ProTrp: 0.672 ± 0.013
0.849ProTyr: 0.849 ± 0.012
0.0ProXaa: 0.0 ± 0.0
Gln
3.594GlnAla: 3.594 ± 0.026
0.399GlnCys: 0.399 ± 0.007
1.698GlnAsp: 1.698 ± 0.017
2.627GlnGlu: 2.627 ± 0.023
0.738GlnPhe: 0.738 ± 0.01
2.835GlnGly: 2.835 ± 0.02
0.948GlnHis: 0.948 ± 0.011
0.899GlnIle: 0.899 ± 0.012
1.286GlnLys: 1.286 ± 0.016
2.784GlnLeu: 2.784 ± 0.024
0.704GlnMet: 0.704 ± 0.009
0.758GlnAsn: 0.758 ± 0.011
1.709GlnPro: 1.709 ± 0.016
2.783GlnGln: 2.783 ± 0.042
2.8GlnArg: 2.8 ± 0.022
1.874GlnSer: 1.874 ± 0.015
1.471GlnThr: 1.471 ± 0.028
1.991GlnVal: 1.991 ± 0.017
0.418GlnTrp: 0.418 ± 0.01
0.583GlnTyr: 0.583 ± 0.01
0.001GlnXaa: 0.001 ± 0.0
Arg
6.963ArgAla: 6.963 ± 0.049
1.174ArgCys: 1.174 ± 0.014
3.58ArgAsp: 3.58 ± 0.026
4.937ArgGlu: 4.937 ± 0.039
1.964ArgPhe: 1.964 ± 0.019
7.02ArgGly: 7.02 ± 0.051
1.655ArgHis: 1.655 ± 0.017
1.873ArgIle: 1.873 ± 0.016
3.261ArgLys: 3.261 ± 0.024
6.059ArgLeu: 6.059 ± 0.039
1.504ArgMet: 1.504 ± 0.015
1.851ArgAsn: 1.851 ± 0.018
3.524ArgPro: 3.524 ± 0.028
2.833ArgGln: 2.833 ± 0.024
7.805ArgArg: 7.805 ± 0.055
5.519ArgSer: 5.519 ± 0.035
3.246ArgThr: 3.246 ± 0.025
4.455ArgVal: 4.455 ± 0.024
1.039ArgTrp: 1.039 ± 0.012
1.292ArgTyr: 1.292 ± 0.016
0.001ArgXaa: 0.001 ± 0.0
Ser
8.362SerAla: 8.362 ± 0.042
1.293SerCys: 1.293 ± 0.014
3.897SerAsp: 3.897 ± 0.025
4.064SerGlu: 4.064 ± 0.025
2.46SerPhe: 2.46 ± 0.022
8.332SerGly: 8.332 ± 0.059
1.289SerHis: 1.289 ± 0.012
2.253SerIle: 2.253 ± 0.02
3.059SerLys: 3.059 ± 0.023
6.215SerLeu: 6.215 ± 0.03
1.574SerMet: 1.574 ± 0.013
2.187SerAsn: 2.187 ± 0.021
5.123SerPro: 5.123 ± 0.044
2.017SerGln: 2.017 ± 0.019
5.541SerArg: 5.541 ± 0.035
10.032SerSer: 10.032 ± 0.059
4.825SerThr: 4.825 ± 0.033
4.892SerVal: 4.892 ± 0.041
0.998SerTrp: 0.998 ± 0.012
1.338SerTyr: 1.338 ± 0.017
0.002SerXaa: 0.002 ± 0.0
Thr
6.79ThrAla: 6.79 ± 0.05
0.906ThrCys: 0.906 ± 0.015
2.5ThrAsp: 2.5 ± 0.021
2.885ThrGlu: 2.885 ± 0.019
1.634ThrPhe: 1.634 ± 0.019
4.847ThrGly: 4.847 ± 0.034
0.896ThrHis: 0.896 ± 0.011
1.719ThrIle: 1.719 ± 0.019
2.07ThrLys: 2.07 ± 0.018
4.302ThrLeu: 4.302 ± 0.032
1.111ThrMet: 1.111 ± 0.015
1.409ThrAsn: 1.409 ± 0.016
3.835ThrPro: 3.835 ± 0.043
1.375ThrGln: 1.375 ± 0.014
3.203ThrArg: 3.203 ± 0.024
4.416ThrSer: 4.416 ± 0.03
3.947ThrThr: 3.947 ± 0.039
3.736ThrVal: 3.736 ± 0.037
0.604ThrTrp: 0.604 ± 0.01
0.969ThrTyr: 0.969 ± 0.015
0.002ThrXaa: 0.002 ± 0.0
Val
7.75ValAla: 7.75 ± 0.038
1.166ValCys: 1.166 ± 0.014
4.241ValAsp: 4.241 ± 0.031
4.743ValGlu: 4.743 ± 0.034
2.313ValPhe: 2.313 ± 0.021
6.004ValGly: 6.004 ± 0.026
1.346ValHis: 1.346 ± 0.014
2.046ValIle: 2.046 ± 0.02
2.669ValLys: 2.669 ± 0.022
6.211ValLeu: 6.211 ± 0.034
1.508ValMet: 1.508 ± 0.014
1.78ValAsn: 1.78 ± 0.017
3.596ValPro: 3.596 ± 0.032
2.123ValGln: 2.123 ± 0.019
4.317ValArg: 4.317 ± 0.029
5.196ValSer: 5.196 ± 0.045
3.598ValThr: 3.598 ± 0.034
5.952ValVal: 5.952 ± 0.037
0.844ValTrp: 0.844 ± 0.011
1.414ValTyr: 1.414 ± 0.019
0.001ValXaa: 0.001 ± 0.0
Trp
1.118TrpAla: 1.118 ± 0.014
0.255TrpCys: 0.255 ± 0.005
0.756TrpAsp: 0.756 ± 0.011
0.832TrpGlu: 0.832 ± 0.011
0.365TrpPhe: 0.365 ± 0.007
1.048TrpGly: 1.048 ± 0.014
0.268TrpHis: 0.268 ± 0.006
0.36TrpIle: 0.36 ± 0.007
0.566TrpLys: 0.566 ± 0.011
1.077TrpLeu: 1.077 ± 0.013
0.332TrpMet: 0.332 ± 0.006
0.39TrpAsn: 0.39 ± 0.008
0.533TrpPro: 0.533 ± 0.01
0.432TrpGln: 0.432 ± 0.007
1.061TrpArg: 1.061 ± 0.013
0.931TrpSer: 0.931 ± 0.012
0.663TrpThr: 0.663 ± 0.01
0.803TrpVal: 0.803 ± 0.011
0.222TrpTrp: 0.222 ± 0.007
0.278TrpTyr: 0.278 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.534TyrAla: 1.534 ± 0.016
0.375TyrCys: 0.375 ± 0.012
1.203TyrAsp: 1.203 ± 0.015
1.064TyrGlu: 1.064 ± 0.015
0.708TyrPhe: 0.708 ± 0.011
1.673TyrGly: 1.673 ± 0.018
0.454TyrHis: 0.454 ± 0.008
0.629TyrIle: 0.629 ± 0.011
0.678TyrLys: 0.678 ± 0.01
1.697TyrLeu: 1.697 ± 0.017
0.451TyrMet: 0.451 ± 0.008
0.651TyrAsn: 0.651 ± 0.012
0.883TyrPro: 0.883 ± 0.011
0.603TyrGln: 0.603 ± 0.01
1.266TyrArg: 1.266 ± 0.012
1.308TyrSer: 1.308 ± 0.016
0.989TyrThr: 0.989 ± 0.017
1.263TyrVal: 1.263 ± 0.016
0.243TyrTrp: 0.243 ± 0.006
0.533TyrTyr: 0.533 ± 0.012
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.016XaaMet: 0.016 ± 0.001
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.04XaaXaa: 0.04 ± 0.011
Statistics based on 16334 proteins (8478560 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski