Amino acid dipeptide frequency for Arthrospira platensis (strain NIES-39 / IAM M-135) (Spirulina platensis)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
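As an illustration of what a dipeptide frequency is, the sketch below counts adjacent residue pairs in a list of protein sequences and reports them in per mille. It is a minimal sketch, not the code used to produce this page: the function name `dipeptide_per_mille`, the toy sequences, the use of one-letter codes, and the choice of overlapping windows that never span two proteins are all assumptions.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with overlapping windows of length 2
    inside each protein, and never across protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input (one-letter codes), not taken from this proteome.
toy = ["MALLA", "LLK"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

The frequencies of all observed pairs sum to 1000 ‰ by construction, which is a convenient sanity check.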

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
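The arithmetic behind the 0.25% baseline can be sketched as follows. The names `EXPECTED_PER_MILLE` and `representation` are hypothetical, and the simple greater-than / less-than comparison is an assumption about how over- and underrepresentation might be decided; the page does not state whether the colouring also takes the error estimate into account.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 20 * 20 = 400 ordered pairs, so a uniform baseline is 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def representation(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide frequency against the uniform baseline."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table (per mille).
for pair, value in [("LeuLeu", 10.281), ("AlaAla", 6.064), ("CysCys", 0.231)]:
    print(pair, value, representation(value))
```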

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
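A short sketch of the unit conversion, using the LeuLeu value from the table. The approximate pair count is only an estimate: it assumes that dipeptides are counted with overlapping windows inside each protein, so that the total number of pairs equals the number of amino acids minus the number of proteins reported in the statistics line of this page. The names used here are hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


# Totals quoted in the statistics line of this page.
N_PROTEINS = 6009
N_RESIDUES = 1745523

# Assumption: overlapping pairs within proteins, so each protein of
# length n contributes n - 1 dipeptides.
N_PAIRS = N_RESIDUES - N_PROTEINS

leu_leu_per_mille = 10.281  # LeuLeu, from the table
fraction = per_mille_to_fraction(leu_leu_per_mille)
approx_count = round(fraction * N_PAIRS)

print(f"fraction: {fraction:.6f}, percent: {100 * fraction:.4f}%")
print(f"approximate LeuLeu occurrences: {approx_count}")
```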

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.064AlaAla: 6.064 ± 0.085
0.718AlaCys: 0.718 ± 0.02
3.772AlaAsp: 3.772 ± 0.065
4.875AlaGlu: 4.875 ± 0.073
2.63AlaPhe: 2.63 ± 0.049
4.925AlaGly: 4.925 ± 0.064
1.193AlaHis: 1.193 ± 0.03
7.579AlaIle: 7.579 ± 0.109
3.553AlaLys: 3.553 ± 0.053
7.401AlaLeu: 7.401 ± 0.076
1.575AlaMet: 1.575 ± 0.036
3.257AlaAsn: 3.257 ± 0.064
2.564AlaPro: 2.564 ± 0.05
3.493AlaGln: 3.493 ± 0.05
3.403AlaArg: 3.403 ± 0.054
4.37AlaSer: 4.37 ± 0.085
4.48AlaThr: 4.48 ± 0.064
4.811AlaVal: 4.811 ± 0.07
0.95AlaTrp: 0.95 ± 0.026
2.174AlaTyr: 2.174 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
0.547CysAla: 0.547 ± 0.018
0.231CysCys: 0.231 ± 0.017
0.657CysAsp: 0.657 ± 0.021
0.569CysGlu: 0.569 ± 0.018
0.405CysPhe: 0.405 ± 0.015
0.909CysGly: 0.909 ± 0.027
0.379CysHis: 0.379 ± 0.016
0.551CysIle: 0.551 ± 0.02
0.348CysLys: 0.348 ± 0.015
1.14CysLeu: 1.14 ± 0.028
0.156CysMet: 0.156 ± 0.009
0.408CysAsn: 0.408 ± 0.015
0.631CysPro: 0.631 ± 0.02
0.721CysGln: 0.721 ± 0.025
0.645CysArg: 0.645 ± 0.02
0.606CysSer: 0.606 ± 0.018
0.478CysThr: 0.478 ± 0.017
0.563CysVal: 0.563 ± 0.022
0.235CysTrp: 0.235 ± 0.014
0.462CysTyr: 0.462 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
3.33AspAla: 3.33 ± 0.06
0.55AspCys: 0.55 ± 0.019
2.911AspAsp: 2.911 ± 0.076
3.053AspGlu: 3.053 ± 0.063
2.43AspPhe: 2.43 ± 0.043
3.751AspGly: 3.751 ± 0.103
0.969AspHis: 0.969 ± 0.025
3.88AspIle: 3.88 ± 0.059
1.987AspLys: 1.987 ± 0.037
5.912AspLeu: 5.912 ± 0.074
0.915AspMet: 0.915 ± 0.024
2.369AspAsn: 2.369 ± 0.037
2.589AspPro: 2.589 ± 0.052
1.998AspGln: 1.998 ± 0.038
4.097AspArg: 4.097 ± 0.054
3.366AspSer: 3.366 ± 0.201
2.624AspThr: 2.624 ± 0.053
2.766AspVal: 2.766 ± 0.071
0.99AspTrp: 0.99 ± 0.022
2.128AspTyr: 2.128 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
5.473GluAla: 5.473 ± 0.08
0.5GluCys: 0.5 ± 0.019
2.945GluAsp: 2.945 ± 0.053
3.917GluGlu: 3.917 ± 0.082
2.513GluPhe: 2.513 ± 0.043
3.388GluGly: 3.388 ± 0.054
0.965GluHis: 0.965 ± 0.025
4.884GluIle: 4.884 ± 0.063
3.169GluLys: 3.169 ± 0.05
6.762GluLeu: 6.762 ± 0.072
1.458GluMet: 1.458 ± 0.03
2.917GluAsn: 2.917 ± 0.046
2.862GluPro: 2.862 ± 0.096
3.074GluGln: 3.074 ± 0.052
3.205GluArg: 3.205 ± 0.064
3.809GluSer: 3.809 ± 0.075
4.108GluThr: 4.108 ± 0.142
4.157GluVal: 4.157 ± 0.067
0.862GluTrp: 0.862 ± 0.026
1.913GluTyr: 1.913 ± 0.038
0.0GluXaa: 0.0 ± 0.0
Phe
2.457PheAla: 2.457 ± 0.036
0.524PheCys: 0.524 ± 0.019
2.309PheAsp: 2.309 ± 0.04
2.25PheGlu: 2.25 ± 0.04
1.487PhePhe: 1.487 ± 0.037
2.755PheGly: 2.755 ± 0.055
0.776PheHis: 0.776 ± 0.023
2.364PheIle: 2.364 ± 0.047
1.654PheLys: 1.654 ± 0.032
3.868PheLeu: 3.868 ± 0.053
0.745PheMet: 0.745 ± 0.022
1.951PheAsn: 1.951 ± 0.038
1.938PhePro: 1.938 ± 0.039
1.788PheGln: 1.788 ± 0.038
2.04PheArg: 2.04 ± 0.037
2.726PheSer: 2.726 ± 0.044
2.164PheThr: 2.164 ± 0.039
2.203PheVal: 2.203 ± 0.042
0.748PheTrp: 0.748 ± 0.021
1.328PheTyr: 1.328 ± 0.024
0.0PheXaa: 0.0 ± 0.0
Gly
4.168GlyAla: 4.168 ± 0.071
0.885GlyCys: 0.885 ± 0.027
4.099GlyAsp: 4.099 ± 0.093
4.43GlyGlu: 4.43 ± 0.076
3.209GlyPhe: 3.209 ± 0.052
5.296GlyGly: 5.296 ± 0.102
1.291GlyHis: 1.291 ± 0.036
4.726GlyIle: 4.726 ± 0.064
3.829GlyLys: 3.829 ± 0.056
6.746GlyLeu: 6.746 ± 0.091
1.629GlyMet: 1.629 ± 0.038
3.31GlyAsn: 3.31 ± 0.074
1.422GlyPro: 1.422 ± 0.039
3.004GlyGln: 3.004 ± 0.048
3.618GlyArg: 3.618 ± 0.052
4.336GlySer: 4.336 ± 0.068
3.741GlyThr: 3.741 ± 0.063
4.897GlyVal: 4.897 ± 0.062
1.294GlyTrp: 1.294 ± 0.032
2.426GlyTyr: 2.426 ± 0.043
0.0GlyXaa: 0.0 ± 0.0
His
0.852HisAla: 0.852 ± 0.024
0.359HisCys: 0.359 ± 0.016
0.9HisAsp: 0.9 ± 0.051
0.89HisGlu: 0.89 ± 0.026
0.726HisPhe: 0.726 ± 0.023
1.203HisGly: 1.203 ± 0.034
0.656HisHis: 0.656 ± 0.024
1.287HisIle: 1.287 ± 0.029
0.957HisLys: 0.957 ± 0.024
2.275HisLeu: 2.275 ± 0.044
0.218HisMet: 0.218 ± 0.012
0.928HisAsn: 0.928 ± 0.023
1.443HisPro: 1.443 ± 0.034
1.461HisGln: 1.461 ± 0.03
1.224HisArg: 1.224 ± 0.032
1.253HisSer: 1.253 ± 0.033
1.043HisThr: 1.043 ± 0.029
0.713HisVal: 0.713 ± 0.021
0.368HisTrp: 0.368 ± 0.017
0.681HisTyr: 0.681 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
7.045IleAla: 7.045 ± 0.093
0.773IleCys: 0.773 ± 0.022
4.008IleAsp: 4.008 ± 0.051
4.297IleGlu: 4.297 ± 0.059
2.525IlePhe: 2.525 ± 0.044
4.361IleGly: 4.361 ± 0.063
1.258IleHis: 1.258 ± 0.027
4.304IleIle: 4.304 ± 0.063
3.181IleLys: 3.181 ± 0.053
6.589IleLeu: 6.589 ± 0.075
1.071IleMet: 1.071 ± 0.025
3.563IleAsn: 3.563 ± 0.058
3.771IlePro: 3.771 ± 0.049
2.916IleGln: 2.916 ± 0.136
3.456IleArg: 3.456 ± 0.051
4.854IleSer: 4.854 ± 0.056
3.913IleThr: 3.913 ± 0.059
3.962IleVal: 3.962 ± 0.055
0.95IleTrp: 0.95 ± 0.025
2.209IleTyr: 2.209 ± 0.043
0.0IleXaa: 0.0 ± 0.0
Lys
3.689LysAla: 3.689 ± 0.062
0.446LysCys: 0.446 ± 0.017
1.825LysAsp: 1.825 ± 0.042
2.095LysGlu: 2.095 ± 0.039
1.669LysPhe: 1.669 ± 0.038
2.639LysGly: 2.639 ± 0.048
0.768LysHis: 0.768 ± 0.02
3.227LysIle: 3.227 ± 0.049
2.323LysLys: 2.323 ± 0.06
4.965LysLeu: 4.965 ± 0.066
0.966LysMet: 0.966 ± 0.027
1.986LysAsn: 1.986 ± 0.033
2.788LysPro: 2.788 ± 0.045
2.495LysGln: 2.495 ± 0.048
2.352LysArg: 2.352 ± 0.047
3.135LysSer: 3.135 ± 0.045
3.21LysThr: 3.21 ± 0.048
2.655LysVal: 2.655 ± 0.048
0.514LysTrp: 0.514 ± 0.018
1.358LysTyr: 1.358 ± 0.03
0.0LysXaa: 0.0 ± 0.0
Leu
8.228LeuAla: 8.228 ± 0.078
0.977LeuCys: 0.977 ± 0.024
5.389LeuAsp: 5.389 ± 0.061
7.377LeuGlu: 7.377 ± 0.084
3.487LeuPhe: 3.487 ± 0.06
7.838LeuGly: 7.838 ± 0.074
1.794LeuHis: 1.794 ± 0.04
6.804LeuIle: 6.804 ± 0.077
5.302LeuLys: 5.302 ± 0.058
10.281LeuLeu: 10.281 ± 0.093
2.278LeuMet: 2.278 ± 0.04
4.73LeuAsn: 4.73 ± 0.065
5.377LeuPro: 5.377 ± 0.062
5.241LeuGln: 5.241 ± 0.077
5.456LeuArg: 5.456 ± 0.069
7.532LeuSer: 7.532 ± 0.119
6.215LeuThr: 6.215 ± 0.068
6.679LeuVal: 6.679 ± 0.079
1.664LeuTrp: 1.664 ± 0.039
2.738LeuTyr: 2.738 ± 0.045
0.0LeuXaa: 0.0 ± 0.0
Met
2.069MetAla: 2.069 ± 0.037
0.131MetCys: 0.131 ± 0.008
0.787MetAsp: 0.787 ± 0.023
1.135MetGlu: 1.135 ± 0.03
0.582MetPhe: 0.582 ± 0.018
1.559MetGly: 1.559 ± 0.034
0.208MetHis: 0.208 ± 0.012
1.333MetIle: 1.333 ± 0.031
0.945MetLys: 0.945 ± 0.024
1.908MetLeu: 1.908 ± 0.035
0.558MetMet: 0.558 ± 0.021
0.89MetAsn: 0.89 ± 0.027
0.864MetPro: 0.864 ± 0.023
0.761MetGln: 0.761 ± 0.023
1.058MetArg: 1.058 ± 0.022
1.459MetSer: 1.459 ± 0.044
1.331MetThr: 1.331 ± 0.034
1.604MetVal: 1.604 ± 0.036
0.195MetTrp: 0.195 ± 0.012
0.458MetTyr: 0.458 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
2.954AsnAla: 2.954 ± 0.096
0.595AsnCys: 0.595 ± 0.02
1.934AsnAsp: 1.934 ± 0.057
1.692AsnGlu: 1.692 ± 0.039
1.822AsnPhe: 1.822 ± 0.035
2.714AsnGly: 2.714 ± 0.042
1.23AsnHis: 1.23 ± 0.029
2.922AsnIle: 2.922 ± 0.051
1.512AsnLys: 1.512 ± 0.036
5.783AsnLeu: 5.783 ± 0.084
0.8AsnMet: 0.8 ± 0.021
2.121AsnAsn: 2.121 ± 0.097
3.261AsnPro: 3.261 ± 0.051
3.016AsnGln: 3.016 ± 0.042
2.702AsnArg: 2.702 ± 0.048
3.144AsnSer: 3.144 ± 0.053
2.192AsnThr: 2.192 ± 0.037
2.044AsnVal: 2.044 ± 0.037
0.953AsnTrp: 0.953 ± 0.024
1.748AsnTyr: 1.748 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
2.79ProAla: 2.79 ± 0.047
0.36ProCys: 0.36 ± 0.016
3.429ProAsp: 3.429 ± 0.051
4.64ProGlu: 4.64 ± 0.094
1.583ProPhe: 1.583 ± 0.038
3.368ProGly: 3.368 ± 0.056
1.032ProHis: 1.032 ± 0.03
3.244ProIle: 3.244 ± 0.058
2.257ProLys: 2.257 ± 0.04
4.467ProLeu: 4.467 ± 0.055
0.903ProMet: 0.903 ± 0.023
2.476ProAsn: 2.476 ± 0.043
3.033ProPro: 3.033 ± 0.059
2.883ProGln: 2.883 ± 0.048
2.175ProArg: 2.175 ± 0.041
3.32ProSer: 3.32 ± 0.088
3.245ProThr: 3.245 ± 0.068
3.287ProVal: 3.287 ± 0.1
0.674ProTrp: 0.674 ± 0.02
1.455ProTyr: 1.455 ± 0.037
0.0ProXaa: 0.0 ± 0.0
Gln
4.253GlnAla: 4.253 ± 0.065
0.445GlnCys: 0.445 ± 0.019
2.218GlnAsp: 2.218 ± 0.037
3.378GlnGlu: 3.378 ± 0.057
1.846GlnPhe: 1.846 ± 0.035
3.342GlnGly: 3.342 ± 0.051
0.831GlnHis: 0.831 ± 0.021
3.595GlnIle: 3.595 ± 0.05
2.67GlnLys: 2.67 ± 0.053
5.986GlnLeu: 5.986 ± 0.08
1.089GlnMet: 1.089 ± 0.027
2.236GlnAsn: 2.236 ± 0.042
2.733GlnPro: 2.733 ± 0.051
3.736GlnGln: 3.736 ± 0.074
3.031GlnArg: 3.031 ± 0.135
2.922GlnSer: 2.922 ± 0.059
2.973GlnThr: 2.973 ± 0.048
3.605GlnVal: 3.605 ± 0.06
0.785GlnTrp: 0.785 ± 0.025
1.311GlnTyr: 1.311 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
3.299ArgAla: 3.299 ± 0.061
0.655ArgCys: 0.655 ± 0.023
3.016ArgAsp: 3.016 ± 0.05
3.554ArgGlu: 3.554 ± 0.058
2.443ArgPhe: 2.443 ± 0.04
3.429ArgGly: 3.429 ± 0.055
1.376ArgHis: 1.376 ± 0.031
3.468ArgIle: 3.468 ± 0.043
2.083ArgLys: 2.083 ± 0.036
6.51ArgLeu: 6.51 ± 0.11
1.035ArgMet: 1.035 ± 0.024
2.144ArgAsn: 2.144 ± 0.039
2.325ArgPro: 2.325 ± 0.038
3.544ArgGln: 3.544 ± 0.05
3.504ArgArg: 3.504 ± 0.059
3.487ArgSer: 3.487 ± 0.049
2.588ArgThr: 2.588 ± 0.041
3.68ArgVal: 3.68 ± 0.058
0.88ArgTrp: 0.88 ± 0.023
1.918ArgTyr: 1.918 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
4.098SerAla: 4.098 ± 0.06
0.64SerCys: 0.64 ± 0.021
3.998SerAsp: 3.998 ± 0.176
4.199SerGlu: 4.199 ± 0.15
2.221SerPhe: 2.221 ± 0.043
4.756SerGly: 4.756 ± 0.087
1.543SerHis: 1.543 ± 0.034
3.817SerIle: 3.817 ± 0.083
2.407SerLys: 2.407 ± 0.037
7.149SerLeu: 7.149 ± 0.07
1.3SerMet: 1.3 ± 0.031
2.626SerAsn: 2.626 ± 0.046
4.276SerPro: 4.276 ± 0.129
4.229SerGln: 4.229 ± 0.092
3.711SerArg: 3.711 ± 0.05
4.325SerSer: 4.325 ± 0.061
3.188SerThr: 3.188 ± 0.069
3.734SerVal: 3.734 ± 0.054
1.042SerTrp: 1.042 ± 0.028
2.052SerTyr: 2.052 ± 0.056
0.0SerXaa: 0.0 ± 0.0
Thr
4.508ThrAla: 4.508 ± 0.069
0.504ThrCys: 0.504 ± 0.017
2.705ThrAsp: 2.705 ± 0.049
3.664ThrGlu: 3.664 ± 0.06
1.963ThrPhe: 1.963 ± 0.039
4.286ThrGly: 4.286 ± 0.065
1.125ThrHis: 1.125 ± 0.04
4.044ThrIle: 4.044 ± 0.072
2.011ThrLys: 2.011 ± 0.038
6.179ThrLeu: 6.179 ± 0.082
0.942ThrMet: 0.942 ± 0.022
2.237ThrAsn: 2.237 ± 0.044
3.942ThrPro: 3.942 ± 0.1
2.817ThrGln: 2.817 ± 0.046
2.715ThrArg: 2.715 ± 0.05
3.333ThrSer: 3.333 ± 0.136
3.22ThrThr: 3.22 ± 0.062
3.86ThrVal: 3.86 ± 0.067
0.818ThrTrp: 0.818 ± 0.023
1.648ThrTyr: 1.648 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
5.28ValAla: 5.28 ± 0.074
0.683ValCys: 0.683 ± 0.022
3.334ValAsp: 3.334 ± 0.101
4.087ValGlu: 4.087 ± 0.071
2.483ValPhe: 2.483 ± 0.04
4.487ValGly: 4.487 ± 0.076
0.894ValHis: 0.894 ± 0.027
4.344ValIle: 4.344 ± 0.058
3.098ValLys: 3.098 ± 0.048
5.81ValLeu: 5.81 ± 0.062
1.39ValMet: 1.39 ± 0.034
3.024ValAsn: 3.024 ± 0.084
2.588ValPro: 2.588 ± 0.045
2.391ValGln: 2.391 ± 0.038
3.401ValArg: 3.401 ± 0.051
4.255ValSer: 4.255 ± 0.062
3.549ValThr: 3.549 ± 0.05
4.377ValVal: 4.377 ± 0.067
0.895ValTrp: 0.895 ± 0.027
1.842ValTyr: 1.842 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
0.932TrpAla: 0.932 ± 0.028
0.268TrpCys: 0.268 ± 0.015
0.812TrpAsp: 0.812 ± 0.025
1.21TrpGlu: 1.21 ± 0.03
0.638TrpPhe: 0.638 ± 0.023
1.154TrpGly: 1.154 ± 0.032
0.311TrpHis: 0.311 ± 0.014
0.791TrpIle: 0.791 ± 0.022
0.661TrpLys: 0.661 ± 0.018
2.071TrpLeu: 2.071 ± 0.04
0.339TrpMet: 0.339 ± 0.015
0.569TrpAsn: 0.569 ± 0.019
0.334TrpPro: 0.334 ± 0.016
1.027TrpGln: 1.027 ± 0.024
0.933TrpArg: 0.933 ± 0.026
0.964TrpSer: 0.964 ± 0.025
0.797TrpThr: 0.797 ± 0.023
1.148TrpVal: 1.148 ± 0.035
0.266TrpTrp: 0.266 ± 0.016
0.423TrpTyr: 0.423 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.748TyrAla: 1.748 ± 0.035
0.398TyrCys: 0.398 ± 0.016
1.57TyrAsp: 1.57 ± 0.033
1.57TyrGlu: 1.57 ± 0.039
1.411TyrPhe: 1.411 ± 0.029
2.196TyrGly: 2.196 ± 0.042
0.835TyrHis: 0.835 ± 0.024
1.843TyrIle: 1.843 ± 0.038
1.158TyrLys: 1.158 ± 0.029
3.586TyrLeu: 3.586 ± 0.045
0.467TyrMet: 0.467 ± 0.017
1.395TyrAsn: 1.395 ± 0.029
1.881TyrPro: 1.881 ± 0.038
2.39TyrGln: 2.39 ± 0.052
2.215TyrArg: 2.215 ± 0.042
2.116TyrSer: 2.116 ± 0.042
1.459TyrThr: 1.459 ± 0.033
1.468TyrVal: 1.468 ± 0.032
0.563TyrTrp: 0.563 ± 0.019
1.228TyrTyr: 1.228 ± 0.03
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6009 proteins (1745523 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
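Bootstrapping at the protein level can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates serves as the error. This is a minimal sketch under stated assumptions, not the procedure used for this page: the function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error measure are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(sequences) proteins with replacement, so the
    resampling unit is the protein rather than the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes).
toy = ["MALLA", "LLK", "ALLLA", "MKVLA"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.1f} ± {err:.1f} per mille")
```

Resampling whole proteins keeps the pairs from one protein together, so the error reflects variation between proteins rather than treating every dipeptide as independent.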

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski