Amino acid dipeptide frequency for Cyanothece sp. (strain PCC 7425 / ATCC 29141)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
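As a rough illustration of how such a frequency could be computed, here is a minimal Python sketch. It assumes that overlapping adjacent residue pairs are counted within each protein (never across protein boundaries) and that any letter outside the 20 standard amino acids is mapped to Xaa. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and this is not necessarily the procedure used to build the table.

```python
from collections import Counter

# One-letter to three-letter codes; any other letter is treated as Xaa.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        codes = [THREE_LETTER.get(res, "Xaa") for res in seq.upper()]
        counts.update(first + second for first, second in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["MALLA", "AAL"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because the frequencies are normalised by the total number of pairs, they sum to 1000 ‰ over all observed dipeptides.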

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
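The comparison against the uniform expectation can be sketched as follows. This assumes the baseline is simply 1/400 for every dipeptide, as described above; the exact rule used for the red and blue marking is not stated on this page, and the helper `representation` is a hypothetical name. The example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def representation(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Return (label, observed/expected ratio) for one dipeptide."""
    ratio = observed_per_mille / expected
    if ratio > 1:
        label = "over"
    elif ratio < 1:
        label = "under"
    else:
        label = "as expected"
    return label, ratio


# Values copied from the table (per mille).
examples = {"LeuLeu": 13.752, "AlaAla": 9.344, "CysCys": 0.206, "MetTrp": 0.123}
for pair, value in examples.items():
    label, ratio = representation(value)
    print(f"{pair}: {label}-represented, {ratio:.2f}x the uniform expectation")
```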

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
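A short sketch of the unit conversion, using the AlaAla value from the table. The estimated pair count additionally assumes that every protein of length n contributes n - 1 overlapping dipeptides, so the total number of dipeptides is the number of amino acids minus the number of proteins reported at the end of this page; that assumption and the helper names are not confirmed by the source.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


ALA_ALA = 9.344          # per mille, from the table
PROTEINS = 5232          # from the statistics line of this page
AMINO_ACIDS = 1607800    # from the statistics line of this page

# Assumption: n - 1 overlapping pairs per protein of length n.
total_pairs = AMINO_ACIDS - PROTEINS
estimated_count = per_mille_to_fraction(ALA_ALA) * total_pairs

print(f"fraction: {per_mille_to_fraction(ALA_ALA):.6f}")
print(f"percent:  {per_mille_to_percent(ALA_ALA):.4f}")
print(f"approximate AlaAla occurrences: {estimated_count:.0f}")
```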

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.344 ± 0.101
AlaCys: 0.965 ± 0.028
AlaAsp: 4.486 ± 0.057
AlaGlu: 6.065 ± 0.066
AlaPhe: 2.943 ± 0.039
AlaGly: 6.197 ± 0.064
AlaHis: 1.67 ± 0.036
AlaIle: 6.724 ± 0.07
AlaLys: 3.197 ± 0.051
AlaLeu: 10.264 ± 0.101
AlaMet: 1.823 ± 0.032
AlaAsn: 3.002 ± 0.056
AlaPro: 3.564 ± 0.058
AlaGln: 5.157 ± 0.074
AlaArg: 4.451 ± 0.064
AlaSer: 4.56 ± 0.047
AlaThr: 4.97 ± 0.06
AlaVal: 5.975 ± 0.067
AlaTrp: 1.195 ± 0.027
AlaTyr: 2.429 ± 0.039
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.684 ± 0.024
CysCys: 0.206 ± 0.012
CysAsp: 0.573 ± 0.018
CysGlu: 0.514 ± 0.016
CysPhe: 0.439 ± 0.016
CysGly: 0.867 ± 0.027
CysHis: 0.318 ± 0.016
CysIle: 0.506 ± 0.019
CysLys: 0.303 ± 0.014
CysLeu: 1.348 ± 0.033
CysMet: 0.17 ± 0.011
CysAsn: 0.355 ± 0.015
CysPro: 0.674 ± 0.023
CysGln: 0.689 ± 0.022
CysArg: 0.664 ± 0.021
CysSer: 0.695 ± 0.018
CysThr: 0.531 ± 0.021
CysVal: 0.568 ± 0.017
CysTrp: 0.2 ± 0.011
CysTyr: 0.383 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.313 ± 0.049
AspCys: 0.56 ± 0.018
AspAsp: 1.797 ± 0.04
AspGlu: 2.311 ± 0.046
AspPhe: 2.097 ± 0.039
AspGly: 3.02 ± 0.05
AspHis: 1.013 ± 0.025
AspIle: 2.525 ± 0.046
AspLys: 1.38 ± 0.034
AspLeu: 6.712 ± 0.07
AspMet: 0.72 ± 0.021
AspAsn: 1.346 ± 0.031
AspPro: 3.045 ± 0.043
AspGln: 2.813 ± 0.04
AspArg: 4.423 ± 0.052
AspSer: 2.391 ± 0.042
AspThr: 2.034 ± 0.036
AspVal: 2.663 ± 0.05
AspTrp: 1.055 ± 0.028
AspTyr: 1.824 ± 0.032
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.626 ± 0.069
GluCys: 0.409 ± 0.016
GluAsp: 2.567 ± 0.044
GluGlu: 3.548 ± 0.06
GluPhe: 2.154 ± 0.034
GluGly: 3.367 ± 0.046
GluHis: 1.14 ± 0.031
GluIle: 3.772 ± 0.051
GluLys: 2.361 ± 0.045
GluLeu: 7.06 ± 0.073
GluMet: 1.264 ± 0.03
GluAsn: 1.894 ± 0.038
GluPro: 2.676 ± 0.051
GluGln: 4.309 ± 0.06
GluArg: 3.87 ± 0.06
GluSer: 2.898 ± 0.049
GluThr: 3.428 ± 0.045
GluVal: 4.151 ± 0.057
GluTrp: 0.835 ± 0.023
GluTyr: 1.467 ± 0.039
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.233 ± 0.046
PheCys: 0.59 ± 0.019
PheAsp: 2.001 ± 0.034
PheGlu: 1.924 ± 0.034
PhePhe: 1.603 ± 0.032
PheGly: 2.722 ± 0.042
PheHis: 0.752 ± 0.022
PheIle: 2.052 ± 0.036
PheLys: 1.309 ± 0.032
PheLeu: 4.117 ± 0.059
PheMet: 0.676 ± 0.022
PheAsn: 1.536 ± 0.031
PhePro: 1.903 ± 0.034
PheGln: 1.885 ± 0.035
PheArg: 2.036 ± 0.038
PheSer: 2.701 ± 0.041
PheThr: 2.221 ± 0.038
PheVal: 2.235 ± 0.041
PheTrp: 0.789 ± 0.023
PheTyr: 1.335 ± 0.029
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.082 ± 0.071
GlyCys: 0.892 ± 0.023
GlyAsp: 3.155 ± 0.048
GlyGlu: 3.987 ± 0.055
GlyPhe: 2.967 ± 0.049
GlyGly: 4.903 ± 0.077
GlyHis: 1.343 ± 0.032
GlyIle: 4.698 ± 0.061
GlyLys: 3.154 ± 0.048
GlyLeu: 8.214 ± 0.082
GlyMet: 1.743 ± 0.034
GlyAsn: 2.348 ± 0.043
GlyPro: 1.895 ± 0.039
GlyGln: 4.053 ± 0.06
GlyArg: 3.676 ± 0.044
GlySer: 4.137 ± 0.056
GlyThr: 3.832 ± 0.056
GlyVal: 4.824 ± 0.061
GlyTrp: 1.34 ± 0.029
GlyTyr: 2.42 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.332 ± 0.029
HisCys: 0.372 ± 0.015
HisAsp: 0.792 ± 0.025
HisGlu: 0.866 ± 0.024
HisPhe: 0.891 ± 0.026
HisGly: 1.202 ± 0.03
HisHis: 0.774 ± 0.028
HisIle: 0.933 ± 0.028
HisLys: 0.581 ± 0.021
HisLeu: 3.022 ± 0.051
HisMet: 0.251 ± 0.012
HisAsn: 0.629 ± 0.02
HisPro: 1.827 ± 0.037
HisGln: 1.351 ± 0.032
HisArg: 1.463 ± 0.03
HisSer: 1.22 ± 0.025
HisThr: 0.981 ± 0.025
HisVal: 0.857 ± 0.022
HisTrp: 0.463 ± 0.019
HisTyr: 0.815 ± 0.026
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.912 ± 0.061
IleCys: 0.702 ± 0.024
IleAsp: 3.07 ± 0.046
IleGlu: 3.502 ± 0.049
IlePhe: 2.083 ± 0.036
IleGly: 4.112 ± 0.052
IleHis: 1.286 ± 0.032
IleIle: 2.507 ± 0.043
IleLys: 1.95 ± 0.037
IleLeu: 6.323 ± 0.078
IleMet: 0.729 ± 0.023
IleAsn: 2.184 ± 0.039
IlePro: 3.356 ± 0.053
IleGln: 2.915 ± 0.044
IleArg: 3.094 ± 0.04
IleSer: 3.586 ± 0.055
IleThr: 3.28 ± 0.047
IleVal: 3.925 ± 0.061
IleTrp: 0.82 ± 0.025
IleTyr: 1.77 ± 0.041
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.187 ± 0.046
LysCys: 0.224 ± 0.012
LysAsp: 1.561 ± 0.033
LysGlu: 1.891 ± 0.04
LysPhe: 1.218 ± 0.026
LysGly: 2.169 ± 0.043
LysHis: 0.703 ± 0.02
LysIle: 2.217 ± 0.04
LysLys: 1.361 ± 0.035
LysLeu: 4.272 ± 0.054
LysMet: 0.644 ± 0.021
LysAsn: 1.229 ± 0.031
LysPro: 2.199 ± 0.034
LysGln: 2.192 ± 0.042
LysArg: 2.025 ± 0.037
LysSer: 2.08 ± 0.04
LysThr: 2.255 ± 0.038
LysVal: 2.503 ± 0.044
LysTrp: 0.358 ± 0.017
LysTyr: 0.952 ± 0.033
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.448 ± 0.1
LeuCys: 1.192 ± 0.027
LeuAsp: 5.514 ± 0.057
LeuGlu: 7.768 ± 0.079
LeuPhe: 3.986 ± 0.064
LeuGly: 8.352 ± 0.089
LeuHis: 2.319 ± 0.045
LeuIle: 6.778 ± 0.061
LeuLys: 4.95 ± 0.064
LeuLeu: 13.752 ± 0.147
LeuMet: 2.206 ± 0.039
LeuAsn: 4.547 ± 0.055
LeuPro: 6.904 ± 0.083
LeuGln: 7.283 ± 0.095
LeuArg: 6.687 ± 0.073
LeuSer: 8.129 ± 0.088
LeuThr: 6.787 ± 0.079
LeuVal: 7.888 ± 0.08
LeuTrp: 1.717 ± 0.038
LeuTyr: 2.86 ± 0.047
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.061 ± 0.036
MetCys: 0.106 ± 0.008
MetAsp: 0.853 ± 0.022
MetGlu: 1.027 ± 0.029
MetPhe: 0.51 ± 0.017
MetGly: 1.505 ± 0.031
MetHis: 0.309 ± 0.014
MetIle: 1.051 ± 0.028
MetLys: 0.764 ± 0.021
MetLeu: 1.964 ± 0.037
MetMet: 0.403 ± 0.017
MetAsn: 0.777 ± 0.018
MetPro: 0.99 ± 0.024
MetGln: 0.98 ± 0.028
MetArg: 0.897 ± 0.026
MetSer: 1.123 ± 0.027
MetThr: 1.207 ± 0.029
MetVal: 1.439 ± 0.03
MetTrp: 0.123 ± 0.01
MetTyr: 0.345 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.554 ± 0.042
AsnCys: 0.448 ± 0.017
AsnAsp: 1.292 ± 0.031
AsnGlu: 1.316 ± 0.026
AsnPhe: 1.498 ± 0.031
AsnGly: 2.245 ± 0.045
AsnHis: 0.787 ± 0.021
AsnIle: 1.623 ± 0.039
AsnLys: 0.945 ± 0.029
AsnLeu: 4.907 ± 0.074
AsnMet: 0.48 ± 0.015
AsnAsn: 1.112 ± 0.031
AsnPro: 2.806 ± 0.044
AsnGln: 2.245 ± 0.04
AsnArg: 2.312 ± 0.037
AsnSer: 2.094 ± 0.041
AsnThr: 1.676 ± 0.034
AsnVal: 1.791 ± 0.033
AsnTrp: 0.685 ± 0.021
AsnTyr: 1.169 ± 0.03
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.803 ± 0.082
ProCys: 0.476 ± 0.02
ProAsp: 3.326 ± 0.046
ProGlu: 4.134 ± 0.053
ProPhe: 1.905 ± 0.039
ProGly: 3.795 ± 0.054
ProHis: 1.103 ± 0.029
ProIle: 2.992 ± 0.041
ProLys: 1.632 ± 0.033
ProLeu: 6.228 ± 0.075
ProMet: 0.893 ± 0.025
ProAsn: 1.786 ± 0.041
ProPro: 3.312 ± 0.063
ProGln: 3.354 ± 0.052
ProArg: 2.319 ± 0.039
ProSer: 3.275 ± 0.068
ProThr: 3.207 ± 0.051
ProVal: 3.67 ± 0.053
ProTrp: 0.806 ± 0.021
ProTyr: 1.476 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.977 ± 0.072
GlnCys: 0.442 ± 0.016
GlnAsp: 2.365 ± 0.037
GlnGlu: 3.412 ± 0.057
GlnPhe: 2.204 ± 0.038
GlnGly: 3.909 ± 0.054
GlnHis: 1.198 ± 0.03
GlnIle: 3.78 ± 0.055
GlnLys: 2.045 ± 0.041
GlnLeu: 7.16 ± 0.085
GlnMet: 1.131 ± 0.03
GlnAsn: 1.727 ± 0.033
GlnPro: 3.621 ± 0.057
GlnGln: 4.826 ± 0.099
GlnArg: 3.9 ± 0.056
GlnSer: 3.482 ± 0.053
GlnThr: 3.742 ± 0.061
GlnVal: 4.555 ± 0.058
GlnTrp: 0.876 ± 0.024
GlnTyr: 1.231 ± 0.027
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.035 ± 0.053
ArgCys: 0.634 ± 0.022
ArgAsp: 2.79 ± 0.044
ArgGlu: 3.397 ± 0.053
ArgPhe: 2.459 ± 0.04
ArgGly: 3.422 ± 0.05
ArgHis: 1.209 ± 0.029
ArgIle: 3.335 ± 0.038
ArgLys: 2.035 ± 0.042
ArgLeu: 7.324 ± 0.059
ArgMet: 1.148 ± 0.029
ArgAsn: 1.929 ± 0.036
ArgPro: 2.737 ± 0.044
ArgGln: 4.27 ± 0.066
ArgArg: 3.484 ± 0.054
ArgSer: 4.023 ± 0.048
ArgThr: 2.873 ± 0.047
ArgVal: 3.926 ± 0.052
ArgTrp: 1.121 ± 0.027
ArgTyr: 2.044 ± 0.035
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.604 ± 0.053
SerCys: 0.601 ± 0.023
SerAsp: 2.911 ± 0.041
SerGlu: 3.283 ± 0.047
SerPhe: 2.358 ± 0.038
SerGly: 4.555 ± 0.058
SerHis: 1.37 ± 0.034
SerIle: 3.032 ± 0.052
SerLys: 1.886 ± 0.038
SerLeu: 7.818 ± 0.084
SerMet: 1.103 ± 0.026
SerAsn: 2.011 ± 0.035
SerPro: 4.01 ± 0.074
SerGln: 3.561 ± 0.054
SerArg: 3.429 ± 0.047
SerSer: 4.211 ± 0.068
SerThr: 3.303 ± 0.047
SerVal: 3.512 ± 0.053
SerTrp: 0.911 ± 0.026
SerTyr: 1.747 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.588 ± 0.069
ThrCys: 0.563 ± 0.02
ThrAsp: 2.583 ± 0.04
ThrGlu: 3.155 ± 0.044
ThrPhe: 1.929 ± 0.037
ThrGly: 4.364 ± 0.054
ThrHis: 1.107 ± 0.028
ThrIle: 3.321 ± 0.053
ThrLys: 1.447 ± 0.03
ThrLeu: 7.025 ± 0.072
ThrMet: 0.832 ± 0.025
ThrAsn: 1.643 ± 0.035
ThrPro: 3.637 ± 0.051
ThrGln: 2.748 ± 0.049
ThrArg: 2.578 ± 0.039
ThrSer: 3.025 ± 0.042
ThrThr: 3.258 ± 0.05
ThrVal: 4.01 ± 0.051
ThrTrp: 0.845 ± 0.024
ThrTyr: 1.652 ± 0.033
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.96 ± 0.06
ValCys: 0.723 ± 0.02
ValAsp: 3.35 ± 0.045
ValGlu: 4.254 ± 0.05
ValPhe: 2.432 ± 0.043
ValGly: 4.623 ± 0.061
ValHis: 1.123 ± 0.029
ValIle: 4.024 ± 0.052
ValLys: 2.555 ± 0.042
ValLeu: 7.542 ± 0.07
ValMet: 1.454 ± 0.029
ValAsn: 2.475 ± 0.041
ValPro: 3.159 ± 0.049
ValGln: 3.507 ± 0.046
ValArg: 3.644 ± 0.043
ValSer: 3.898 ± 0.04
ValThr: 3.629 ± 0.054
ValVal: 4.817 ± 0.059
ValTrp: 0.92 ± 0.024
ValTyr: 1.817 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.024 ± 0.028
TrpCys: 0.176 ± 0.01
TrpAsp: 0.667 ± 0.023
TrpGlu: 0.899 ± 0.022
TrpPhe: 0.672 ± 0.02
TrpGly: 1.117 ± 0.028
TrpHis: 0.383 ± 0.015
TrpIle: 0.889 ± 0.025
TrpLys: 0.549 ± 0.019
TrpLeu: 2.25 ± 0.048
TrpMet: 0.361 ± 0.015
TrpAsn: 0.549 ± 0.018
TrpPro: 0.484 ± 0.019
TrpGln: 1.564 ± 0.037
TrpArg: 0.898 ± 0.024
TrpSer: 0.962 ± 0.023
TrpThr: 0.659 ± 0.019
TrpVal: 1.037 ± 0.026
TrpTrp: 0.287 ± 0.015
TrpTyr: 0.442 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.214 ± 0.038
TyrCys: 0.406 ± 0.015
TyrAsp: 1.387 ± 0.03
TyrGlu: 1.542 ± 0.036
TyrPhe: 1.272 ± 0.036
TyrGly: 2.219 ± 0.042
TyrHis: 0.716 ± 0.023
TyrIle: 1.346 ± 0.031
TyrLys: 0.835 ± 0.027
TyrLeu: 3.769 ± 0.056
TyrMet: 0.398 ± 0.015
TyrAsn: 0.906 ± 0.028
TyrPro: 1.695 ± 0.031
TyrGln: 1.883 ± 0.032
TyrArg: 2.346 ± 0.038
TyrSer: 1.727 ± 0.033
TyrThr: 1.436 ± 0.024
TyrVal: 1.559 ± 0.032
TyrTrp: 0.522 ± 0.019
TyrTyr: 1.011 ± 0.028
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5232 proteins (1607800 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
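Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported. This sketch assumes the reported error is the standard deviation over the 100 replicates, which the page does not state explicitly. The function names and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide in one-letter sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MALLAALL", "MKLLV", "MAAAG", "MLLLLK"]
value = pair_per_mille(toy, "LL")
error = bootstrap_error(toy, "LL")
print(f"LeuLeu: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.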

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski