Amino acid dipepetide frequency for Leptolyngbya boryana NIES-2135

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.35AlaAla: 8.35 ± 0.097
0.863AlaCys: 0.863 ± 0.02
4.292AlaAsp: 4.292 ± 0.059
5.911AlaGlu: 5.911 ± 0.055
3.283AlaPhe: 3.283 ± 0.043
5.76AlaGly: 5.76 ± 0.07
1.533AlaHis: 1.533 ± 0.026
8.172AlaIle: 8.172 ± 0.072
3.941AlaLys: 3.941 ± 0.051
9.66AlaLeu: 9.66 ± 0.082
1.997AlaMet: 1.997 ± 0.033
3.225AlaAsn: 3.225 ± 0.049
3.624AlaPro: 3.624 ± 0.058
5.154AlaGln: 5.154 ± 0.058
4.337AlaArg: 4.337 ± 0.045
5.146AlaSer: 5.146 ± 0.057
5.154AlaThr: 5.154 ± 0.068
5.779AlaVal: 5.779 ± 0.064
1.19AlaTrp: 1.19 ± 0.025
2.383AlaTyr: 2.383 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
0.724CysAla: 0.724 ± 0.021
0.16CysCys: 0.16 ± 0.008
0.611CysAsp: 0.611 ± 0.017
0.523CysGlu: 0.523 ± 0.017
0.431CysPhe: 0.431 ± 0.015
0.742CysGly: 0.742 ± 0.021
0.264CysHis: 0.264 ± 0.011
0.496CysIle: 0.496 ± 0.015
0.306CysLys: 0.306 ± 0.014
1.055CysLeu: 1.055 ± 0.028
0.16CysMet: 0.16 ± 0.009
0.351CysAsn: 0.351 ± 0.015
0.551CysPro: 0.551 ± 0.017
0.533CysGln: 0.533 ± 0.016
0.587CysArg: 0.587 ± 0.021
0.615CysSer: 0.615 ± 0.018
0.471CysThr: 0.471 ± 0.016
0.576CysVal: 0.576 ± 0.016
0.142CysTrp: 0.142 ± 0.009
0.32CysTyr: 0.32 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
4.397AspAla: 4.397 ± 0.063
0.561AspCys: 0.561 ± 0.016
2.315AspAsp: 2.315 ± 0.044
2.999AspGlu: 2.999 ± 0.04
2.21AspPhe: 2.21 ± 0.036
3.262AspGly: 3.262 ± 0.05
0.988AspHis: 0.988 ± 0.024
2.634AspIle: 2.634 ± 0.036
1.494AspLys: 1.494 ± 0.03
6.197AspLeu: 6.197 ± 0.061
0.8AspMet: 0.8 ± 0.018
1.341AspAsn: 1.341 ± 0.029
2.846AspPro: 2.846 ± 0.044
2.664AspGln: 2.664 ± 0.042
5.125AspArg: 5.125 ± 0.056
2.915AspSer: 2.915 ± 0.046
2.217AspThr: 2.217 ± 0.046
3.139AspVal: 3.139 ± 0.046
0.964AspTrp: 0.964 ± 0.023
1.653AspTyr: 1.653 ± 0.032
0.0AspXaa: 0.0 ± 0.0
Glu
6.052GluAla: 6.052 ± 0.065
0.491GluCys: 0.491 ± 0.018
2.661GluAsp: 2.661 ± 0.042
3.545GluGlu: 3.545 ± 0.054
2.548GluPhe: 2.548 ± 0.035
3.117GluGly: 3.117 ± 0.041
1.109GluHis: 1.109 ± 0.027
4.521GluIle: 4.521 ± 0.047
2.869GluLys: 2.869 ± 0.047
6.834GluLeu: 6.834 ± 0.063
1.418GluMet: 1.418 ± 0.03
2.17GluAsn: 2.17 ± 0.035
2.653GluPro: 2.653 ± 0.043
3.851GluGln: 3.851 ± 0.047
3.701GluArg: 3.701 ± 0.051
3.621GluSer: 3.621 ± 0.044
3.744GluThr: 3.744 ± 0.047
4.231GluVal: 4.231 ± 0.052
0.829GluTrp: 0.829 ± 0.022
1.525GluTyr: 1.525 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
3.389PheAla: 3.389 ± 0.043
0.514PheCys: 0.514 ± 0.017
2.415PheAsp: 2.415 ± 0.04
2.42PheGlu: 2.42 ± 0.039
1.622PhePhe: 1.622 ± 0.031
2.98PheGly: 2.98 ± 0.047
0.769PheHis: 0.769 ± 0.019
1.974PheIle: 1.974 ± 0.032
1.391PheLys: 1.391 ± 0.029
4.028PheLeu: 4.028 ± 0.047
0.675PheMet: 0.675 ± 0.02
1.56PheAsn: 1.56 ± 0.03
1.84PhePro: 1.84 ± 0.034
1.938PheGln: 1.938 ± 0.035
2.121PheArg: 2.121 ± 0.036
2.995PheSer: 2.995 ± 0.039
2.266PheThr: 2.266 ± 0.036
2.603PheVal: 2.603 ± 0.035
0.697PheTrp: 0.697 ± 0.026
1.246PheTyr: 1.246 ± 0.024
0.0PheXaa: 0.0 ± 0.0
Gly
5.313GlyAla: 5.313 ± 0.065
0.753GlyCys: 0.753 ± 0.019
3.265GlyAsp: 3.265 ± 0.058
3.853GlyGlu: 3.853 ± 0.044
3.065GlyPhe: 3.065 ± 0.046
4.623GlyGly: 4.623 ± 0.078
1.219GlyHis: 1.219 ± 0.031
4.837GlyIle: 4.837 ± 0.052
3.4GlyLys: 3.4 ± 0.048
6.938GlyLeu: 6.938 ± 0.059
1.585GlyMet: 1.585 ± 0.031
2.68GlyAsn: 2.68 ± 0.058
0.9GlyPro: 0.9 ± 0.024
3.288GlyGln: 3.288 ± 0.049
3.535GlyArg: 3.535 ± 0.043
4.282GlySer: 4.282 ± 0.069
4.128GlyThr: 4.128 ± 0.067
4.579GlyVal: 4.579 ± 0.054
1.154GlyTrp: 1.154 ± 0.028
2.216GlyTyr: 2.216 ± 0.031
0.0GlyXaa: 0.0 ± 0.0
His
1.323HisAla: 1.323 ± 0.026
0.303HisCys: 0.303 ± 0.012
0.923HisAsp: 0.923 ± 0.022
0.998HisGlu: 0.998 ± 0.025
0.821HisPhe: 0.821 ± 0.02
1.104HisGly: 1.104 ± 0.023
0.677HisHis: 0.677 ± 0.023
0.951HisIle: 0.951 ± 0.021
0.54HisLys: 0.54 ± 0.018
2.408HisLeu: 2.408 ± 0.039
0.272HisMet: 0.272 ± 0.011
0.566HisAsn: 0.566 ± 0.019
1.434HisPro: 1.434 ± 0.031
1.262HisGln: 1.262 ± 0.032
1.279HisArg: 1.279 ± 0.027
1.25HisSer: 1.25 ± 0.026
0.902HisThr: 0.902 ± 0.024
0.937HisVal: 0.937 ± 0.025
0.384HisTrp: 0.384 ± 0.015
0.677HisTyr: 0.677 ± 0.017
0.0HisXaa: 0.0 ± 0.0
Ile
8.084IleAla: 8.084 ± 0.062
0.676IleCys: 0.676 ± 0.018
3.906IleAsp: 3.906 ± 0.051
4.803IleGlu: 4.803 ± 0.055
2.111IlePhe: 2.111 ± 0.038
4.508IleGly: 4.508 ± 0.057
1.225IleHis: 1.225 ± 0.024
2.458IleIle: 2.458 ± 0.034
1.991IleLys: 1.991 ± 0.034
6.222IleLeu: 6.222 ± 0.054
0.77IleMet: 0.77 ± 0.019
2.104IleAsn: 2.104 ± 0.035
3.36IlePro: 3.36 ± 0.039
3.45IleGln: 3.45 ± 0.045
3.54IleArg: 3.54 ± 0.043
3.999IleSer: 3.999 ± 0.056
3.282IleThr: 3.282 ± 0.061
4.807IleVal: 4.807 ± 0.05
0.838IleTrp: 0.838 ± 0.022
1.598IleTyr: 1.598 ± 0.028
0.0IleXaa: 0.0 ± 0.0
Lys
3.847LysAla: 3.847 ± 0.051
0.255LysCys: 0.255 ± 0.012
1.852LysAsp: 1.852 ± 0.035
2.095LysGlu: 2.095 ± 0.04
1.489LysPhe: 1.489 ± 0.026
2.337LysGly: 2.337 ± 0.041
0.759LysHis: 0.759 ± 0.018
2.583LysIle: 2.583 ± 0.034
1.656LysLys: 1.656 ± 0.035
4.602LysLeu: 4.602 ± 0.054
0.812LysMet: 0.812 ± 0.016
1.364LysAsn: 1.364 ± 0.024
2.366LysPro: 2.366 ± 0.038
2.404LysGln: 2.404 ± 0.037
2.423LysArg: 2.423 ± 0.042
2.686LysSer: 2.686 ± 0.04
2.69LysThr: 2.69 ± 0.035
2.743LysVal: 2.743 ± 0.039
0.432LysTrp: 0.432 ± 0.015
0.946LysTyr: 0.946 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
9.606LeuAla: 9.606 ± 0.075
1.048LeuCys: 1.048 ± 0.026
5.708LeuAsp: 5.708 ± 0.051
6.994LeuGlu: 6.994 ± 0.059
3.823LeuPhe: 3.823 ± 0.05
7.261LeuGly: 7.261 ± 0.064
1.984LeuHis: 1.984 ± 0.035
6.508LeuIle: 6.508 ± 0.066
5.468LeuLys: 5.468 ± 0.058
11.052LeuLeu: 11.052 ± 0.101
2.301LeuMet: 2.301 ± 0.034
4.548LeuAsn: 4.548 ± 0.059
5.638LeuPro: 5.638 ± 0.068
5.513LeuGln: 5.513 ± 0.063
6.111LeuArg: 6.111 ± 0.063
7.997LeuSer: 7.997 ± 0.067
6.405LeuThr: 6.405 ± 0.062
6.739LeuVal: 6.739 ± 0.071
1.437LeuTrp: 1.437 ± 0.035
2.671LeuTyr: 2.671 ± 0.037
0.0LeuXaa: 0.0 ± 0.0
Met
1.709MetAla: 1.709 ± 0.031
0.123MetCys: 0.123 ± 0.007
0.756MetAsp: 0.756 ± 0.021
0.889MetGlu: 0.889 ± 0.021
0.587MetPhe: 0.587 ± 0.018
1.286MetGly: 1.286 ± 0.027
0.316MetHis: 0.316 ± 0.013
1.232MetIle: 1.232 ± 0.03
0.953MetLys: 0.953 ± 0.023
2.117MetLeu: 2.117 ± 0.036
0.53MetMet: 0.53 ± 0.018
0.916MetAsn: 0.916 ± 0.024
1.121MetPro: 1.121 ± 0.024
1.186MetGln: 1.186 ± 0.025
1.203MetArg: 1.203 ± 0.027
1.547MetSer: 1.547 ± 0.028
1.468MetThr: 1.468 ± 0.028
1.267MetVal: 1.267 ± 0.023
0.158MetTrp: 0.158 ± 0.01
0.357MetTyr: 0.357 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
3.278AsnAla: 3.278 ± 0.048
0.382AsnCys: 0.382 ± 0.014
1.645AsnAsp: 1.645 ± 0.044
1.626AsnGlu: 1.626 ± 0.029
1.576AsnPhe: 1.576 ± 0.031
2.588AsnGly: 2.588 ± 0.054
0.731AsnHis: 0.731 ± 0.021
1.817AsnIle: 1.817 ± 0.034
0.924AsnLys: 0.924 ± 0.029
4.743AsnLeu: 4.743 ± 0.047
0.553AsnMet: 0.553 ± 0.017
1.178AsnAsn: 1.178 ± 0.032
2.705AsnPro: 2.705 ± 0.041
2.465AsnGln: 2.465 ± 0.037
2.443AsnArg: 2.443 ± 0.039
2.436AsnSer: 2.436 ± 0.046
1.894AsnThr: 1.894 ± 0.046
2.095AsnVal: 2.095 ± 0.037
0.676AsnTrp: 0.676 ± 0.019
1.125AsnTyr: 1.125 ± 0.026
0.0AsnXaa: 0.0 ± 0.0
Pro
3.824ProAla: 3.824 ± 0.051
0.361ProCys: 0.361 ± 0.014
3.174ProAsp: 3.174 ± 0.044
3.96ProGlu: 3.96 ± 0.047
1.809ProPhe: 1.809 ± 0.033
3.014ProGly: 3.014 ± 0.045
0.888ProHis: 0.888 ± 0.023
3.282ProIle: 3.282 ± 0.041
2.176ProLys: 2.176 ± 0.04
4.421ProLeu: 4.421 ± 0.047
0.922ProMet: 0.922 ± 0.022
2.209ProAsn: 2.209 ± 0.039
2.281ProPro: 2.281 ± 0.051
2.369ProGln: 2.369 ± 0.039
2.003ProArg: 2.003 ± 0.037
3.085ProSer: 3.085 ± 0.042
3.234ProThr: 3.234 ± 0.049
3.271ProVal: 3.271 ± 0.044
0.636ProTrp: 0.636 ± 0.018
1.222ProTyr: 1.222 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
5.668GlnAla: 5.668 ± 0.061
0.408GlnCys: 0.408 ± 0.014
2.274GlnAsp: 2.274 ± 0.039
3.111GlnGlu: 3.111 ± 0.04
2.194GlnPhe: 2.194 ± 0.036
3.412GlnGly: 3.412 ± 0.042
1.091GlnHis: 1.091 ± 0.028
3.784GlnIle: 3.784 ± 0.044
2.311GlnLys: 2.311 ± 0.037
5.949GlnLeu: 5.949 ± 0.065
1.171GlnMet: 1.171 ± 0.024
1.965GlnAsn: 1.965 ± 0.034
2.832GlnPro: 2.832 ± 0.045
3.885GlnGln: 3.885 ± 0.065
3.314GlnArg: 3.314 ± 0.043
3.631GlnSer: 3.631 ± 0.045
3.408GlnThr: 3.408 ± 0.047
4.0GlnVal: 4.0 ± 0.049
0.746GlnTrp: 0.746 ± 0.021
1.358GlnTyr: 1.358 ± 0.025
0.0GlnXaa: 0.0 ± 0.0
Arg
4.267ArgAla: 4.267 ± 0.052
0.523ArgCys: 0.523 ± 0.015
3.113ArgAsp: 3.113 ± 0.043
3.549ArgGlu: 3.549 ± 0.047
2.631ArgPhe: 2.631 ± 0.041
3.183ArgGly: 3.183 ± 0.045
1.112ArgHis: 1.112 ± 0.023
3.732ArgIle: 3.732 ± 0.043
2.381ArgLys: 2.381 ± 0.037
6.567ArgLeu: 6.567 ± 0.063
1.276ArgMet: 1.276 ± 0.027
2.185ArgAsn: 2.185 ± 0.031
2.366ArgPro: 2.366 ± 0.04
3.482ArgGln: 3.482 ± 0.045
3.49ArgArg: 3.49 ± 0.049
4.789ArgSer: 4.789 ± 0.052
3.019ArgThr: 3.019 ± 0.037
3.855ArgVal: 3.855 ± 0.043
0.946ArgTrp: 0.946 ± 0.025
1.99ArgTyr: 1.99 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
5.578SerAla: 5.578 ± 0.066
0.568SerCys: 0.568 ± 0.019
3.624SerAsp: 3.624 ± 0.046
4.075SerGlu: 4.075 ± 0.047
2.589SerPhe: 2.589 ± 0.037
4.879SerGly: 4.879 ± 0.065
1.284SerHis: 1.284 ± 0.028
4.21SerIle: 4.21 ± 0.055
2.286SerLys: 2.286 ± 0.037
7.161SerLeu: 7.161 ± 0.062
1.321SerMet: 1.321 ± 0.025
2.489SerAsn: 2.489 ± 0.044
3.521SerPro: 3.521 ± 0.052
3.51SerGln: 3.51 ± 0.043
3.644SerArg: 3.644 ± 0.045
4.625SerSer: 4.625 ± 0.065
3.918SerThr: 3.918 ± 0.064
4.171SerVal: 4.171 ± 0.047
0.883SerTrp: 0.883 ± 0.019
1.682SerTyr: 1.682 ± 0.038
0.0SerXaa: 0.0 ± 0.0
Thr
5.249ThrAla: 5.249 ± 0.065
0.495ThrCys: 0.495 ± 0.016
2.659ThrAsp: 2.659 ± 0.043
3.33ThrGlu: 3.33 ± 0.043
2.192ThrPhe: 2.192 ± 0.04
4.269ThrGly: 4.269 ± 0.064
1.056ThrHis: 1.056 ± 0.023
4.146ThrIle: 4.146 ± 0.067
1.877ThrLys: 1.877 ± 0.031
6.605ThrLeu: 6.605 ± 0.06
0.88ThrMet: 0.88 ± 0.022
1.923ThrAsn: 1.923 ± 0.037
3.471ThrPro: 3.471 ± 0.054
3.173ThrGln: 3.173 ± 0.046
2.814ThrArg: 2.814 ± 0.039
3.409ThrSer: 3.409 ± 0.057
3.375ThrThr: 3.375 ± 0.063
4.283ThrVal: 4.283 ± 0.056
0.781ThrTrp: 0.781 ± 0.022
1.522ThrTyr: 1.522 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
5.856ValAla: 5.856 ± 0.06
0.636ValCys: 0.636 ± 0.017
3.442ValAsp: 3.442 ± 0.04
4.389ValGlu: 4.389 ± 0.052
2.588ValPhe: 2.588 ± 0.038
4.462ValGly: 4.462 ± 0.059
1.071ValHis: 1.071 ± 0.026
4.117ValIle: 4.117 ± 0.049
2.888ValLys: 2.888 ± 0.039
7.199ValLeu: 7.199 ± 0.068
1.512ValMet: 1.512 ± 0.03
2.503ValAsn: 2.503 ± 0.044
3.121ValPro: 3.121 ± 0.041
3.402ValGln: 3.402 ± 0.043
3.73ValArg: 3.73 ± 0.046
4.306ValSer: 4.306 ± 0.052
3.828ValThr: 3.828 ± 0.059
4.574ValVal: 4.574 ± 0.053
0.914ValTrp: 0.914 ± 0.026
1.654ValTyr: 1.654 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
1.063TrpAla: 1.063 ± 0.024
0.153TrpCys: 0.153 ± 0.008
0.616TrpAsp: 0.616 ± 0.017
0.807TrpGlu: 0.807 ± 0.023
0.677TrpPhe: 0.677 ± 0.02
0.872TrpGly: 0.872 ± 0.02
0.347TrpHis: 0.347 ± 0.013
1.032TrpIle: 1.032 ± 0.025
0.668TrpLys: 0.668 ± 0.02
1.834TrpLeu: 1.834 ± 0.036
0.391TrpMet: 0.391 ± 0.014
0.714TrpAsn: 0.714 ± 0.02
0.071TrpPro: 0.071 ± 0.006
1.186TrpGln: 1.186 ± 0.025
0.885TrpArg: 0.885 ± 0.023
0.942TrpSer: 0.942 ± 0.021
0.732TrpThr: 0.732 ± 0.02
0.936TrpVal: 0.936 ± 0.024
0.248TrpTrp: 0.248 ± 0.012
0.412TrpTyr: 0.412 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.178TyrAla: 2.178 ± 0.035
0.346TyrCys: 0.346 ± 0.015
1.473TyrAsp: 1.473 ± 0.041
1.612TyrGlu: 1.612 ± 0.029
1.199TyrPhe: 1.199 ± 0.028
1.954TyrGly: 1.954 ± 0.033
0.59TyrHis: 0.59 ± 0.017
1.244TyrIle: 1.244 ± 0.028
0.823TyrLys: 0.823 ± 0.022
3.216TyrLeu: 3.216 ± 0.04
0.375TyrMet: 0.375 ± 0.014
0.888TyrAsn: 0.888 ± 0.023
1.442TyrPro: 1.442 ± 0.029
1.797TyrGln: 1.797 ± 0.035
2.168TyrArg: 2.168 ± 0.035
1.742TyrSer: 1.742 ± 0.03
1.373TyrThr: 1.373 ± 0.027
1.61TyrVal: 1.61 ± 0.025
0.529TyrTrp: 0.529 ± 0.018
0.881TyrTyr: 0.881 ± 0.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6678 proteins (2078917 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski