Amino acid dipepetide frequency for Nostoc punctiforme (strain ATCC 29133 / PCC 73102)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.259AlaAla: 7.259 ± 0.076
0.777AlaCys: 0.777 ± 0.023
3.984AlaAsp: 3.984 ± 0.055
5.157AlaGlu: 5.157 ± 0.055
3.012AlaPhe: 3.012 ± 0.039
5.404AlaGly: 5.404 ± 0.069
1.286AlaHis: 1.286 ± 0.024
7.003AlaIle: 7.003 ± 0.066
4.177AlaLys: 4.177 ± 0.05
8.544AlaLeu: 8.544 ± 0.075
1.56AlaMet: 1.56 ± 0.03
3.603AlaAsn: 3.603 ± 0.054
2.751AlaPro: 2.751 ± 0.037
4.13AlaGln: 4.13 ± 0.048
3.414AlaArg: 3.414 ± 0.045
5.018AlaSer: 5.018 ± 0.058
4.77AlaThr: 4.77 ± 0.052
5.474AlaVal: 5.474 ± 0.054
1.006AlaTrp: 1.006 ± 0.02
2.384AlaTyr: 2.384 ± 0.035
0.0AlaXaa: 0.0 ± 0.0
Cys
0.598CysAla: 0.598 ± 0.017
0.158CysCys: 0.158 ± 0.007
0.537CysAsp: 0.537 ± 0.018
0.489CysGlu: 0.489 ± 0.014
0.394CysPhe: 0.394 ± 0.014
0.724CysGly: 0.724 ± 0.021
0.272CysHis: 0.272 ± 0.013
0.598CysIle: 0.598 ± 0.016
0.394CysLys: 0.394 ± 0.013
1.071CysLeu: 1.071 ± 0.021
0.141CysMet: 0.141 ± 0.009
0.416CysAsn: 0.416 ± 0.015
0.491CysPro: 0.491 ± 0.018
0.564CysGln: 0.564 ± 0.014
0.496CysArg: 0.496 ± 0.017
0.596CysSer: 0.596 ± 0.017
0.454CysThr: 0.454 ± 0.013
0.56CysVal: 0.56 ± 0.018
0.154CysTrp: 0.154 ± 0.008
0.343CysTyr: 0.343 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
3.653AspAla: 3.653 ± 0.043
0.498AspCys: 0.498 ± 0.018
2.327AspAsp: 2.327 ± 0.05
3.1AspGlu: 3.1 ± 0.041
2.341AspPhe: 2.341 ± 0.036
3.36AspGly: 3.36 ± 0.055
0.607AspHis: 0.607 ± 0.016
3.561AspIle: 3.561 ± 0.047
2.37AspLys: 2.37 ± 0.041
5.498AspLeu: 5.498 ± 0.068
0.756AspMet: 0.756 ± 0.016
2.122AspAsn: 2.122 ± 0.038
2.23AspPro: 2.23 ± 0.034
1.664AspGln: 1.664 ± 0.033
3.138AspArg: 3.138 ± 0.038
3.052AspSer: 3.052 ± 0.043
2.573AspThr: 2.573 ± 0.044
3.057AspVal: 3.057 ± 0.039
0.878AspTrp: 0.878 ± 0.019
1.894AspTyr: 1.894 ± 0.033
0.0AspXaa: 0.0 ± 0.0
Glu
5.389GluAla: 5.389 ± 0.055
0.516GluCys: 0.516 ± 0.016
2.692GluAsp: 2.692 ± 0.039
4.114GluGlu: 4.114 ± 0.06
2.489GluPhe: 2.489 ± 0.039
3.158GluGly: 3.158 ± 0.045
1.034GluHis: 1.034 ± 0.022
4.986GluIle: 4.986 ± 0.05
3.603GluLys: 3.603 ± 0.048
7.107GluLeu: 7.107 ± 0.063
1.296GluMet: 1.296 ± 0.025
2.858GluAsn: 2.858 ± 0.032
2.325GluPro: 2.325 ± 0.04
3.855GluGln: 3.855 ± 0.054
3.409GluArg: 3.409 ± 0.048
3.61GluSer: 3.61 ± 0.042
3.436GluThr: 3.436 ± 0.046
4.375GluVal: 4.375 ± 0.055
0.779GluTrp: 0.779 ± 0.019
1.98GluTyr: 1.98 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
3.136PheAla: 3.136 ± 0.042
0.535PheCys: 0.535 ± 0.017
2.279PheAsp: 2.279 ± 0.033
2.273PheGlu: 2.273 ± 0.038
1.687PhePhe: 1.687 ± 0.033
2.94PheGly: 2.94 ± 0.046
0.74PheHis: 0.74 ± 0.019
2.535PheIle: 2.535 ± 0.038
1.642PheLys: 1.642 ± 0.031
4.117PheLeu: 4.117 ± 0.051
0.676PheMet: 0.676 ± 0.018
1.864PheAsn: 1.864 ± 0.034
1.744PhePro: 1.744 ± 0.028
1.885PheGln: 1.885 ± 0.03
1.731PheArg: 1.731 ± 0.028
3.045PheSer: 3.045 ± 0.042
2.318PheThr: 2.318 ± 0.028
2.475PheVal: 2.475 ± 0.041
0.684PheTrp: 0.684 ± 0.019
1.361PheTyr: 1.361 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
4.811GlyAla: 4.811 ± 0.067
0.734GlyCys: 0.734 ± 0.02
3.302GlyAsp: 3.302 ± 0.061
4.057GlyGlu: 4.057 ± 0.051
2.958GlyPhe: 2.958 ± 0.049
4.757GlyGly: 4.757 ± 0.074
1.224GlyHis: 1.224 ± 0.028
5.155GlyIle: 5.155 ± 0.058
4.111GlyLys: 4.111 ± 0.047
6.767GlyLeu: 6.767 ± 0.063
1.384GlyMet: 1.384 ± 0.028
3.279GlyAsn: 3.279 ± 0.078
1.264GlyPro: 1.264 ± 0.026
3.02GlyGln: 3.02 ± 0.047
3.117GlyArg: 3.117 ± 0.037
4.226GlySer: 4.226 ± 0.057
3.876GlyThr: 3.876 ± 0.06
4.66GlyVal: 4.66 ± 0.05
1.026GlyTrp: 1.026 ± 0.025
2.363GlyTyr: 2.363 ± 0.031
0.0GlyXaa: 0.0 ± 0.0
His
1.018HisAla: 1.018 ± 0.023
0.24HisCys: 0.24 ± 0.012
0.722HisAsp: 0.722 ± 0.019
0.997HisGlu: 0.997 ± 0.024
0.769HisPhe: 0.769 ± 0.019
1.056HisGly: 1.056 ± 0.022
0.56HisHis: 0.56 ± 0.019
1.152HisIle: 1.152 ± 0.028
0.794HisLys: 0.794 ± 0.019
2.244HisLeu: 2.244 ± 0.036
0.185HisMet: 0.185 ± 0.01
0.753HisAsn: 0.753 ± 0.018
1.264HisPro: 1.264 ± 0.025
1.177HisGln: 1.177 ± 0.026
1.03HisArg: 1.03 ± 0.021
1.188HisSer: 1.188 ± 0.027
0.941HisThr: 0.941 ± 0.023
0.78HisVal: 0.78 ± 0.021
0.33HisTrp: 0.33 ± 0.011
0.653HisTyr: 0.653 ± 0.017
0.0HisXaa: 0.0 ± 0.0
Ile
7.372IleAla: 7.372 ± 0.064
0.746IleCys: 0.746 ± 0.019
3.704IleAsp: 3.704 ± 0.047
4.427IleGlu: 4.427 ± 0.051
2.676IlePhe: 2.676 ± 0.034
4.478IleGly: 4.478 ± 0.054
1.263IleHis: 1.263 ± 0.026
3.983IleIle: 3.983 ± 0.048
3.311IleLys: 3.311 ± 0.047
6.953IleLeu: 6.953 ± 0.067
0.849IleMet: 0.849 ± 0.018
3.23IleAsn: 3.23 ± 0.044
3.589IlePro: 3.589 ± 0.047
3.179IleGln: 3.179 ± 0.036
3.084IleArg: 3.084 ± 0.039
4.774IleSer: 4.774 ± 0.054
3.872IleThr: 3.872 ± 0.048
4.402IleVal: 4.402 ± 0.04
0.894IleTrp: 0.894 ± 0.021
1.998IleTyr: 1.998 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
4.078LysAla: 4.078 ± 0.058
0.328LysCys: 0.328 ± 0.013
2.306LysAsp: 2.306 ± 0.041
2.933LysGlu: 2.933 ± 0.045
1.817LysPhe: 1.817 ± 0.035
2.842LysGly: 2.842 ± 0.039
0.857LysHis: 0.857 ± 0.023
3.689LysIle: 3.689 ± 0.044
2.438LysLys: 2.438 ± 0.047
5.74LysLeu: 5.74 ± 0.064
0.945LysMet: 0.945 ± 0.026
2.251LysAsn: 2.251 ± 0.037
2.512LysPro: 2.512 ± 0.034
2.812LysGln: 2.812 ± 0.039
2.371LysArg: 2.371 ± 0.036
3.197LysSer: 3.197 ± 0.042
3.057LysThr: 3.057 ± 0.043
3.323LysVal: 3.323 ± 0.041
0.53LysTrp: 0.53 ± 0.017
1.528LysTyr: 1.528 ± 0.027
0.0LysXaa: 0.0 ± 0.0
Leu
9.487LeuAla: 9.487 ± 0.075
0.999LeuCys: 0.999 ± 0.024
5.274LeuAsp: 5.274 ± 0.052
7.362LeuGlu: 7.362 ± 0.069
3.872LeuPhe: 3.872 ± 0.043
7.572LeuGly: 7.572 ± 0.07
1.968LeuHis: 1.968 ± 0.032
6.767LeuIle: 6.767 ± 0.067
5.744LeuLys: 5.744 ± 0.06
11.574LeuLeu: 11.574 ± 0.1
2.01LeuMet: 2.01 ± 0.033
4.863LeuAsn: 4.863 ± 0.058
5.637LeuPro: 5.637 ± 0.064
5.985LeuGln: 5.985 ± 0.064
5.551LeuArg: 5.551 ± 0.061
7.679LeuSer: 7.679 ± 0.082
6.461LeuThr: 6.461 ± 0.056
7.223LeuVal: 7.223 ± 0.059
1.458LeuTrp: 1.458 ± 0.036
2.779LeuTyr: 2.779 ± 0.036
0.0LeuXaa: 0.0 ± 0.0
Met
1.573MetAla: 1.573 ± 0.03
0.118MetCys: 0.118 ± 0.008
0.659MetAsp: 0.659 ± 0.016
0.962MetGlu: 0.962 ± 0.024
0.532MetPhe: 0.532 ± 0.015
1.206MetGly: 1.206 ± 0.028
0.289MetHis: 0.289 ± 0.01
1.059MetIle: 1.059 ± 0.022
0.892MetLys: 0.892 ± 0.022
1.897MetLeu: 1.897 ± 0.032
0.398MetMet: 0.398 ± 0.015
0.836MetAsn: 0.836 ± 0.017
0.909MetPro: 0.909 ± 0.019
0.909MetGln: 0.909 ± 0.02
0.9MetArg: 0.9 ± 0.022
1.294MetSer: 1.294 ± 0.026
1.187MetThr: 1.187 ± 0.025
1.183MetVal: 1.183 ± 0.026
0.158MetTrp: 0.158 ± 0.01
0.357MetTyr: 0.357 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
3.132AsnAla: 3.132 ± 0.044
0.444AsnCys: 0.444 ± 0.014
1.928AsnAsp: 1.928 ± 0.052
2.006AsnGlu: 2.006 ± 0.03
2.028AsnPhe: 2.028 ± 0.031
2.954AsnGly: 2.954 ± 0.075
0.878AsnHis: 0.878 ± 0.022
3.09AsnIle: 3.09 ± 0.045
1.894AsnLys: 1.894 ± 0.03
5.775AsnLeu: 5.775 ± 0.057
0.605AsnMet: 0.605 ± 0.017
2.317AsnAsn: 2.317 ± 0.046
2.926AsnPro: 2.926 ± 0.04
2.822AsnGln: 2.822 ± 0.049
2.323AsnArg: 2.323 ± 0.033
3.304AsnSer: 3.304 ± 0.047
2.434AsnThr: 2.434 ± 0.039
2.322AsnVal: 2.322 ± 0.035
0.75AsnTrp: 0.75 ± 0.021
1.636AsnTyr: 1.636 ± 0.029
0.0AsnXaa: 0.0 ± 0.0
Pro
3.032ProAla: 3.032 ± 0.035
0.335ProCys: 0.335 ± 0.011
2.844ProAsp: 2.844 ± 0.038
3.748ProGlu: 3.748 ± 0.05
1.738ProPhe: 1.738 ± 0.03
2.927ProGly: 2.927 ± 0.036
0.824ProHis: 0.824 ± 0.021
3.139ProIle: 3.139 ± 0.039
2.283ProLys: 2.283 ± 0.036
4.663ProLeu: 4.663 ± 0.051
0.676ProMet: 0.676 ± 0.018
2.328ProAsn: 2.328 ± 0.034
2.121ProPro: 2.121 ± 0.041
2.544ProGln: 2.544 ± 0.037
1.653ProArg: 1.653 ± 0.029
2.974ProSer: 2.974 ± 0.038
2.854ProThr: 2.854 ± 0.04
3.119ProVal: 3.119 ± 0.044
0.565ProTrp: 0.565 ± 0.019
1.268ProTyr: 1.268 ± 0.024
0.0ProXaa: 0.0 ± 0.0
Gln
4.78GlnAla: 4.78 ± 0.051
0.347GlnCys: 0.347 ± 0.013
2.101GlnAsp: 2.101 ± 0.035
3.76GlnGlu: 3.76 ± 0.05
1.771GlnPhe: 1.771 ± 0.028
3.379GlnGly: 3.379 ± 0.043
0.9GlnHis: 0.9 ± 0.021
3.888GlnIle: 3.888 ± 0.043
2.852GlnLys: 2.852 ± 0.036
6.275GlnLeu: 6.275 ± 0.072
1.026GlnMet: 1.026 ± 0.02
2.26GlnAsn: 2.26 ± 0.033
2.603GlnPro: 2.603 ± 0.044
4.111GlnGln: 4.111 ± 0.06
2.902GlnArg: 2.902 ± 0.041
3.193GlnSer: 3.193 ± 0.041
3.049GlnThr: 3.049 ± 0.047
3.828GlnVal: 3.828 ± 0.043
0.639GlnTrp: 0.639 ± 0.017
1.294GlnTyr: 1.294 ± 0.026
0.0GlnXaa: 0.0 ± 0.0
Arg
3.111ArgAla: 3.111 ± 0.038
0.485ArgCys: 0.485 ± 0.015
2.475ArgAsp: 2.475 ± 0.039
3.227ArgGlu: 3.227 ± 0.044
2.178ArgPhe: 2.178 ± 0.03
2.802ArgGly: 2.802 ± 0.036
0.971ArgHis: 0.971 ± 0.023
3.296ArgIle: 3.296 ± 0.041
2.189ArgLys: 2.189 ± 0.037
5.942ArgLeu: 5.942 ± 0.056
0.905ArgMet: 0.905 ± 0.021
2.061ArgAsn: 2.061 ± 0.031
1.973ArgPro: 1.973 ± 0.032
3.226ArgGln: 3.226 ± 0.044
2.861ArgArg: 2.861 ± 0.047
3.219ArgSer: 3.219 ± 0.04
2.547ArgThr: 2.547 ± 0.034
3.302ArgVal: 3.302 ± 0.044
0.784ArgTrp: 0.784 ± 0.02
1.763ArgTyr: 1.763 ± 0.031
0.0ArgXaa: 0.0 ± 0.0
Ser
4.741SerAla: 4.741 ± 0.054
0.565SerCys: 0.565 ± 0.017
3.358SerAsp: 3.358 ± 0.041
3.954SerGlu: 3.954 ± 0.04
2.702SerPhe: 2.702 ± 0.039
4.779SerGly: 4.779 ± 0.063
1.247SerHis: 1.247 ± 0.023
4.011SerIle: 4.011 ± 0.044
2.967SerLys: 2.967 ± 0.039
7.67SerLeu: 7.67 ± 0.064
1.146SerMet: 1.146 ± 0.021
3.04SerAsn: 3.04 ± 0.046
3.391SerPro: 3.391 ± 0.049
3.811SerGln: 3.811 ± 0.046
3.18SerArg: 3.18 ± 0.042
4.852SerSer: 4.852 ± 0.056
3.666SerThr: 3.666 ± 0.039
4.212SerVal: 4.212 ± 0.05
0.909SerTrp: 0.909 ± 0.019
1.922SerTyr: 1.922 ± 0.033
0.0SerXaa: 0.0 ± 0.0
Thr
4.822ThrAla: 4.822 ± 0.056
0.454ThrCys: 0.454 ± 0.014
2.724ThrAsp: 2.724 ± 0.041
3.383ThrGlu: 3.383 ± 0.041
2.193ThrPhe: 2.193 ± 0.035
4.304ThrGly: 4.304 ± 0.059
1.0ThrHis: 1.0 ± 0.025
3.844ThrIle: 3.844 ± 0.047
2.492ThrLys: 2.492 ± 0.036
6.324ThrLeu: 6.324 ± 0.06
0.74ThrMet: 0.74 ± 0.017
2.46ThrAsn: 2.46 ± 0.036
3.317ThrPro: 3.317 ± 0.045
3.069ThrGln: 3.069 ± 0.039
2.359ThrArg: 2.359 ± 0.032
3.722ThrSer: 3.722 ± 0.042
3.557ThrThr: 3.557 ± 0.057
4.019ThrVal: 4.019 ± 0.047
0.679ThrTrp: 0.679 ± 0.018
1.622ThrTyr: 1.622 ± 0.026
0.0ThrXaa: 0.0 ± 0.0
Val
5.777ValAla: 5.777 ± 0.061
0.634ValCys: 0.634 ± 0.017
3.361ValAsp: 3.361 ± 0.036
4.416ValGlu: 4.416 ± 0.049
2.587ValPhe: 2.587 ± 0.037
4.508ValGly: 4.508 ± 0.049
0.974ValHis: 0.974 ± 0.024
4.411ValIle: 4.411 ± 0.049
3.438ValLys: 3.438 ± 0.04
6.709ValLeu: 6.709 ± 0.066
1.313ValMet: 1.313 ± 0.022
3.059ValAsn: 3.059 ± 0.036
2.772ValPro: 2.772 ± 0.037
2.98ValGln: 2.98 ± 0.041
3.124ValArg: 3.124 ± 0.041
4.326ValSer: 4.326 ± 0.048
3.824ValThr: 3.824 ± 0.043
4.767ValVal: 4.767 ± 0.052
0.831ValTrp: 0.831 ± 0.021
1.846ValTyr: 1.846 ± 0.03
0.0ValXaa: 0.0 ± 0.0
Trp
0.845TrpAla: 0.845 ± 0.021
0.16TrpCys: 0.16 ± 0.009
0.678TrpAsp: 0.678 ± 0.021
0.911TrpGlu: 0.911 ± 0.022
0.591TrpPhe: 0.591 ± 0.016
0.927TrpGly: 0.927 ± 0.024
0.327TrpHis: 0.327 ± 0.012
0.837TrpIle: 0.837 ± 0.018
0.619TrpLys: 0.619 ± 0.017
1.774TrpLeu: 1.774 ± 0.036
0.28TrpMet: 0.28 ± 0.012
0.667TrpAsn: 0.667 ± 0.021
0.261TrpPro: 0.261 ± 0.011
1.167TrpGln: 1.167 ± 0.024
0.794TrpArg: 0.794 ± 0.02
0.851TrpSer: 0.851 ± 0.023
0.6TrpThr: 0.6 ± 0.016
0.914TrpVal: 0.914 ± 0.021
0.213TrpTrp: 0.213 ± 0.01
0.418TrpTyr: 0.418 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.1TyrAla: 2.1 ± 0.032
0.375TyrCys: 0.375 ± 0.014
1.424TyrAsp: 1.424 ± 0.026
1.733TyrGlu: 1.733 ± 0.029
1.331TyrPhe: 1.331 ± 0.029
2.013TyrGly: 2.013 ± 0.032
0.631TyrHis: 0.631 ± 0.018
1.78TyrIle: 1.78 ± 0.027
1.346TyrLys: 1.346 ± 0.027
3.622TyrLeu: 3.622 ± 0.043
0.426TyrMet: 0.426 ± 0.015
1.304TyrAsn: 1.304 ± 0.026
1.557TyrPro: 1.557 ± 0.033
2.047TyrGln: 2.047 ± 0.033
1.876TyrArg: 1.876 ± 0.026
2.005TyrSer: 2.005 ± 0.034
1.611TyrThr: 1.611 ± 0.026
1.66TyrVal: 1.66 ± 0.032
0.57TyrTrp: 0.57 ± 0.016
1.027TyrTyr: 1.027 ± 0.025
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6573 proteins (2290963 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski