Amino acid dipepetide frequency for Talaromyces islandicus (Penicillium islandicum)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.907AlaAla: 8.907 ± 0.06
1.02AlaCys: 1.02 ± 0.015
4.243AlaAsp: 4.243 ± 0.038
4.943AlaGlu: 4.943 ± 0.041
3.162AlaPhe: 3.162 ± 0.027
5.584AlaGly: 5.584 ± 0.04
1.735AlaHis: 1.735 ± 0.018
4.42AlaIle: 4.42 ± 0.037
3.872AlaLys: 3.872 ± 0.033
7.637AlaLeu: 7.637 ± 0.045
1.902AlaMet: 1.902 ± 0.021
3.059AlaAsn: 3.059 ± 0.028
4.319AlaPro: 4.319 ± 0.038
3.266AlaGln: 3.266 ± 0.029
4.543AlaArg: 4.543 ± 0.03
7.236AlaSer: 7.236 ± 0.041
5.262AlaThr: 5.262 ± 0.033
5.454AlaVal: 5.454 ± 0.032
1.156AlaTrp: 1.156 ± 0.017
2.183AlaTyr: 2.183 ± 0.021
0.0AlaXaa: 0.0 ± 0.0
Cys
0.889CysAla: 0.889 ± 0.014
0.208CysCys: 0.208 ± 0.008
0.642CysAsp: 0.642 ± 0.011
0.556CysGlu: 0.556 ± 0.011
0.539CysPhe: 0.539 ± 0.01
0.839CysGly: 0.839 ± 0.014
0.314CysHis: 0.314 ± 0.007
0.688CysIle: 0.688 ± 0.014
0.457CysLys: 0.457 ± 0.01
1.238CysLeu: 1.238 ± 0.017
0.247CysMet: 0.247 ± 0.008
0.416CysAsn: 0.416 ± 0.009
0.582CysPro: 0.582 ± 0.012
0.442CysGln: 0.442 ± 0.01
0.684CysArg: 0.684 ± 0.012
0.87CysSer: 0.87 ± 0.015
0.638CysThr: 0.638 ± 0.013
0.791CysVal: 0.791 ± 0.015
0.184CysTrp: 0.184 ± 0.006
0.352CysTyr: 0.352 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
4.571AspAla: 4.571 ± 0.031
0.587AspCys: 0.587 ± 0.01
4.194AspAsp: 4.194 ± 0.044
4.389AspGlu: 4.389 ± 0.034
2.28AspPhe: 2.28 ± 0.022
3.973AspGly: 3.973 ± 0.036
1.267AspHis: 1.267 ± 0.018
3.346AspIle: 3.346 ± 0.028
2.401AspLys: 2.401 ± 0.023
5.115AspLeu: 5.115 ± 0.034
1.264AspMet: 1.264 ± 0.015
2.126AspAsn: 2.126 ± 0.022
3.192AspPro: 3.192 ± 0.028
1.937AspGln: 1.937 ± 0.019
3.006AspArg: 3.006 ± 0.028
4.467AspSer: 4.467 ± 0.033
3.095AspThr: 3.095 ± 0.026
3.668AspVal: 3.668 ± 0.031
0.869AspTrp: 0.869 ± 0.013
1.711AspTyr: 1.711 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
5.004GluAla: 5.004 ± 0.04
0.583GluCys: 0.583 ± 0.011
4.071GluAsp: 4.071 ± 0.034
5.338GluGlu: 5.338 ± 0.065
2.047GluPhe: 2.047 ± 0.02
3.388GluGly: 3.388 ± 0.027
1.31GluHis: 1.31 ± 0.017
3.298GluIle: 3.298 ± 0.025
3.774GluLys: 3.774 ± 0.036
5.072GluLeu: 5.072 ± 0.035
1.407GluMet: 1.407 ± 0.018
2.665GluAsn: 2.665 ± 0.026
2.694GluPro: 2.694 ± 0.04
2.497GluGln: 2.497 ± 0.03
3.599GluArg: 3.599 ± 0.036
4.417GluSer: 4.417 ± 0.034
3.68GluThr: 3.68 ± 0.025
3.324GluVal: 3.324 ± 0.027
0.868GluTrp: 0.868 ± 0.015
1.732GluTyr: 1.732 ± 0.016
0.0GluXaa: 0.0 ± 0.0
Phe
3.083PheAla: 3.083 ± 0.027
0.551PheCys: 0.551 ± 0.01
2.359PheAsp: 2.359 ± 0.022
2.188PheGlu: 2.188 ± 0.023
1.719PhePhe: 1.719 ± 0.023
2.87PheGly: 2.87 ± 0.026
0.956PheHis: 0.956 ± 0.014
1.928PheIle: 1.928 ± 0.02
1.485PheLys: 1.485 ± 0.017
3.65PheLeu: 3.65 ± 0.035
0.78PheMet: 0.78 ± 0.013
1.505PheAsn: 1.505 ± 0.015
1.986PhePro: 1.986 ± 0.021
1.472PheGln: 1.472 ± 0.02
1.959PheArg: 1.959 ± 0.017
3.22PheSer: 3.22 ± 0.026
2.205PheThr: 2.205 ± 0.02
2.569PheVal: 2.569 ± 0.023
0.674PheTrp: 0.674 ± 0.012
1.186PheTyr: 1.186 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
5.005GlyAla: 5.005 ± 0.039
0.815GlyCys: 0.815 ± 0.015
3.602GlyAsp: 3.602 ± 0.026
3.426GlyGlu: 3.426 ± 0.026
2.887GlyPhe: 2.887 ± 0.024
5.414GlyGly: 5.414 ± 0.051
1.674GlyHis: 1.674 ± 0.021
3.736GlyIle: 3.736 ± 0.028
3.323GlyLys: 3.323 ± 0.029
6.114GlyLeu: 6.114 ± 0.045
1.454GlyMet: 1.454 ± 0.015
2.654GlyAsn: 2.654 ± 0.025
3.148GlyPro: 3.148 ± 0.025
2.577GlyGln: 2.577 ± 0.028
3.839GlyArg: 3.839 ± 0.035
5.647GlySer: 5.647 ± 0.041
3.813GlyThr: 3.813 ± 0.032
4.329GlyVal: 4.329 ± 0.035
1.132GlyTrp: 1.132 ± 0.016
2.2GlyTyr: 2.2 ± 0.023
0.0GlyXaa: 0.0 ± 0.0
His
1.826HisAla: 1.826 ± 0.019
0.306HisCys: 0.306 ± 0.008
1.342HisAsp: 1.342 ± 0.016
1.336HisGlu: 1.336 ± 0.017
0.918HisPhe: 0.918 ± 0.014
1.707HisGly: 1.707 ± 0.021
0.867HisHis: 0.867 ± 0.017
1.276HisIle: 1.276 ± 0.017
0.905HisLys: 0.905 ± 0.013
2.222HisLeu: 2.222 ± 0.02
0.476HisMet: 0.476 ± 0.009
0.882HisAsn: 0.882 ± 0.013
1.627HisPro: 1.627 ± 0.018
1.017HisGln: 1.017 ± 0.013
1.522HisArg: 1.522 ± 0.022
1.886HisSer: 1.886 ± 0.022
1.287HisThr: 1.287 ± 0.015
1.478HisVal: 1.478 ± 0.017
0.343HisTrp: 0.343 ± 0.009
0.704HisTyr: 0.704 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
4.283IleAla: 4.283 ± 0.028
0.756IleCys: 0.756 ± 0.013
3.085IleAsp: 3.085 ± 0.024
2.968IleGlu: 2.968 ± 0.028
2.155IlePhe: 2.155 ± 0.023
3.375IleGly: 3.375 ± 0.031
1.299IleHis: 1.299 ± 0.015
2.676IleIle: 2.676 ± 0.027
2.233IleLys: 2.233 ± 0.023
4.817IleLeu: 4.817 ± 0.038
1.033IleMet: 1.033 ± 0.015
1.979IleAsn: 1.979 ± 0.02
3.18IlePro: 3.18 ± 0.026
2.081IleGln: 2.081 ± 0.02
2.822IleArg: 2.822 ± 0.025
4.223IleSer: 4.223 ± 0.031
2.963IleThr: 2.963 ± 0.025
3.371IleVal: 3.371 ± 0.03
0.735IleTrp: 0.735 ± 0.014
1.529IleTyr: 1.529 ± 0.018
0.0IleXaa: 0.0 ± 0.0
Lys
4.049LysAla: 4.049 ± 0.035
0.456LysCys: 0.456 ± 0.009
2.754LysAsp: 2.754 ± 0.027
3.343LysGlu: 3.343 ± 0.04
1.49LysPhe: 1.49 ± 0.018
2.819LysGly: 2.819 ± 0.025
1.083LysHis: 1.083 ± 0.015
2.382LysIle: 2.382 ± 0.021
3.233LysLys: 3.233 ± 0.045
4.062LysLeu: 4.062 ± 0.03
0.987LysMet: 0.987 ± 0.016
1.901LysAsn: 1.901 ± 0.02
2.644LysPro: 2.644 ± 0.029
1.893LysGln: 1.893 ± 0.02
3.215LysArg: 3.215 ± 0.034
3.526LysSer: 3.526 ± 0.031
2.929LysThr: 2.929 ± 0.025
2.748LysVal: 2.748 ± 0.026
0.662LysTrp: 0.662 ± 0.011
1.441LysTyr: 1.441 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
7.82LeuAla: 7.82 ± 0.042
1.169LeuCys: 1.169 ± 0.017
5.343LeuAsp: 5.343 ± 0.034
5.454LeuGlu: 5.454 ± 0.039
3.503LeuPhe: 3.503 ± 0.032
5.97LeuGly: 5.97 ± 0.044
2.252LeuHis: 2.252 ± 0.025
4.108LeuIle: 4.108 ± 0.037
4.176LeuLys: 4.176 ± 0.032
8.53LeuLeu: 8.53 ± 0.06
1.751LeuMet: 1.751 ± 0.02
3.331LeuAsn: 3.331 ± 0.025
5.36LeuPro: 5.36 ± 0.032
3.995LeuGln: 3.995 ± 0.037
5.556LeuArg: 5.556 ± 0.041
7.709LeuSer: 7.709 ± 0.049
4.837LeuThr: 4.837 ± 0.033
5.617LeuVal: 5.617 ± 0.046
1.213LeuTrp: 1.213 ± 0.016
2.478LeuTyr: 2.478 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
2.142MetAla: 2.142 ± 0.02
0.231MetCys: 0.231 ± 0.007
1.188MetAsp: 1.188 ± 0.017
1.226MetGlu: 1.226 ± 0.017
0.749MetPhe: 0.749 ± 0.012
1.379MetGly: 1.379 ± 0.016
0.482MetHis: 0.482 ± 0.009
1.029MetIle: 1.029 ± 0.015
0.97MetLys: 0.97 ± 0.013
1.813MetLeu: 1.813 ± 0.021
0.548MetMet: 0.548 ± 0.011
0.816MetAsn: 0.816 ± 0.013
1.17MetPro: 1.17 ± 0.016
0.864MetGln: 0.864 ± 0.014
1.156MetArg: 1.156 ± 0.016
1.835MetSer: 1.835 ± 0.021
1.294MetThr: 1.294 ± 0.017
1.281MetVal: 1.281 ± 0.017
0.259MetTrp: 0.259 ± 0.007
0.51MetTyr: 0.51 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
3.214AsnAla: 3.214 ± 0.031
0.439AsnCys: 0.439 ± 0.009
2.209AsnAsp: 2.209 ± 0.018
2.226AsnGlu: 2.226 ± 0.022
1.45AsnPhe: 1.45 ± 0.016
3.012AsnGly: 3.012 ± 0.03
0.912AsnHis: 0.912 ± 0.014
2.279AsnIle: 2.279 ± 0.019
1.699AsnLys: 1.699 ± 0.017
3.425AsnLeu: 3.425 ± 0.025
0.87AsnMet: 0.87 ± 0.013
1.953AsnAsn: 1.953 ± 0.025
2.489AsnPro: 2.489 ± 0.021
1.455AsnGln: 1.455 ± 0.017
1.985AsnArg: 1.985 ± 0.018
3.092AsnSer: 3.092 ± 0.026
2.447AsnThr: 2.447 ± 0.022
2.487AsnVal: 2.487 ± 0.022
0.595AsnTrp: 0.595 ± 0.011
1.134AsnTyr: 1.134 ± 0.017
0.0AsnXaa: 0.0 ± 0.0
Pro
4.898ProAla: 4.898 ± 0.041
0.488ProCys: 0.488 ± 0.011
3.158ProAsp: 3.158 ± 0.029
3.739ProGlu: 3.739 ± 0.032
2.087ProPhe: 2.087 ± 0.021
3.765ProGly: 3.765 ± 0.028
1.278ProHis: 1.278 ± 0.018
2.527ProIle: 2.527 ± 0.022
2.49ProLys: 2.49 ± 0.027
4.644ProLeu: 4.644 ± 0.034
0.988ProMet: 0.988 ± 0.015
2.171ProAsn: 2.171 ± 0.02
4.692ProPro: 4.692 ± 0.069
2.405ProGln: 2.405 ± 0.024
3.143ProArg: 3.143 ± 0.029
5.934ProSer: 5.934 ± 0.045
3.717ProThr: 3.717 ± 0.031
3.585ProVal: 3.585 ± 0.031
0.739ProTrp: 0.739 ± 0.012
1.515ProTyr: 1.515 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.319GlnAla: 3.319 ± 0.024
0.423GlnCys: 0.423 ± 0.011
2.111GlnAsp: 2.111 ± 0.02
2.401GlnGlu: 2.401 ± 0.026
1.368GlnPhe: 1.368 ± 0.015
2.452GlnGly: 2.452 ± 0.024
1.079GlnHis: 1.079 ± 0.015
1.994GlnIle: 1.994 ± 0.02
2.102GlnLys: 2.102 ± 0.021
3.579GlnLeu: 3.579 ± 0.03
0.844GlnMet: 0.844 ± 0.014
1.764GlnAsn: 1.764 ± 0.019
2.517GlnPro: 2.517 ± 0.028
2.621GlnGln: 2.621 ± 0.045
2.628GlnArg: 2.628 ± 0.025
3.293GlnSer: 3.293 ± 0.028
2.388GlnThr: 2.388 ± 0.021
2.261GlnVal: 2.261 ± 0.02
0.579GlnTrp: 0.579 ± 0.009
1.228GlnTyr: 1.228 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
4.334ArgAla: 4.334 ± 0.032
0.628ArgCys: 0.628 ± 0.012
3.309ArgAsp: 3.309 ± 0.032
3.597ArgGlu: 3.597 ± 0.036
2.147ArgPhe: 2.147 ± 0.022
3.5ArgGly: 3.5 ± 0.032
1.532ArgHis: 1.532 ± 0.017
2.907ArgIle: 2.907 ± 0.026
3.307ArgLys: 3.307 ± 0.03
5.432ArgLeu: 5.432 ± 0.038
1.198ArgMet: 1.198 ± 0.016
2.283ArgAsn: 2.283 ± 0.023
3.241ArgPro: 3.241 ± 0.033
2.601ArgGln: 2.601 ± 0.023
4.764ArgArg: 4.764 ± 0.046
4.536ArgSer: 4.536 ± 0.039
3.088ArgThr: 3.088 ± 0.023
3.333ArgVal: 3.333 ± 0.031
0.892ArgTrp: 0.892 ± 0.014
1.638ArgTyr: 1.638 ± 0.019
0.0ArgXaa: 0.0 ± 0.0
Ser
6.681SerAla: 6.681 ± 0.04
0.837SerCys: 0.837 ± 0.014
4.419SerAsp: 4.419 ± 0.032
4.255SerGlu: 4.255 ± 0.029
3.189SerPhe: 3.189 ± 0.026
5.531SerGly: 5.531 ± 0.038
2.043SerHis: 2.043 ± 0.024
4.283SerIle: 4.283 ± 0.03
3.785SerLys: 3.785 ± 0.032
7.718SerLeu: 7.718 ± 0.046
1.714SerMet: 1.714 ± 0.019
3.305SerAsn: 3.305 ± 0.029
5.365SerPro: 5.365 ± 0.049
3.482SerGln: 3.482 ± 0.028
4.961SerArg: 4.961 ± 0.046
9.35SerSer: 9.35 ± 0.073
5.792SerThr: 5.792 ± 0.044
4.833SerVal: 4.833 ± 0.033
1.18SerTrp: 1.18 ± 0.014
2.211SerTyr: 2.211 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
5.343ThrAla: 5.343 ± 0.035
0.704ThrCys: 0.704 ± 0.014
2.941ThrAsp: 2.941 ± 0.022
3.205ThrGlu: 3.205 ± 0.029
2.293ThrPhe: 2.293 ± 0.021
4.215ThrGly: 4.215 ± 0.033
1.279ThrHis: 1.279 ± 0.016
3.195ThrIle: 3.195 ± 0.027
2.62ThrLys: 2.62 ± 0.023
5.295ThrLeu: 5.295 ± 0.033
1.178ThrMet: 1.178 ± 0.014
2.268ThrAsn: 2.268 ± 0.022
4.094ThrPro: 4.094 ± 0.037
2.139ThrGln: 2.139 ± 0.019
3.028ThrArg: 3.028 ± 0.025
5.412ThrSer: 5.412 ± 0.043
4.486ThrThr: 4.486 ± 0.048
3.933ThrVal: 3.933 ± 0.031
0.885ThrTrp: 0.885 ± 0.013
1.659ThrTyr: 1.659 ± 0.02
0.0ThrXaa: 0.0 ± 0.0
Val
5.205ValAla: 5.205 ± 0.035
0.79ValCys: 0.79 ± 0.012
3.862ValAsp: 3.862 ± 0.032
3.765ValGlu: 3.765 ± 0.035
2.582ValPhe: 2.582 ± 0.023
3.98ValGly: 3.98 ± 0.035
1.439ValHis: 1.439 ± 0.019
3.174ValIle: 3.174 ± 0.027
2.85ValLys: 2.85 ± 0.026
5.696ValLeu: 5.696 ± 0.037
1.291ValMet: 1.291 ± 0.017
2.416ValAsn: 2.416 ± 0.023
3.535ValPro: 3.535 ± 0.028
2.451ValGln: 2.451 ± 0.022
3.329ValArg: 3.329 ± 0.024
4.964ValSer: 4.964 ± 0.032
3.629ValThr: 3.629 ± 0.03
4.399ValVal: 4.399 ± 0.039
0.852ValTrp: 0.852 ± 0.014
1.83ValTyr: 1.83 ± 0.022
0.0ValXaa: 0.0 ± 0.0
Trp
1.148TrpAla: 1.148 ± 0.018
0.184TrpCys: 0.184 ± 0.006
0.892TrpAsp: 0.892 ± 0.015
0.825TrpGlu: 0.825 ± 0.012
0.544TrpPhe: 0.544 ± 0.01
0.894TrpGly: 0.894 ± 0.014
0.343TrpHis: 0.343 ± 0.01
0.81TrpIle: 0.81 ± 0.012
0.822TrpLys: 0.822 ± 0.014
1.344TrpLeu: 1.344 ± 0.017
0.357TrpMet: 0.357 ± 0.007
0.665TrpAsn: 0.665 ± 0.011
0.603TrpPro: 0.603 ± 0.011
0.593TrpGln: 0.593 ± 0.01
0.907TrpArg: 0.907 ± 0.014
1.103TrpSer: 1.103 ± 0.017
0.917TrpThr: 0.917 ± 0.013
0.888TrpVal: 0.888 ± 0.012
0.272TrpTrp: 0.272 ± 0.007
0.433TrpTyr: 0.433 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.182TyrAla: 2.182 ± 0.022
0.403TyrCys: 0.403 ± 0.009
1.732TyrAsp: 1.732 ± 0.017
1.588TyrGlu: 1.588 ± 0.018
1.237TyrPhe: 1.237 ± 0.017
2.125TyrGly: 2.125 ± 0.022
0.783TyrHis: 0.783 ± 0.011
1.509TyrIle: 1.509 ± 0.018
1.116TyrLys: 1.116 ± 0.014
2.769TyrLeu: 2.769 ± 0.026
0.651TyrMet: 0.651 ± 0.01
1.212TyrAsn: 1.212 ± 0.018
1.524TyrPro: 1.524 ± 0.019
1.165TyrGln: 1.165 ± 0.017
1.62TyrArg: 1.62 ± 0.016
2.212TyrSer: 2.212 ± 0.022
1.705TyrThr: 1.705 ± 0.021
1.689TyrVal: 1.689 ± 0.018
0.454TyrTrp: 0.454 ± 0.009
1.003TyrTyr: 1.003 ± 0.017
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.012XaaXaa: 0.012 ± 0.006
Statistics based on 9927 proteins (5349850 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski