Amino acid dipeptide frequency for Trichomonas vaginalis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
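As an illustration of what such a calculation involves, here is a minimal sketch that counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are assumptions made for this example, and one-letter codes are used for brevity; this is not the database's actual pipeline.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs (0,1), (1,2), ... inside one protein;
        # a pair never spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences for illustration only (not real T. vaginalis proteins).
toy_proteins = ["MKKLA", "AKKE"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Counting per protein rather than on a concatenated string avoids inventing pairs such as the last residue of one protein followed by the first residue of the next.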

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
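The uniform expectation is simple arithmetic: 1/400 = 0.25% = 2.5‰, or 1/441 ≈ 2.27‰ when Xaa is included. The sketch below compares a few values from the table against that baseline. The `classify` helper and the use of exactly 2.5‰ as the cut-off are assumptions for illustration; the page does not state the precise rule behind the red and blue marking.

```python
N_STANDARD = 20

# Expected frequency of each dipeptide if all were equally likely, in per mille.
uniform_expectation = 1000.0 / N_STANDARD ** 2      # 2.5 per mille = 0.25%
uniform_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2   # about 2.268 per mille


def classify(value_permille, expected=uniform_expectation):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table on this page (per mille).
observed = {"AlaAla": 3.666, "AlaCys": 0.732, "LeuLeu": 7.998, "TrpTrp": 0.063}
labels = {pair: classify(value) for pair, value in observed.items()}
for pair, label in labels.items():
    print(pair, observed[pair], label)
```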

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
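For readers who want the numbers in another unit, a small sketch of the conversion follows. The helper names are hypothetical and exist only for this example.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


# Example with the AlaAla entry from the table (3.666 per mille).
print(permille_to_fraction(3.666))
print(permille_to_percent(3.666))
```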

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.666 ± 0.027
AlaCys: 0.732 ± 0.008
AlaAsp: 3.075 ± 0.02
AlaGlu: 3.773 ± 0.026
AlaPhe: 2.773 ± 0.02
AlaGly: 2.013 ± 0.015
AlaHis: 0.783 ± 0.008
AlaIle: 4.241 ± 0.019
AlaLys: 4.344 ± 0.024
AlaLeu: 4.582 ± 0.024
AlaMet: 1.373 ± 0.009
AlaAsn: 3.135 ± 0.016
AlaPro: 2.109 ± 0.018
AlaGln: 2.163 ± 0.016
AlaArg: 1.674 ± 0.015
AlaSer: 3.318 ± 0.023
AlaThr: 2.831 ± 0.017
AlaVal: 2.781 ± 0.014
AlaTrp: 0.481 ± 0.006
AlaTyr: 1.602 ± 0.011
AlaXaa: 0.003 ± 0.0
Cys
CysAla: 0.838 ± 0.009
CysCys: 0.407 ± 0.007
CysAsp: 0.86 ± 0.008
CysGlu: 0.834 ± 0.008
CysPhe: 1.152 ± 0.009
CysGly: 0.816 ± 0.008
CysHis: 0.297 ± 0.005
CysIle: 1.122 ± 0.008
CysLys: 1.228 ± 0.011
CysLeu: 1.463 ± 0.011
CysMet: 0.353 ± 0.005
CysAsn: 0.852 ± 0.008
CysPro: 0.625 ± 0.007
CysGln: 0.585 ± 0.007
CysArg: 0.61 ± 0.006
CysSer: 1.35 ± 0.011
CysThr: 0.835 ± 0.008
CysVal: 0.873 ± 0.008
CysTrp: 0.124 ± 0.006
CysTyr: 0.719 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.74 ± 0.017
AspCys: 1.069 ± 0.009
AspAsp: 3.898 ± 0.026
AspGlu: 4.913 ± 0.029
AspPhe: 3.522 ± 0.017
AspGly: 2.468 ± 0.016
AspHis: 1.024 ± 0.008
AspIle: 4.889 ± 0.019
AspLys: 4.769 ± 0.027
AspLeu: 5.108 ± 0.025
AspMet: 1.578 ± 0.012
AspAsn: 3.22 ± 0.015
AspPro: 2.242 ± 0.015
AspGln: 1.93 ± 0.013
AspArg: 2.062 ± 0.013
AspSer: 4.068 ± 0.019
AspThr: 2.539 ± 0.012
AspVal: 3.273 ± 0.018
AspTrp: 0.626 ± 0.007
AspTyr: 2.246 ± 0.014
AspXaa: 0.002 ± 0.0
Glu
GluAla: 3.413 ± 0.022
GluCys: 1.02 ± 0.009
GluAsp: 3.89 ± 0.023
GluGlu: 7.153 ± 0.091
GluPhe: 3.858 ± 0.018
GluGly: 2.505 ± 0.014
GluHis: 0.991 ± 0.008
GluIle: 6.663 ± 0.03
GluLys: 7.107 ± 0.056
GluLeu: 6.415 ± 0.029
GluMet: 2.041 ± 0.012
GluAsn: 5.612 ± 0.033
GluPro: 2.177 ± 0.019
GluGln: 2.731 ± 0.021
GluArg: 2.606 ± 0.02
GluSer: 4.204 ± 0.029
GluThr: 4.265 ± 0.024
GluVal: 3.619 ± 0.017
GluTrp: 0.445 ± 0.005
GluTyr: 3.124 ± 0.016
GluXaa: 0.003 ± 0.0
Phe
PheAla: 2.73 ± 0.014
PheCys: 0.945 ± 0.009
PheAsp: 3.808 ± 0.017
PheGlu: 3.617 ± 0.018
PhePhe: 2.205 ± 0.016
PheGly: 2.19 ± 0.016
PheHis: 1.072 ± 0.009
PheIle: 4.137 ± 0.024
PheLys: 3.454 ± 0.018
PheLeu: 4.625 ± 0.021
PheMet: 1.178 ± 0.01
PheAsn: 3.411 ± 0.016
PhePro: 2.04 ± 0.013
PheGln: 1.751 ± 0.012
PheArg: 1.677 ± 0.011
PheSer: 4.043 ± 0.021
PheThr: 2.902 ± 0.014
PheVal: 2.949 ± 0.016
PheTrp: 0.481 ± 0.006
PheTyr: 2.267 ± 0.013
PheXaa: 0.002 ± 0.0
Gly
GlyAla: 2.541 ± 0.02
GlyCys: 0.877 ± 0.01
GlyAsp: 1.944 ± 0.012
GlyGlu: 2.321 ± 0.012
GlyPhe: 2.21 ± 0.013
GlyGly: 2.453 ± 0.027
GlyHis: 0.915 ± 0.009
GlyIle: 3.001 ± 0.017
GlyLys: 3.532 ± 0.018
GlyLeu: 2.953 ± 0.016
GlyMet: 0.773 ± 0.009
GlyAsn: 2.459 ± 0.017
GlyPro: 0.963 ± 0.01
GlyGln: 1.438 ± 0.011
GlyArg: 1.623 ± 0.011
GlySer: 2.957 ± 0.018
GlyThr: 2.343 ± 0.014
GlyVal: 2.031 ± 0.014
GlyTrp: 0.435 ± 0.006
GlyTyr: 1.811 ± 0.013
GlyXaa: 0.002 ± 0.0
His
HisAla: 0.863 ± 0.008
HisCys: 0.448 ± 0.005
HisAsp: 0.948 ± 0.007
HisGlu: 1.213 ± 0.01
HisPhe: 1.105 ± 0.01
HisGly: 1.058 ± 0.015
HisHis: 0.449 ± 0.007
HisIle: 1.508 ± 0.011
HisLys: 1.359 ± 0.01
HisLeu: 1.802 ± 0.013
HisMet: 0.379 ± 0.005
HisAsn: 1.047 ± 0.009
HisPro: 0.964 ± 0.007
HisGln: 0.679 ± 0.007
HisArg: 0.942 ± 0.008
HisSer: 1.41 ± 0.012
HisThr: 1.183 ± 0.015
HisVal: 0.962 ± 0.007
HisTrp: 0.193 ± 0.004
HisTyr: 0.982 ± 0.011
HisXaa: 0.001 ± 0.0
Ile
IleAla: 4.112 ± 0.018
IleCys: 1.263 ± 0.008
IleAsp: 5.117 ± 0.018
IleGlu: 5.697 ± 0.02
IlePhe: 3.406 ± 0.021
IleGly: 3.14 ± 0.017
IleHis: 1.672 ± 0.013
IleIle: 5.697 ± 0.033
IleLys: 6.008 ± 0.026
IleLeu: 6.714 ± 0.027
IleMet: 1.476 ± 0.01
IleAsn: 5.037 ± 0.026
IlePro: 3.95 ± 0.019
IleGln: 3.414 ± 0.016
IleArg: 2.983 ± 0.013
IleSer: 7.283 ± 0.039
IleThr: 4.401 ± 0.021
IleVal: 4.162 ± 0.018
IleTrp: 0.64 ± 0.007
IleTyr: 3.25 ± 0.017
IleXaa: 0.004 ± 0.0
Lys
LysAla: 4.029 ± 0.028
LysCys: 1.178 ± 0.007
LysAsp: 5.008 ± 0.028
LysGlu: 7.119 ± 0.048
LysPhe: 4.349 ± 0.021
LysGly: 2.779 ± 0.018
LysHis: 1.305 ± 0.014
LysIle: 6.269 ± 0.022
LysLys: 7.6 ± 0.05
LysLeu: 7.18 ± 0.03
LysMet: 2.061 ± 0.018
LysAsn: 5.317 ± 0.023
LysPro: 3.307 ± 0.027
LysGln: 3.52 ± 0.024
LysArg: 3.088 ± 0.017
LysSer: 5.928 ± 0.029
LysThr: 4.554 ± 0.022
LysVal: 4.172 ± 0.024
LysTrp: 0.482 ± 0.007
LysTyr: 3.714 ± 0.017
LysXaa: 0.004 ± 0.001
Leu
LeuAla: 4.351 ± 0.019
LeuCys: 1.351 ± 0.011
LeuAsp: 4.573 ± 0.018
LeuGlu: 5.664 ± 0.027
LeuPhe: 4.46 ± 0.029
LeuGly: 3.046 ± 0.017
LeuHis: 2.051 ± 0.017
LeuIle: 6.809 ± 0.034
LeuLys: 6.948 ± 0.034
LeuLeu: 7.998 ± 0.036
LeuMet: 2.013 ± 0.014
LeuAsn: 5.377 ± 0.024
LeuPro: 4.222 ± 0.036
LeuGln: 4.042 ± 0.025
LeuArg: 3.481 ± 0.016
LeuSer: 6.85 ± 0.029
LeuThr: 4.989 ± 0.024
LeuVal: 4.381 ± 0.021
LeuTrp: 0.58 ± 0.006
LeuTyr: 3.347 ± 0.016
LeuXaa: 0.004 ± 0.001
Met
MetAla: 1.025 ± 0.008
MetCys: 0.288 ± 0.004
MetAsp: 1.108 ± 0.01
MetGlu: 1.342 ± 0.009
MetPhe: 1.088 ± 0.008
MetGly: 0.634 ± 0.006
MetHis: 0.439 ± 0.005
MetIle: 1.628 ± 0.013
MetLys: 2.398 ± 0.013
MetLeu: 2.249 ± 0.018
MetMet: 0.668 ± 0.008
MetAsn: 1.744 ± 0.013
MetPro: 0.757 ± 0.009
MetGln: 1.098 ± 0.01
MetArg: 1.117 ± 0.009
MetSer: 1.921 ± 0.011
MetThr: 1.461 ± 0.01
MetVal: 0.941 ± 0.007
MetTrp: 0.202 ± 0.004
MetTyr: 0.906 ± 0.007
MetXaa: 0.002 ± 0.0
Asn
AsnAla: 3.175 ± 0.017
AsnCys: 1.131 ± 0.014
AsnAsp: 3.933 ± 0.019
AsnGlu: 5.259 ± 0.034
AsnPhe: 3.321 ± 0.015
AsnGly: 2.682 ± 0.018
AsnHis: 1.319 ± 0.009
AsnIle: 5.613 ± 0.025
AsnLys: 4.874 ± 0.026
AsnLeu: 5.451 ± 0.025
AsnMet: 1.14 ± 0.009
AsnAsn: 3.875 ± 0.027
AsnPro: 2.872 ± 0.015
AsnGln: 2.466 ± 0.018
AsnArg: 2.243 ± 0.013
AsnSer: 5.124 ± 0.024
AsnThr: 3.128 ± 0.015
AsnVal: 3.323 ± 0.015
AsnTrp: 0.486 ± 0.006
AsnTyr: 2.813 ± 0.016
AsnXaa: 0.002 ± 0.0
Pro
ProAla: 2.154 ± 0.02
ProCys: 0.356 ± 0.005
ProAsp: 2.202 ± 0.013
ProGlu: 3.536 ± 0.019
ProPhe: 1.902 ± 0.014
ProGly: 1.329 ± 0.014
ProHis: 0.714 ± 0.007
ProIle: 2.852 ± 0.016
ProLys: 3.414 ± 0.027
ProLeu: 3.695 ± 0.034
ProMet: 0.749 ± 0.007
ProAsn: 2.597 ± 0.013
ProPro: 2.622 ± 0.031
ProGln: 2.232 ± 0.018
ProArg: 1.587 ± 0.013
ProSer: 3.265 ± 0.019
ProThr: 2.736 ± 0.022
ProVal: 2.031 ± 0.017
ProTrp: 0.257 ± 0.004
ProTyr: 1.564 ± 0.013
ProXaa: 0.003 ± 0.0
Gln
GlnAla: 1.847 ± 0.013
GlnCys: 0.673 ± 0.006
GlnAsp: 2.01 ± 0.012
GlnGlu: 2.95 ± 0.021
GlnPhe: 1.864 ± 0.012
GlnGly: 1.354 ± 0.012
GlnHis: 0.628 ± 0.007
GlnIle: 3.701 ± 0.018
GlnLys: 3.696 ± 0.023
GlnLeu: 3.402 ± 0.02
GlnMet: 1.061 ± 0.01
GlnAsn: 3.001 ± 0.024
GlnPro: 1.625 ± 0.018
GlnGln: 2.685 ± 0.036
GlnArg: 1.67 ± 0.015
GlnSer: 2.736 ± 0.015
GlnThr: 2.777 ± 0.015
GlnVal: 1.731 ± 0.012
GlnTrp: 0.318 ± 0.005
GlnTyr: 1.554 ± 0.012
GlnXaa: 0.002 ± 0.0
Arg
ArgAla: 1.887 ± 0.013
ArgCys: 0.677 ± 0.006
ArgAsp: 2.125 ± 0.012
ArgGlu: 2.638 ± 0.016
ArgPhe: 1.776 ± 0.01
ArgGly: 1.404 ± 0.012
ArgHis: 0.801 ± 0.01
ArgIle: 3.061 ± 0.021
ArgLys: 3.311 ± 0.023
ArgLeu: 3.114 ± 0.022
ArgMet: 0.934 ± 0.008
ArgAsn: 2.508 ± 0.013
ArgPro: 1.198 ± 0.01
ArgGln: 1.483 ± 0.012
ArgArg: 1.94 ± 0.016
ArgSer: 2.402 ± 0.012
ArgThr: 1.966 ± 0.013
ArgVal: 1.995 ± 0.012
ArgTrp: 0.352 ± 0.005
ArgTyr: 1.693 ± 0.012
ArgXaa: 0.003 ± 0.0
Ser
SerAla: 3.27 ± 0.018
SerCys: 1.029 ± 0.009
SerAsp: 4.655 ± 0.022
SerGlu: 5.105 ± 0.029
SerPhe: 4.191 ± 0.018
SerGly: 3.201 ± 0.02
SerHis: 1.699 ± 0.016
SerIle: 6.362 ± 0.029
SerLys: 6.025 ± 0.026
SerLeu: 6.814 ± 0.032
SerMet: 1.759 ± 0.01
SerAsn: 4.682 ± 0.023
SerPro: 3.051 ± 0.018
SerGln: 2.937 ± 0.018
SerArg: 2.467 ± 0.023
SerSer: 6.923 ± 0.058
SerThr: 4.269 ± 0.023
SerVal: 4.052 ± 0.017
SerTrp: 0.467 ± 0.006
SerTyr: 2.587 ± 0.015
SerXaa: 0.005 ± 0.001
Thr
ThrAla: 3.38 ± 0.02
ThrCys: 0.698 ± 0.007
ThrAsp: 2.913 ± 0.014
ThrGlu: 3.8 ± 0.018
ThrPhe: 3.168 ± 0.016
ThrGly: 2.131 ± 0.018
ThrHis: 1.155 ± 0.012
ThrIle: 4.719 ± 0.024
ThrLys: 4.572 ± 0.02
ThrLeu: 4.491 ± 0.021
ThrMet: 1.028 ± 0.008
ThrAsn: 3.401 ± 0.02
ThrPro: 3.184 ± 0.03
ThrGln: 1.981 ± 0.012
ThrArg: 1.709 ± 0.011
ThrSer: 4.015 ± 0.022
ThrThr: 3.47 ± 0.021
ThrVal: 3.245 ± 0.018
ThrTrp: 0.369 ± 0.005
ThrTyr: 2.229 ± 0.013
ThrXaa: 0.004 ± 0.0
Val
ValAla: 2.738 ± 0.015
ValCys: 0.858 ± 0.007
ValAsp: 2.965 ± 0.014
ValGlu: 3.594 ± 0.017
ValPhe: 2.526 ± 0.015
ValGly: 2.066 ± 0.014
ValHis: 1.059 ± 0.009
ValIle: 3.515 ± 0.018
ValLys: 4.724 ± 0.019
ValLeu: 4.532 ± 0.026
ValMet: 1.289 ± 0.01
ValAsn: 3.321 ± 0.015
ValPro: 2.017 ± 0.013
ValGln: 2.353 ± 0.015
ValArg: 2.057 ± 0.013
ValSer: 4.064 ± 0.018
ValThr: 2.571 ± 0.016
ValVal: 2.93 ± 0.017
ValTrp: 0.634 ± 0.006
ValTyr: 2.074 ± 0.012
ValXaa: 0.003 ± 0.0
Trp
TrpAla: 0.425 ± 0.005
TrpCys: 0.151 ± 0.003
TrpAsp: 0.452 ± 0.006
TrpGlu: 0.565 ± 0.006
TrpPhe: 0.446 ± 0.006
TrpGly: 0.314 ± 0.005
TrpHis: 0.17 ± 0.004
TrpIle: 0.593 ± 0.006
TrpLys: 0.589 ± 0.006
TrpLeu: 0.611 ± 0.007
TrpMet: 0.192 ± 0.004
TrpAsn: 0.593 ± 0.007
TrpPro: 0.19 ± 0.003
TrpGln: 0.34 ± 0.008
TrpArg: 0.293 ± 0.004
TrpSer: 0.595 ± 0.007
TrpThr: 0.465 ± 0.005
TrpVal: 0.452 ± 0.005
TrpTrp: 0.063 ± 0.002
TrpTyr: 0.422 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.268 ± 0.015
TyrCys: 0.79 ± 0.008
TyrAsp: 2.703 ± 0.015
TyrGlu: 2.747 ± 0.014
TyrPhe: 2.16 ± 0.014
TyrGly: 1.996 ± 0.015
TyrHis: 0.952 ± 0.008
TyrIle: 3.039 ± 0.016
TyrLys: 3.007 ± 0.016
TyrLeu: 3.335 ± 0.019
TyrMet: 0.967 ± 0.009
TyrAsn: 3.04 ± 0.017
TyrPro: 1.678 ± 0.012
TyrGln: 1.453 ± 0.01
TyrArg: 1.436 ± 0.011
TyrSer: 3.089 ± 0.017
TyrThr: 1.973 ± 0.014
TyrVal: 1.985 ± 0.012
TyrTrp: 0.287 ± 0.005
TyrTyr: 1.796 ± 0.014
TyrXaa: 0.002 ± 0.0
Xaa
XaaAla: 0.002 ± 0.0
XaaCys: 0.002 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.002 ± 0.0
XaaPhe: 0.004 ± 0.0
XaaGly: 0.001 ± 0.0
XaaHis: 0.001 ± 0.0
XaaIle: 0.004 ± 0.0
XaaLys: 0.003 ± 0.0
XaaLeu: 0.005 ± 0.001
XaaMet: 0.003 ± 0.0
XaaAsn: 0.003 ± 0.0
XaaPro: 0.002 ± 0.0
XaaGln: 0.002 ± 0.0
XaaArg: 0.003 ± 0.0
XaaSer: 0.005 ± 0.001
XaaThr: 0.004 ± 0.0
XaaVal: 0.003 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.002 ± 0.0
XaaXaa: 1.713 ± 0.062
Statistics based on 50190 proteins (17055200 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
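To make the note concrete, here is a minimal sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. The function names, the fixed seed, the use of the sample standard deviation, and the toy sequences are all assumptions for illustration; the page does not describe the exact procedure beyond "bootstrapping (x100) at the protein level".

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Made-up sequences for illustration only.
toy_proteins = ["MKKLA", "AKKE", "MLLKKA", "MAEEL"]
mean, err = bootstrap_error(toy_proteins, "KK")
print(f"KK: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.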

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski