Amino acid dipepetide frequency for Tomato aspermy virus (TAV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.111AlaAla: 4.111 ± 1.546
0.343AlaCys: 0.343 ± 0.237
5.824AlaAsp: 5.824 ± 1.009
4.796AlaGlu: 4.796 ± 0.831
3.426AlaPhe: 3.426 ± 0.7
4.111AlaGly: 4.111 ± 0.76
1.713AlaHis: 1.713 ± 0.615
4.454AlaIle: 4.454 ± 1.338
1.028AlaLys: 1.028 ± 0.788
5.824AlaLeu: 5.824 ± 1.287
1.37AlaMet: 1.37 ± 0.949
1.713AlaAsn: 1.713 ± 0.56
2.741AlaPro: 2.741 ± 0.355
2.741AlaGln: 2.741 ± 1.045
3.083AlaArg: 3.083 ± 1.317
7.879AlaSer: 7.879 ± 2.088
2.055AlaThr: 2.055 ± 0.396
5.481AlaVal: 5.481 ± 1.643
0.343AlaTrp: 0.343 ± 0.263
1.37AlaTyr: 1.37 ± 0.689
0.0AlaXaa: 0.0 ± 0.0
Cys
2.055CysAla: 2.055 ± 0.618
0.343CysCys: 0.343 ± 0.237
2.741CysAsp: 2.741 ± 0.787
0.685CysGlu: 0.685 ± 0.24
1.37CysPhe: 1.37 ± 0.468
1.713CysGly: 1.713 ± 0.596
0.0CysHis: 0.0 ± 0.0
0.343CysIle: 0.343 ± 0.237
0.685CysLys: 0.685 ± 0.475
3.768CysLeu: 3.768 ± 1.133
0.685CysMet: 0.685 ± 0.475
0.0CysAsn: 0.0 ± 0.0
1.713CysPro: 1.713 ± 0.481
0.343CysGln: 0.343 ± 0.237
1.37CysArg: 1.37 ± 0.763
2.398CysSer: 2.398 ± 0.489
0.685CysThr: 0.685 ± 0.475
2.055CysVal: 2.055 ± 0.666
0.0CysTrp: 0.0 ± 0.0
1.028CysTyr: 1.028 ± 0.444
0.0CysXaa: 0.0 ± 0.0
Asp
3.426AspAla: 3.426 ± 0.967
1.713AspCys: 1.713 ± 0.76
4.111AspAsp: 4.111 ± 1.837
5.824AspGlu: 5.824 ± 0.682
3.426AspPhe: 3.426 ± 1.328
2.741AspGly: 2.741 ± 0.959
1.028AspHis: 1.028 ± 0.399
2.398AspIle: 2.398 ± 1.352
5.481AspLys: 5.481 ± 0.602
7.537AspLeu: 7.537 ± 1.391
2.398AspMet: 2.398 ± 1.031
2.398AspAsn: 2.398 ± 0.516
2.055AspPro: 2.055 ± 0.888
1.028AspGln: 1.028 ± 0.623
4.111AspArg: 4.111 ± 0.685
4.454AspSer: 4.454 ± 0.725
5.481AspThr: 5.481 ± 1.873
4.796AspVal: 4.796 ± 1.167
0.343AspTrp: 0.343 ± 0.513
1.37AspTyr: 1.37 ± 0.481
0.0AspXaa: 0.0 ± 0.0
Glu
3.426GluAla: 3.426 ± 0.359
0.685GluCys: 0.685 ± 0.24
2.741GluAsp: 2.741 ± 1.193
2.055GluGlu: 2.055 ± 0.888
0.343GluPhe: 0.343 ± 0.237
3.426GluGly: 3.426 ± 0.886
2.055GluHis: 2.055 ± 0.582
3.426GluIle: 3.426 ± 1.068
1.713GluLys: 1.713 ± 0.514
6.509GluLeu: 6.509 ± 1.323
2.741GluMet: 2.741 ± 0.639
2.055GluAsn: 2.055 ± 0.746
2.398GluPro: 2.398 ± 0.818
3.083GluGln: 3.083 ± 1.332
3.426GluArg: 3.426 ± 1.37
3.768GluSer: 3.768 ± 1.074
3.768GluThr: 3.768 ± 0.206
2.055GluVal: 2.055 ± 0.721
0.685GluTrp: 0.685 ± 0.475
1.713GluTyr: 1.713 ± 0.56
0.0GluXaa: 0.0 ± 0.0
Phe
1.37PheAla: 1.37 ± 0.468
1.028PheCys: 1.028 ± 0.399
3.768PheAsp: 3.768 ± 0.833
1.028PheGlu: 1.028 ± 0.399
0.343PhePhe: 0.343 ± 0.237
3.083PheGly: 3.083 ± 0.717
2.055PheHis: 2.055 ± 0.996
2.398PheIle: 2.398 ± 0.656
2.055PheLys: 2.055 ± 0.899
2.741PheLeu: 2.741 ± 0.962
1.028PheMet: 1.028 ± 0.788
3.768PheAsn: 3.768 ± 0.622
1.37PhePro: 1.37 ± 0.611
1.37PheGln: 1.37 ± 0.829
0.685PheArg: 0.685 ± 0.475
5.824PheSer: 5.824 ± 2.713
2.741PheThr: 2.741 ± 0.862
2.741PheVal: 2.741 ± 0.931
0.0PheTrp: 0.0 ± 0.0
1.028PheTyr: 1.028 ± 0.444
0.0PheXaa: 0.0 ± 0.0
Gly
2.398GlyAla: 2.398 ± 0.656
0.685GlyCys: 0.685 ± 0.24
4.796GlyAsp: 4.796 ± 0.781
2.055GlyGlu: 2.055 ± 0.396
1.028GlyPhe: 1.028 ± 0.399
3.083GlyGly: 3.083 ± 1.075
1.713GlyHis: 1.713 ± 0.653
3.426GlyIle: 3.426 ± 0.741
3.768GlyLys: 3.768 ± 0.206
4.454GlyLeu: 4.454 ± 1.428
1.028GlyMet: 1.028 ± 0.399
0.685GlyAsn: 0.685 ± 0.525
1.028GlyPro: 1.028 ± 0.587
1.713GlyGln: 1.713 ± 0.514
3.083GlyArg: 3.083 ± 1.004
4.454GlySer: 4.454 ± 1.373
4.796GlyThr: 4.796 ± 0.875
2.741GlyVal: 2.741 ± 0.458
0.0GlyTrp: 0.0 ± 0.0
4.454GlyTyr: 4.454 ± 1.873
0.0GlyXaa: 0.0 ± 0.0
His
1.37HisAla: 1.37 ± 0.562
0.685HisCys: 0.685 ± 0.475
1.37HisAsp: 1.37 ± 0.532
1.028HisGlu: 1.028 ± 0.623
1.713HisPhe: 1.713 ± 0.615
2.398HisGly: 2.398 ± 1.301
0.685HisHis: 0.685 ± 0.63
1.37HisIle: 1.37 ± 0.457
1.713HisLys: 1.713 ± 1.122
1.028HisLeu: 1.028 ± 0.444
0.685HisMet: 0.685 ± 0.525
2.398HisAsn: 2.398 ± 1.818
1.37HisPro: 1.37 ± 0.788
1.37HisGln: 1.37 ± 0.457
1.37HisArg: 1.37 ± 0.689
1.37HisSer: 1.37 ± 0.481
1.028HisThr: 1.028 ± 0.712
1.713HisVal: 1.713 ± 0.466
0.685HisTrp: 0.685 ± 0.63
1.028HisTyr: 1.028 ± 0.399
0.0HisXaa: 0.0 ± 0.0
Ile
3.768IleAla: 3.768 ± 1.027
1.37IleCys: 1.37 ± 0.562
4.111IleAsp: 4.111 ± 1.5
3.083IleGlu: 3.083 ± 0.805
0.343IlePhe: 0.343 ± 0.263
2.741IleGly: 2.741 ± 0.737
1.713IleHis: 1.713 ± 0.466
2.055IleIle: 2.055 ± 0.864
3.083IleLys: 3.083 ± 1.078
3.426IleLeu: 3.426 ± 0.709
0.685IleMet: 0.685 ± 0.545
1.713IleAsn: 1.713 ± 0.739
3.768IlePro: 3.768 ± 1.043
3.083IleGln: 3.083 ± 0.834
2.398IleArg: 2.398 ± 1.051
4.111IleSer: 4.111 ± 1.457
2.055IleThr: 2.055 ± 0.505
3.768IleVal: 3.768 ± 0.547
1.028IleTrp: 1.028 ± 0.492
1.713IleTyr: 1.713 ± 0.727
0.0IleXaa: 0.0 ± 0.0
Lys
3.768LysAla: 3.768 ± 1.026
2.741LysCys: 2.741 ± 0.846
3.083LysAsp: 3.083 ± 0.892
2.398LysGlu: 2.398 ± 0.516
3.426LysPhe: 3.426 ± 0.6
2.398LysGly: 2.398 ± 1.123
0.343LysHis: 0.343 ± 0.263
2.398LysIle: 2.398 ± 0.556
5.481LysLys: 5.481 ± 1.624
4.111LysLeu: 4.111 ± 1.118
1.028LysMet: 1.028 ± 0.418
2.398LysAsn: 2.398 ± 0.489
2.398LysPro: 2.398 ± 1.029
1.713LysGln: 1.713 ± 0.739
2.398LysArg: 2.398 ± 1.191
5.824LysSer: 5.824 ± 1.0
3.083LysThr: 3.083 ± 1.548
3.768LysVal: 3.768 ± 1.026
1.37LysTrp: 1.37 ± 0.481
2.741LysTyr: 2.741 ± 0.689
0.0LysXaa: 0.0 ± 0.0
Leu
6.852LeuAla: 6.852 ± 1.722
3.768LeuCys: 3.768 ± 0.755
4.796LeuAsp: 4.796 ± 1.662
4.454LeuGlu: 4.454 ± 0.792
4.796LeuPhe: 4.796 ± 0.781
4.111LeuGly: 4.111 ± 0.94
3.768LeuHis: 3.768 ± 0.947
5.824LeuIle: 5.824 ± 1.535
5.139LeuLys: 5.139 ± 1.234
7.194LeuLeu: 7.194 ± 1.002
1.028LeuMet: 1.028 ± 0.399
5.139LeuAsn: 5.139 ± 1.574
5.481LeuPro: 5.481 ± 2.144
2.055LeuGln: 2.055 ± 0.792
5.481LeuArg: 5.481 ± 0.814
7.537LeuSer: 7.537 ± 1.711
5.824LeuThr: 5.824 ± 0.581
6.509LeuVal: 6.509 ± 1.844
0.343LeuTrp: 0.343 ± 0.513
1.713LeuTyr: 1.713 ± 0.664
0.0LeuXaa: 0.0 ± 0.0
Met
4.111MetAla: 4.111 ± 1.355
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.37MetGlu: 1.37 ± 0.647
1.028MetPhe: 1.028 ± 0.399
1.028MetGly: 1.028 ± 0.492
0.685MetHis: 0.685 ± 0.345
0.343MetIle: 0.343 ± 0.263
0.685MetLys: 0.685 ± 0.24
1.028MetLeu: 1.028 ± 0.399
0.343MetMet: 0.343 ± 0.237
1.37MetAsn: 1.37 ± 0.562
0.343MetPro: 0.343 ± 0.263
0.343MetGln: 0.343 ± 0.237
2.398MetArg: 2.398 ± 1.123
4.454MetSer: 4.454 ± 0.984
2.055MetThr: 2.055 ± 0.798
2.055MetVal: 2.055 ± 0.746
0.685MetTrp: 0.685 ± 0.475
0.343MetTyr: 0.343 ± 0.237
0.0MetXaa: 0.0 ± 0.0
Asn
3.768AsnAla: 3.768 ± 0.849
1.028AsnCys: 1.028 ± 0.444
1.028AsnAsp: 1.028 ± 0.712
1.028AsnGlu: 1.028 ± 0.712
0.685AsnPhe: 0.685 ± 0.525
4.111AsnGly: 4.111 ± 1.58
0.343AsnHis: 0.343 ± 0.263
2.398AsnIle: 2.398 ± 0.541
1.713AsnLys: 1.713 ± 0.56
2.741AsnLeu: 2.741 ± 0.678
1.37AsnMet: 1.37 ± 0.468
5.824AsnAsn: 5.824 ± 3.198
1.37AsnPro: 1.37 ± 0.457
1.37AsnGln: 1.37 ± 0.562
3.083AsnArg: 3.083 ± 1.022
3.426AsnSer: 3.426 ± 1.417
1.37AsnThr: 1.37 ± 1.05
3.083AsnVal: 3.083 ± 1.103
1.028AsnTrp: 1.028 ± 0.587
1.028AsnTyr: 1.028 ± 0.582
0.0AsnXaa: 0.0 ± 0.0
Pro
2.398ProAla: 2.398 ± 1.029
0.343ProCys: 0.343 ± 0.237
2.055ProAsp: 2.055 ± 0.798
3.768ProGlu: 3.768 ± 1.041
0.343ProPhe: 0.343 ± 0.263
1.028ProGly: 1.028 ± 0.587
0.685ProHis: 0.685 ± 0.24
2.398ProIle: 2.398 ± 0.818
2.398ProLys: 2.398 ± 1.078
3.768ProLeu: 3.768 ± 1.111
1.028ProMet: 1.028 ± 0.587
1.37ProAsn: 1.37 ± 0.788
1.37ProPro: 1.37 ± 0.689
2.055ProGln: 2.055 ± 1.084
2.398ProArg: 2.398 ± 0.757
6.166ProSer: 6.166 ± 1.91
5.139ProThr: 5.139 ± 1.205
4.796ProVal: 4.796 ± 1.244
0.0ProTrp: 0.0 ± 0.0
1.37ProTyr: 1.37 ± 0.896
0.0ProXaa: 0.0 ± 0.0
Gln
2.398GlnAla: 2.398 ± 1.051
1.028GlnCys: 1.028 ± 0.788
1.028GlnAsp: 1.028 ± 0.712
1.37GlnGlu: 1.37 ± 0.468
1.37GlnPhe: 1.37 ± 0.689
1.028GlnGly: 1.028 ± 0.444
2.055GlnHis: 2.055 ± 0.983
0.685GlnIle: 0.685 ± 0.475
1.713GlnLys: 1.713 ± 0.547
4.111GlnLeu: 4.111 ± 1.465
0.343GlnMet: 0.343 ± 0.461
0.685GlnAsn: 0.685 ± 0.527
2.398GlnPro: 2.398 ± 1.24
2.055GlnGln: 2.055 ± 0.562
5.481GlnArg: 5.481 ± 1.381
2.055GlnSer: 2.055 ± 0.817
2.055GlnThr: 2.055 ± 0.721
4.454GlnVal: 4.454 ± 2.026
0.0GlnTrp: 0.0 ± 0.0
0.685GlnTyr: 0.685 ± 0.24
0.0GlnXaa: 0.0 ± 0.0
Arg
4.796ArgAla: 4.796 ± 1.78
2.398ArgCys: 2.398 ± 0.844
4.111ArgAsp: 4.111 ± 0.306
1.713ArgGlu: 1.713 ± 0.615
1.37ArgPhe: 1.37 ± 0.562
2.055ArgGly: 2.055 ± 0.679
2.398ArgHis: 2.398 ± 0.983
3.768ArgIle: 3.768 ± 1.223
3.426ArgLys: 3.426 ± 1.466
7.879ArgLeu: 7.879 ± 1.355
2.398ArgMet: 2.398 ± 0.775
2.055ArgAsn: 2.055 ± 0.491
3.768ArgPro: 3.768 ± 0.815
1.37ArgGln: 1.37 ± 0.774
6.166ArgArg: 6.166 ± 2.631
2.398ArgSer: 2.398 ± 0.997
4.111ArgThr: 4.111 ± 0.76
3.768ArgVal: 3.768 ± 0.713
1.028ArgTrp: 1.028 ± 0.587
1.37ArgTyr: 1.37 ± 0.637
0.0ArgXaa: 0.0 ± 0.0
Ser
5.824SerAla: 5.824 ± 1.684
1.37SerCys: 1.37 ± 0.696
6.166SerAsp: 6.166 ± 2.039
7.537SerGlu: 7.537 ± 1.376
4.454SerPhe: 4.454 ± 0.906
3.426SerGly: 3.426 ± 1.775
0.343SerHis: 0.343 ± 0.237
1.713SerIle: 1.713 ± 0.547
9.592SerLys: 9.592 ± 0.599
8.565SerLeu: 8.565 ± 2.168
1.37SerMet: 1.37 ± 0.689
4.454SerAsn: 4.454 ± 0.603
3.768SerPro: 3.768 ± 0.569
3.768SerGln: 3.768 ± 0.701
4.454SerArg: 4.454 ± 1.258
7.537SerSer: 7.537 ± 0.311
6.852SerThr: 6.852 ± 0.99
6.166SerVal: 6.166 ± 2.182
0.685SerTrp: 0.685 ± 0.475
2.741SerTyr: 2.741 ± 1.104
0.0SerXaa: 0.0 ± 0.0
Thr
4.454ThrAla: 4.454 ± 0.989
0.343ThrCys: 0.343 ± 0.237
4.454ThrAsp: 4.454 ± 0.532
3.083ThrGlu: 3.083 ± 0.718
6.852ThrPhe: 6.852 ± 1.461
3.768ThrGly: 3.768 ± 0.854
1.37ThrHis: 1.37 ± 0.611
3.083ThrIle: 3.083 ± 0.718
1.713ThrLys: 1.713 ± 0.481
7.879ThrLeu: 7.879 ± 0.6
1.713ThrMet: 1.713 ± 0.837
0.685ThrAsn: 0.685 ± 0.475
1.37ThrPro: 1.37 ± 0.481
3.083ThrGln: 3.083 ± 0.267
5.481ThrArg: 5.481 ± 0.694
5.824ThrSer: 5.824 ± 1.536
6.509ThrThr: 6.509 ± 1.158
1.713ThrVal: 1.713 ± 0.481
0.0ThrTrp: 0.0 ± 0.0
3.768ThrTyr: 3.768 ± 0.547
0.0ThrXaa: 0.0 ± 0.0
Val
3.083ValAla: 3.083 ± 0.718
2.398ValCys: 2.398 ± 1.029
5.139ValAsp: 5.139 ± 1.234
1.37ValGlu: 1.37 ± 0.524
2.398ValPhe: 2.398 ± 0.99
3.768ValGly: 3.768 ± 0.413
2.055ValHis: 2.055 ± 0.721
4.796ValIle: 4.796 ± 1.22
3.083ValLys: 3.083 ± 1.198
6.509ValLeu: 6.509 ± 0.999
1.37ValMet: 1.37 ± 0.465
2.741ValAsn: 2.741 ± 0.478
5.139ValPro: 5.139 ± 0.884
2.398ValGln: 2.398 ± 0.757
4.796ValArg: 4.796 ± 1.181
7.194ValSer: 7.194 ± 1.749
5.139ValThr: 5.139 ± 0.861
4.111ValVal: 4.111 ± 1.852
0.0ValTrp: 0.0 ± 0.0
1.028ValTyr: 1.028 ± 0.492
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.37TrpCys: 1.37 ± 0.562
0.343TrpAsp: 0.343 ± 0.237
0.343TrpGlu: 0.343 ± 0.237
1.028TrpPhe: 1.028 ± 0.587
0.0TrpGly: 0.0 ± 0.0
0.343TrpHis: 0.343 ± 0.513
0.0TrpIle: 0.0 ± 0.0
0.685TrpLys: 0.685 ± 0.475
0.685TrpLeu: 0.685 ± 0.475
0.685TrpMet: 0.685 ± 0.63
0.343TrpAsn: 0.343 ± 0.237
0.0TrpPro: 0.0 ± 0.0
1.028TrpGln: 1.028 ± 0.582
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.37TrpVal: 1.37 ± 0.564
0.343TrpTrp: 0.343 ± 0.263
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.028TyrAla: 1.028 ± 0.492
0.343TyrCys: 0.343 ± 0.237
4.454TyrAsp: 4.454 ± 1.089
3.083TyrGlu: 3.083 ± 0.542
0.685TyrPhe: 0.685 ± 0.442
1.37TyrGly: 1.37 ± 0.392
1.028TyrHis: 1.028 ± 0.399
3.083TyrIle: 3.083 ± 0.542
1.713TyrLys: 1.713 ± 0.461
2.055TyrLeu: 2.055 ± 0.564
1.028TyrMet: 1.028 ± 0.582
0.0TyrAsn: 0.0 ± 0.0
1.028TyrPro: 1.028 ± 0.559
1.028TyrGln: 1.028 ± 0.559
0.685TyrArg: 0.685 ± 0.24
4.111TyrSer: 4.111 ± 1.21
2.398TyrThr: 2.398 ± 1.235
1.37TyrVal: 1.37 ± 0.958
0.0TyrTrp: 0.0 ± 0.0
0.343TyrTyr: 0.343 ± 0.263
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2920 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski