Amino acid dipepetide frequency for Tanay virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.627AlaAla: 5.627 ± 2.226
1.324AlaCys: 1.324 ± 0.837
2.317AlaAsp: 2.317 ± 0.646
1.324AlaGlu: 1.324 ± 0.694
1.986AlaPhe: 1.986 ± 1.041
1.655AlaGly: 1.655 ± 0.86
1.986AlaHis: 1.986 ± 0.916
3.641AlaIle: 3.641 ± 1.497
3.972AlaLys: 3.972 ± 0.999
4.303AlaLeu: 4.303 ± 1.056
0.662AlaMet: 0.662 ± 0.347
2.979AlaAsn: 2.979 ± 0.666
1.655AlaPro: 1.655 ± 0.693
0.662AlaGln: 0.662 ± 0.347
0.993AlaArg: 0.993 ± 0.849
2.648AlaSer: 2.648 ± 1.06
2.317AlaThr: 2.317 ± 0.541
3.31AlaVal: 3.31 ± 1.132
0.331AlaTrp: 0.331 ± 0.56
4.303AlaTyr: 4.303 ± 2.164
0.0AlaXaa: 0.0 ± 0.0
Cys
1.324CysAla: 1.324 ± 0.837
0.331CysCys: 0.331 ± 0.173
0.993CysAsp: 0.993 ± 0.52
1.986CysGlu: 1.986 ± 1.041
2.317CysPhe: 2.317 ± 1.567
0.0CysGly: 0.0 ± 0.0
0.662CysHis: 0.662 ± 0.347
2.648CysIle: 2.648 ± 0.803
1.324CysLys: 1.324 ± 0.694
1.655CysLeu: 1.655 ± 0.377
0.0CysMet: 0.0 ± 0.0
1.324CysAsn: 1.324 ± 0.815
1.324CysPro: 1.324 ± 0.849
0.993CysGln: 0.993 ± 0.52
0.662CysArg: 0.662 ± 0.347
3.641CysSer: 3.641 ± 0.801
1.986CysThr: 1.986 ± 0.916
1.324CysVal: 1.324 ± 0.308
0.0CysTrp: 0.0 ± 0.0
1.986CysTyr: 1.986 ± 0.655
0.0CysXaa: 0.0 ± 0.0
Asp
1.986AspAla: 1.986 ± 1.041
2.648AspCys: 2.648 ± 0.803
2.979AspAsp: 2.979 ± 1.561
1.324AspGlu: 1.324 ± 0.308
3.972AspPhe: 3.972 ± 3.117
2.317AspGly: 2.317 ± 0.646
1.655AspHis: 1.655 ± 0.377
4.303AspIle: 4.303 ± 1.142
1.655AspLys: 1.655 ± 0.867
6.62AspLeu: 6.62 ± 1.507
1.655AspMet: 1.655 ± 0.867
1.986AspAsn: 1.986 ± 1.041
2.648AspPro: 2.648 ± 1.387
0.993AspGln: 0.993 ± 0.328
1.986AspArg: 1.986 ± 1.041
4.965AspSer: 4.965 ± 1.638
3.31AspThr: 3.31 ± 1.132
5.627AspVal: 5.627 ± 1.02
0.0AspTrp: 0.0 ± 0.0
4.303AspTyr: 4.303 ± 1.084
0.0AspXaa: 0.0 ± 0.0
Glu
1.324GluAla: 1.324 ± 0.694
1.324GluCys: 1.324 ± 0.308
1.655GluAsp: 1.655 ± 0.738
2.648GluGlu: 2.648 ± 0.803
3.972GluPhe: 3.972 ± 0.923
0.331GluGly: 0.331 ± 0.173
1.655GluHis: 1.655 ± 0.693
5.627GluIle: 5.627 ± 0.478
2.648GluLys: 2.648 ± 0.803
4.634GluLeu: 4.634 ± 1.037
2.317GluMet: 2.317 ± 1.214
1.324GluAsn: 1.324 ± 0.694
3.31GluPro: 3.31 ± 1.734
1.655GluGln: 1.655 ± 0.738
2.648GluArg: 2.648 ± 1.673
2.979GluSer: 2.979 ± 1.581
2.648GluThr: 2.648 ± 0.803
4.634GluVal: 4.634 ± 1.81
0.0GluTrp: 0.0 ± 0.0
3.641GluTyr: 3.641 ± 0.801
0.0GluXaa: 0.0 ± 0.0
Phe
2.648PheAla: 2.648 ± 0.615
1.655PheCys: 1.655 ± 0.867
5.627PheAsp: 5.627 ± 0.825
3.641PheGlu: 3.641 ± 1.908
3.972PhePhe: 3.972 ± 0.522
1.986PheGly: 1.986 ± 1.041
0.993PheHis: 0.993 ± 0.328
6.289PheIle: 6.289 ± 3.275
4.303PheLys: 4.303 ± 0.351
3.972PheLeu: 3.972 ± 1.194
0.662PheMet: 0.662 ± 0.424
3.641PheAsn: 3.641 ± 1.497
1.324PhePro: 1.324 ± 0.694
0.993PheGln: 0.993 ± 0.52
1.324PheArg: 1.324 ± 0.308
6.289PheSer: 6.289 ± 1.578
4.634PheThr: 4.634 ± 0.896
9.268PheVal: 9.268 ± 4.028
0.662PheTrp: 0.662 ± 0.424
1.324PheTyr: 1.324 ± 0.308
0.0PheXaa: 0.0 ± 0.0
Gly
2.317GlyAla: 2.317 ± 0.646
0.0GlyCys: 0.0 ± 0.0
3.31GlyAsp: 3.31 ± 1.132
0.331GlyGlu: 0.331 ± 0.173
1.655GlyPhe: 1.655 ± 0.867
1.986GlyGly: 1.986 ± 1.273
0.662GlyHis: 0.662 ± 0.424
1.324GlyIle: 1.324 ± 0.837
2.648GlyLys: 2.648 ± 1.104
2.317GlyLeu: 2.317 ± 0.611
0.0GlyMet: 0.0 ± 0.0
1.986GlyAsn: 1.986 ± 0.5
0.0GlyPro: 0.0 ± 0.0
0.0GlyGln: 0.0 ± 0.0
1.655GlyArg: 1.655 ± 0.693
2.317GlySer: 2.317 ± 0.646
1.324GlyThr: 1.324 ± 0.815
1.986GlyVal: 1.986 ± 0.597
0.0GlyTrp: 0.0 ± 0.0
0.662GlyTyr: 0.662 ± 1.119
0.0GlyXaa: 0.0 ± 0.0
His
0.993HisAla: 0.993 ± 0.52
1.324HisCys: 1.324 ± 0.308
0.993HisAsp: 0.993 ± 0.52
1.986HisGlu: 1.986 ± 0.597
1.324HisPhe: 1.324 ± 0.308
1.986HisGly: 1.986 ± 0.597
0.662HisHis: 0.662 ± 0.347
2.648HisIle: 2.648 ± 0.803
1.324HisLys: 1.324 ± 0.694
3.31HisLeu: 3.31 ± 0.318
0.331HisMet: 0.331 ± 0.56
0.993HisAsn: 0.993 ± 0.978
0.331HisPro: 0.331 ± 0.173
0.0HisGln: 0.0 ± 0.0
0.662HisArg: 0.662 ± 0.347
1.986HisSer: 1.986 ± 0.942
2.648HisThr: 2.648 ± 0.615
2.979HisVal: 2.979 ± 1.581
0.0HisTrp: 0.0 ± 0.0
2.317HisTyr: 2.317 ± 0.646
0.0HisXaa: 0.0 ± 0.0
Ile
4.965IleAla: 4.965 ± 0.074
3.641IleCys: 3.641 ± 0.801
4.965IleAsp: 4.965 ± 1.131
4.303IleGlu: 4.303 ± 2.428
4.303IlePhe: 4.303 ± 2.478
3.31IleGly: 3.31 ± 1.553
1.986IleHis: 1.986 ± 0.655
6.62IleIle: 6.62 ± 5.217
5.296IleLys: 5.296 ± 2.12
6.62IleLeu: 6.62 ± 1.094
1.986IleMet: 1.986 ± 1.559
5.958IleAsn: 5.958 ± 1.499
2.979IlePro: 2.979 ± 2.859
1.324IleGln: 1.324 ± 0.308
2.979IleArg: 2.979 ± 0.589
3.31IleSer: 3.31 ± 1.476
5.296IleThr: 5.296 ± 1.241
5.296IleVal: 5.296 ± 1.783
0.331IleTrp: 0.331 ± 0.173
1.986IleTyr: 1.986 ± 0.655
0.0IleXaa: 0.0 ± 0.0
Lys
1.324LysAla: 1.324 ± 0.694
1.986LysCys: 1.986 ± 1.041
3.31LysAsp: 3.31 ± 0.681
4.634LysGlu: 4.634 ± 1.81
6.289LysPhe: 6.289 ± 0.901
1.986LysGly: 1.986 ± 1.273
2.648LysHis: 2.648 ± 0.615
5.296LysIle: 5.296 ± 2.74
3.972LysLys: 3.972 ± 0.262
5.958LysLeu: 5.958 ± 1.932
0.993LysMet: 0.993 ± 0.978
3.972LysAsn: 3.972 ± 0.262
2.648LysPro: 2.648 ± 1.06
1.655LysGln: 1.655 ± 0.377
2.648LysArg: 2.648 ± 0.538
5.958LysSer: 5.958 ± 0.776
3.972LysThr: 3.972 ± 5.366
4.965LysVal: 4.965 ± 2.079
0.331LysTrp: 0.331 ± 0.173
3.641LysTyr: 3.641 ± 0.868
0.0LysXaa: 0.0 ± 0.0
Leu
2.648LeuAla: 2.648 ± 0.538
1.986LeuCys: 1.986 ± 1.698
5.627LeuAsp: 5.627 ± 1.541
4.303LeuGlu: 4.303 ± 0.966
5.296LeuPhe: 5.296 ± 0.192
2.317LeuGly: 2.317 ± 1.214
0.993LeuHis: 0.993 ± 0.328
7.282LeuIle: 7.282 ± 2.556
9.599LeuLys: 9.599 ± 2.349
6.289LeuLeu: 6.289 ± 0.901
1.324LeuMet: 1.324 ± 1.536
6.289LeuAsn: 6.289 ± 0.701
3.641LeuPro: 3.641 ± 0.694
3.641LeuGln: 3.641 ± 0.868
3.972LeuArg: 3.972 ± 0.923
7.613LeuSer: 7.613 ± 2.25
5.958LeuThr: 5.958 ± 1.946
5.627LeuVal: 5.627 ± 0.358
1.324LeuTrp: 1.324 ± 1.281
4.965LeuTyr: 4.965 ± 1.131
0.0LeuXaa: 0.0 ± 0.0
Met
0.662MetAla: 0.662 ± 0.347
0.331MetCys: 0.331 ± 0.173
1.655MetAsp: 1.655 ± 0.867
0.662MetGlu: 0.662 ± 0.347
1.655MetPhe: 1.655 ± 0.693
0.0MetGly: 0.0 ± 0.0
0.331MetHis: 0.331 ± 0.971
1.324MetIle: 1.324 ± 1.281
1.986MetLys: 1.986 ± 0.655
2.317MetLeu: 2.317 ± 0.775
1.986MetMet: 1.986 ± 0.669
0.662MetAsn: 0.662 ± 1.119
0.331MetPro: 0.331 ± 0.971
0.993MetGln: 0.993 ± 0.953
1.655MetArg: 1.655 ± 0.867
1.986MetSer: 1.986 ± 1.041
0.662MetThr: 0.662 ± 0.424
0.993MetVal: 0.993 ± 0.849
0.0MetTrp: 0.0 ± 0.0
0.662MetTyr: 0.662 ± 0.347
0.0MetXaa: 0.0 ± 0.0
Asn
3.641AsnAla: 3.641 ± 0.868
0.662AsnCys: 0.662 ± 0.424
2.979AsnAsp: 2.979 ± 0.666
2.979AsnGlu: 2.979 ± 0.666
4.634AsnPhe: 4.634 ± 1.238
2.648AsnGly: 2.648 ± 1.104
1.324AsnHis: 1.324 ± 0.308
3.641AsnIle: 3.641 ± 1.385
4.303AsnLys: 4.303 ± 0.351
5.958AsnLeu: 5.958 ± 1.52
1.655AsnMet: 1.655 ± 0.86
3.31AsnAsn: 3.31 ± 0.866
2.979AsnPro: 2.979 ± 1.224
1.324AsnGln: 1.324 ± 0.837
1.986AsnArg: 1.986 ± 0.916
3.641AsnSer: 3.641 ± 0.234
2.648AsnThr: 2.648 ± 0.803
3.972AsnVal: 3.972 ± 0.999
0.331AsnTrp: 0.331 ± 0.173
3.641AsnTyr: 3.641 ± 0.694
0.0AsnXaa: 0.0 ± 0.0
Pro
1.986ProAla: 1.986 ± 0.597
0.662ProCys: 0.662 ± 0.347
2.979ProAsp: 2.979 ± 1.224
3.31ProGlu: 3.31 ± 0.866
1.986ProPhe: 1.986 ± 0.597
0.331ProGly: 0.331 ± 0.173
0.662ProHis: 0.662 ± 0.347
1.324ProIle: 1.324 ± 0.308
3.31ProLys: 3.31 ± 1.39
2.979ProLeu: 2.979 ± 1.039
0.662ProMet: 0.662 ± 0.895
1.324ProAsn: 1.324 ± 0.308
0.993ProPro: 0.993 ± 0.328
1.324ProGln: 1.324 ± 0.849
0.993ProArg: 0.993 ± 0.328
3.641ProSer: 3.641 ± 1.282
2.317ProThr: 2.317 ± 1.214
4.303ProVal: 4.303 ± 0.351
0.331ProTrp: 0.331 ± 0.173
3.31ProTyr: 3.31 ± 0.681
0.0ProXaa: 0.0 ± 0.0
Gln
0.993GlnAla: 0.993 ± 0.978
0.993GlnCys: 0.993 ± 0.52
1.986GlnAsp: 1.986 ± 1.041
0.0GlnGlu: 0.0 ± 0.0
0.662GlnPhe: 0.662 ± 0.895
0.993GlnGly: 0.993 ± 0.52
0.662GlnHis: 0.662 ± 0.424
3.31GlnIle: 3.31 ± 2.122
0.993GlnLys: 0.993 ± 0.52
2.648GlnLeu: 2.648 ± 0.615
0.993GlnMet: 0.993 ± 0.6
2.648GlnAsn: 2.648 ± 0.538
0.662GlnPro: 0.662 ± 0.424
0.331GlnGln: 0.331 ± 0.971
1.986GlnArg: 1.986 ± 1.041
1.324GlnSer: 1.324 ± 0.308
0.993GlnThr: 0.993 ± 0.52
1.655GlnVal: 1.655 ± 0.738
0.662GlnTrp: 0.662 ± 0.424
1.655GlnTyr: 1.655 ± 0.86
0.0GlnXaa: 0.0 ± 0.0
Arg
1.324ArgAla: 1.324 ± 0.694
1.655ArgCys: 1.655 ± 0.377
1.324ArgAsp: 1.324 ± 0.308
2.317ArgGlu: 2.317 ± 1.214
3.31ArgPhe: 3.31 ± 1.132
0.662ArgGly: 0.662 ± 0.424
1.655ArgHis: 1.655 ± 0.867
2.648ArgIle: 2.648 ± 1.06
3.31ArgLys: 3.31 ± 1.72
3.641ArgLeu: 3.641 ± 1.3
0.662ArgMet: 0.662 ± 0.895
3.972ArgAsn: 3.972 ± 1.469
0.993ArgPro: 0.993 ± 0.849
0.662ArgGln: 0.662 ± 0.424
0.993ArgArg: 0.993 ± 0.52
2.979ArgSer: 2.979 ± 1.224
3.31ArgThr: 3.31 ± 1.386
2.317ArgVal: 2.317 ± 0.999
0.0ArgTrp: 0.0 ± 0.0
2.317ArgTyr: 2.317 ± 1.214
0.0ArgXaa: 0.0 ± 0.0
Ser
4.634SerAla: 4.634 ± 1.083
0.993SerCys: 0.993 ± 0.978
3.972SerAsp: 3.972 ± 0.999
4.634SerGlu: 4.634 ± 1.037
2.979SerPhe: 2.979 ± 0.666
1.986SerGly: 1.986 ± 0.942
2.979SerHis: 2.979 ± 0.589
4.303SerIle: 4.303 ± 0.351
5.296SerLys: 5.296 ± 2.637
8.937SerLeu: 8.937 ± 0.203
0.993SerMet: 0.993 ± 0.52
4.634SerAsn: 4.634 ± 0.185
4.303SerPro: 4.303 ± 1.255
2.979SerGln: 2.979 ± 1.224
1.655SerArg: 1.655 ± 0.867
5.958SerSer: 5.958 ± 0.396
4.303SerThr: 4.303 ± 1.255
5.296SerVal: 5.296 ± 2.011
0.662SerTrp: 0.662 ± 0.347
3.31SerTyr: 3.31 ± 1.72
0.0SerXaa: 0.0 ± 0.0
Thr
4.634ThrAla: 4.634 ± 1.549
1.324ThrCys: 1.324 ± 0.694
2.317ThrAsp: 2.317 ± 1.214
2.648ThrGlu: 2.648 ± 1.673
5.296ThrPhe: 5.296 ± 1.222
0.331ThrGly: 0.331 ± 0.173
2.317ThrHis: 2.317 ± 0.611
6.289ThrIle: 6.289 ± 1.906
3.641ThrLys: 3.641 ± 0.801
5.627ThrLeu: 5.627 ± 2.634
1.324ThrMet: 1.324 ± 0.694
1.986ThrAsn: 1.986 ± 1.659
3.972ThrPro: 3.972 ± 1.373
1.655ThrGln: 1.655 ± 0.377
3.31ThrArg: 3.31 ± 0.754
4.965ThrSer: 4.965 ± 3.123
4.965ThrThr: 4.965 ± 1.385
3.31ThrVal: 3.31 ± 1.39
0.0ThrTrp: 0.0 ± 0.0
2.979ThrTyr: 2.979 ± 1.224
0.0ThrXaa: 0.0 ± 0.0
Val
3.641ValAla: 3.641 ± 2.511
1.986ValCys: 1.986 ± 1.659
2.979ValAsp: 2.979 ± 0.983
4.303ValGlu: 4.303 ± 1.142
4.965ValPhe: 4.965 ± 1.453
0.993ValGly: 0.993 ± 0.52
2.648ValHis: 2.648 ± 1.06
4.965ValIle: 4.965 ± 1.01
5.627ValLys: 5.627 ± 0.825
8.275ValLeu: 8.275 ± 0.872
0.993ValMet: 0.993 ± 0.328
3.972ValAsn: 3.972 ± 0.522
2.979ValPro: 2.979 ± 1.503
3.641ValGln: 3.641 ± 0.234
4.303ValArg: 4.303 ± 1.639
5.958ValSer: 5.958 ± 0.776
5.627ValThr: 5.627 ± 2.079
6.951ValVal: 6.951 ± 1.834
0.0ValTrp: 0.0 ± 0.0
3.31ValTyr: 3.31 ± 0.754
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.331TrpAsp: 0.331 ± 0.173
0.331TrpGlu: 0.331 ± 0.173
0.662TrpPhe: 0.662 ± 0.424
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.662TrpLeu: 0.662 ± 0.347
0.331TrpMet: 0.331 ± 0.173
0.331TrpAsn: 0.331 ± 0.56
0.0TrpPro: 0.0 ± 0.0
0.331TrpGln: 0.331 ± 0.56
0.0TrpArg: 0.0 ± 0.0
0.662TrpSer: 0.662 ± 0.424
0.331TrpThr: 0.331 ± 0.971
0.331TrpVal: 0.331 ± 0.173
0.0TrpTrp: 0.0 ± 0.0
0.993TrpTyr: 0.993 ± 0.978
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.324TyrAla: 1.324 ± 0.694
1.655TyrCys: 1.655 ± 0.377
4.634TyrAsp: 4.634 ± 1.292
3.31TyrGlu: 3.31 ± 0.318
3.972TyrPhe: 3.972 ± 1.643
0.662TyrGly: 0.662 ± 0.347
2.317TyrHis: 2.317 ± 0.611
3.972TyrIle: 3.972 ± 0.522
2.979TyrLys: 2.979 ± 0.983
3.972TyrLeu: 3.972 ± 1.407
1.324TyrMet: 1.324 ± 1.791
4.965TyrAsn: 4.965 ± 1.447
1.655TyrPro: 1.655 ± 0.867
0.993TyrGln: 0.993 ± 0.328
3.641TyrArg: 3.641 ± 0.801
1.986TyrSer: 1.986 ± 0.655
3.641TyrThr: 3.641 ± 0.801
3.972TyrVal: 3.972 ± 1.373
0.331TyrTrp: 0.331 ± 0.56
4.303TyrTyr: 4.303 ± 1.795
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3022 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski