Amino acid dipeptide frequency for Thelephora terrestris virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and underrepresented ones are marked in blue in the table.
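The expectation and the red/blue marking described above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the rule that the coloring threshold is exactly the uniform expectation is an assumption, since the page does not state how the threshold is chosen.

```python
from itertools import product

# The 20 standard amino acids in three-letter code, in the table's column order.
STANDARD = ["Ala", "Cys", "Asp", "Glu", "Phe", "Gly", "His", "Ile", "Lys", "Leu",
            "Met", "Asn", "Pro", "Gln", "Arg", "Ser", "Thr", "Val", "Trp", "Tyr"]


def expected_per_mille(alphabet_size: int = 20) -> float:
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille: float, expected: float) -> str:
    """Label a dipeptide relative to the expectation (assumed coloring rule)."""
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


n_dipeptides = len(list(product(STANDARD, repeat=2)))  # 20 * 20 ordered pairs
expected = expected_per_mille()                        # 2.5 per mille = 0.25%

print(n_dipeptides, expected)
print("AlaAla:", classify(9.735, expected))  # value taken from the table
print("CysCys:", classify(0.295, expected))  # value taken from the table
```

Passing `alphabet_size=21` gives the expectation for the 441-pair alphabet that includes Xaa.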

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
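As a rough sketch of how per-mille dipeptide values of this kind can be obtained and converted back to fractions, the following Python counts overlapping residue pairs. The one-letter input sequences, the normalization by the total number of dipeptides, and both function names are assumptions made for illustration; the page does not describe the exact procedure used by the database.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Count overlapping dipeptides over all sequences; return per-mille values."""
    counts = Counter()
    for seq in sequences:
        # A sequence of length n contributes n - 1 overlapping pairs.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Toy input (not real viral proteins): pairs are AA, AG, GA and GA, AA.
freqs = dipeptide_per_mille(["AAGA", "GAA"])
print(freqs)
print(per_mille_to_fraction(9.735))  # AlaAla value from the table as a fraction
```

Pairs are counted within each sequence only, so no dipeptide spans the boundary between two proteins.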

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.735 ± 3.259
AlaCys: 0.885 ± 0.752
AlaAsp: 5.31 ± 2.407
AlaGlu: 4.13 ± 1.204
AlaPhe: 1.77 ± 0.582
AlaGly: 7.67 ± 1.324
AlaHis: 1.77 ± 0.582
AlaIle: 4.13 ± 1.103
AlaLys: 2.655 ± 0.511
AlaLeu: 9.44 ± 2.949
AlaMet: 1.77 ± 0.341
AlaAsn: 4.72 ± 1.524
AlaPro: 7.375 ± 1.113
AlaGln: 2.95 ± 0.261
AlaArg: 6.785 ± 1.154
AlaSer: 5.015 ± 1.274
AlaThr: 6.195 ± 0.732
AlaVal: 5.015 ± 2.196
AlaTrp: 0.885 ± 0.17
AlaTyr: 2.655 ± 1.334
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.95 ± 0.201
CysCys: 0.295 ± 0.211
CysAsp: 0.295 ± 0.211
CysGlu: 0.0 ± 0.0
CysPhe: 0.59 ± 0.04
CysGly: 1.77 ± 0.12
CysHis: 0.885 ± 0.291
CysIle: 0.295 ± 0.211
CysLys: 0.295 ± 0.251
CysLeu: 2.36 ± 0.762
CysMet: 0.295 ± 0.211
CysAsn: 0.0 ± 0.0
CysPro: 0.885 ± 0.17
CysGln: 0.0 ± 0.0
CysArg: 1.77 ± 0.12
CysSer: 0.59 ± 0.421
CysThr: 0.295 ± 0.211
CysVal: 0.885 ± 0.17
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.835 ± 0.893
AspCys: 2.065 ± 0.552
AspAsp: 2.655 ± 0.511
AspGlu: 4.425 ± 0.07
AspPhe: 3.54 ± 0.22
AspGly: 3.245 ± 0.452
AspHis: 1.18 ± 0.542
AspIle: 4.13 ± 0.642
AspLys: 3.54 ± 0.702
AspLeu: 6.195 ± 1.114
AspMet: 1.77 ± 0.341
AspAsn: 1.77 ± 0.12
AspPro: 2.95 ± 0.662
AspGln: 0.0 ± 0.0
AspArg: 5.015 ± 2.418
AspSer: 4.13 ± 1.103
AspThr: 5.015 ± 0.351
AspVal: 3.54 ± 0.682
AspTrp: 1.475 ± 0.331
AspTyr: 2.065 ± 0.833
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.475 ± 0.792
GluCys: 0.885 ± 0.17
GluAsp: 4.72 ± 1.063
GluGlu: 2.36 ± 0.301
GluPhe: 2.655 ± 1.896
GluGly: 3.54 ± 0.702
GluHis: 0.295 ± 0.211
GluIle: 3.245 ± 0.452
GluLys: 3.245 ± 0.471
GluLeu: 5.31 ± 1.746
GluMet: 0.885 ± 0.632
GluAsn: 0.295 ± 0.251
GluPro: 2.36 ± 0.161
GluGln: 1.18 ± 1.003
GluArg: 2.36 ± 1.545
GluSer: 2.36 ± 0.161
GluThr: 2.655 ± 0.873
GluVal: 2.95 ± 0.201
GluTrp: 1.475 ± 0.331
GluTyr: 4.13 ± 0.642
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.065 ± 0.552
PheCys: 0.59 ± 0.04
PheAsp: 3.835 ± 1.414
PheGlu: 1.77 ± 0.341
PhePhe: 1.77 ± 0.802
PheGly: 4.425 ± 0.852
PheHis: 0.885 ± 0.17
PheIle: 0.885 ± 0.17
PheLys: 1.475 ± 0.331
PheLeu: 3.245 ± 0.933
PheMet: 0.885 ± 0.632
PheAsn: 1.77 ± 0.582
PhePro: 2.065 ± 1.474
PheGln: 0.885 ± 0.291
PheArg: 2.36 ± 0.161
PheSer: 2.065 ± 0.09
PheThr: 2.655 ± 0.511
PheVal: 4.425 ± 0.07
PheTrp: 0.59 ± 0.04
PheTyr: 1.475 ± 0.592
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.26 ± 2.206
GlyCys: 0.885 ± 0.632
GlyAsp: 4.72 ± 0.14
GlyGlu: 3.245 ± 0.452
GlyPhe: 3.835 ± 0.953
GlyGly: 5.31 ± 1.946
GlyHis: 2.065 ± 0.552
GlyIle: 3.835 ± 2.337
GlyLys: 2.065 ± 0.833
GlyLeu: 7.08 ± 0.441
GlyMet: 1.18 ± 0.08
GlyAsn: 1.475 ± 0.13
GlyPro: 4.425 ± 1.314
GlyGln: 2.36 ± 0.622
GlyArg: 5.605 ± 0.151
GlySer: 7.965 ± 0.773
GlyThr: 6.195 ± 1.575
GlyVal: 5.015 ± 0.572
GlyTrp: 1.77 ± 0.341
GlyTyr: 2.655 ± 0.05
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.475 ± 0.792
HisCys: 0.295 ± 0.211
HisAsp: 0.885 ± 0.291
HisGlu: 1.475 ± 0.13
HisPhe: 2.065 ± 0.552
HisGly: 2.065 ± 0.371
HisHis: 0.885 ± 0.17
HisIle: 0.59 ± 0.502
HisLys: 1.475 ± 0.331
HisLeu: 2.065 ± 0.371
HisMet: 0.295 ± 0.251
HisAsn: 1.475 ± 0.13
HisPro: 2.065 ± 1.013
HisGln: 1.18 ± 0.381
HisArg: 1.475 ± 0.13
HisSer: 0.885 ± 0.632
HisThr: 1.18 ± 0.08
HisVal: 1.77 ± 0.341
HisTrp: 0.0 ± 0.0
HisTyr: 1.18 ± 0.542
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.54 ± 0.22
IleCys: 0.59 ± 0.04
IleAsp: 4.425 ± 1.455
IleGlu: 2.655 ± 0.411
IlePhe: 2.36 ± 0.762
IleGly: 4.425 ± 0.532
IleHis: 1.475 ± 0.13
IleIle: 2.655 ± 0.411
IleLys: 1.18 ± 0.542
IleLeu: 3.835 ± 0.492
IleMet: 0.0 ± 0.0
IleAsn: 1.77 ± 0.341
IlePro: 4.13 ± 0.18
IleGln: 2.065 ± 0.09
IleArg: 2.36 ± 1.545
IleSer: 2.36 ± 0.622
IleThr: 2.065 ± 0.371
IleVal: 4.13 ± 1.103
IleTrp: 0.885 ± 0.291
IleTyr: 1.475 ± 0.13
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.655 ± 0.05
LysCys: 0.0 ± 0.0
LysAsp: 1.475 ± 0.13
LysGlu: 2.065 ± 0.552
LysPhe: 1.77 ± 0.12
LysGly: 2.36 ± 0.622
LysHis: 0.885 ± 0.291
LysIle: 0.885 ± 0.291
LysLys: 2.95 ± 0.201
LysLeu: 2.36 ± 1.083
LysMet: 0.295 ± 0.211
LysAsn: 0.295 ± 0.251
LysPro: 1.475 ± 0.13
LysGln: 1.18 ± 0.542
LysArg: 4.425 ± 0.993
LysSer: 1.77 ± 0.12
LysThr: 2.36 ± 1.545
LysVal: 4.72 ± 0.321
LysTrp: 0.885 ± 0.17
LysTyr: 1.77 ± 0.582
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.324 ± 1.856
LeuCys: 2.36 ± 0.161
LeuAsp: 6.195 ± 0.652
LeuGlu: 3.835 ± 0.953
LeuPhe: 3.835 ± 0.492
LeuGly: 5.9 ± 1.786
LeuHis: 3.245 ± 0.471
LeuIle: 4.13 ± 0.281
LeuLys: 2.95 ± 0.201
LeuLeu: 6.49 ± 2.287
LeuMet: 0.59 ± 0.04
LeuAsn: 4.425 ± 0.391
LeuPro: 7.375 ± 1.574
LeuGln: 4.13 ± 0.742
LeuArg: 5.605 ± 0.311
LeuSer: 6.785 ± 0.692
LeuThr: 5.015 ± 0.111
LeuVal: 4.425 ± 0.07
LeuTrp: 1.77 ± 0.12
LeuTyr: 2.655 ± 1.334
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.77 ± 0.802
MetCys: 0.0 ± 0.0
MetAsp: 0.59 ± 0.04
MetGlu: 0.295 ± 0.251
MetPhe: 1.18 ± 0.08
MetGly: 0.295 ± 0.211
MetHis: 0.59 ± 0.421
MetIle: 0.59 ± 0.502
MetLys: 0.59 ± 0.04
MetLeu: 1.77 ± 0.582
MetMet: 0.295 ± 0.211
MetAsn: 1.18 ± 0.381
MetPro: 1.18 ± 0.381
MetGln: 1.18 ± 0.381
MetArg: 0.885 ± 0.17
MetSer: 1.475 ± 0.592
MetThr: 0.59 ± 0.421
MetVal: 1.77 ± 0.802
MetTrp: 0.59 ± 0.421
MetTyr: 0.59 ± 0.502
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.54 ± 0.241
AsnCys: 0.295 ± 0.211
AsnAsp: 1.18 ± 0.08
AsnGlu: 1.18 ± 0.542
AsnPhe: 1.475 ± 0.331
AsnGly: 6.195 ± 1.655
AsnHis: 0.885 ± 0.632
AsnIle: 1.18 ± 0.542
AsnLys: 1.475 ± 0.792
AsnLeu: 2.95 ± 0.201
AsnMet: 1.18 ± 0.381
AsnAsn: 3.245 ± 1.855
AsnPro: 3.54 ± 0.682
AsnGln: 1.475 ± 0.13
AsnArg: 1.18 ± 0.542
AsnSer: 1.475 ± 0.13
AsnThr: 0.885 ± 0.17
AsnVal: 1.18 ± 0.842
AsnTrp: 0.295 ± 0.211
AsnTyr: 2.655 ± 0.05
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.375 ± 2.036
ProCys: 0.59 ± 0.04
ProAsp: 4.13 ± 0.642
ProGlu: 3.245 ± 0.01
ProPhe: 1.18 ± 0.542
ProGly: 5.9 ± 0.983
ProHis: 1.475 ± 0.331
ProIle: 4.425 ± 0.852
ProLys: 1.77 ± 0.582
ProLeu: 5.9 ± 2.367
ProMet: 0.885 ± 0.17
ProAsn: 2.655 ± 1.434
ProPro: 3.835 ± 1.354
ProGln: 2.655 ± 0.973
ProArg: 4.425 ± 0.391
ProSer: 4.72 ± 0.14
ProThr: 3.245 ± 0.933
ProVal: 3.835 ± 0.431
ProTrp: 0.295 ± 0.211
ProTyr: 0.885 ± 0.632
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.13 ± 0.642
GlnCys: 0.295 ± 0.211
GlnAsp: 3.54 ± 0.241
GlnGlu: 1.77 ± 0.12
GlnPhe: 0.885 ± 0.17
GlnGly: 3.54 ± 1.625
GlnHis: 1.18 ± 0.08
GlnIle: 2.36 ± 0.622
GlnLys: 0.885 ± 0.291
GlnLeu: 2.065 ± 0.552
GlnMet: 0.885 ± 0.17
GlnAsn: 0.885 ± 0.17
GlnPro: 0.885 ± 0.291
GlnGln: 1.18 ± 0.381
GlnArg: 1.77 ± 0.582
GlnSer: 1.18 ± 0.381
GlnThr: 1.18 ± 0.542
GlnVal: 1.77 ± 0.12
GlnTrp: 0.885 ± 0.291
GlnTyr: 1.77 ± 0.12
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.49 ± 0.903
ArgCys: 1.18 ± 0.542
ArgAsp: 4.13 ± 0.742
ArgGlu: 2.655 ± 1.334
ArgPhe: 1.77 ± 0.582
ArgGly: 5.605 ± 0.151
ArgHis: 1.77 ± 1.043
ArgIle: 3.835 ± 0.492
ArgLys: 2.95 ± 0.662
ArgLeu: 6.49 ± 1.364
ArgMet: 1.77 ± 0.802
ArgAsn: 2.95 ± 0.662
ArgPro: 3.245 ± 0.933
ArgGln: 2.36 ± 0.161
ArgArg: 6.785 ± 2.077
ArgSer: 3.54 ± 1.164
ArgThr: 4.72 ± 0.14
ArgVal: 4.72 ± 0.14
ArgTrp: 0.885 ± 0.17
ArgTyr: 2.95 ± 0.201
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.605 ± 1.695
SerCys: 1.18 ± 0.842
SerAsp: 3.835 ± 0.492
SerGlu: 3.54 ± 1.164
SerPhe: 3.835 ± 2.277
SerGly: 4.13 ± 0.18
SerHis: 0.59 ± 0.04
SerIle: 2.95 ± 1.585
SerLys: 2.655 ± 0.411
SerLeu: 5.015 ± 0.572
SerMet: 1.18 ± 0.08
SerAsn: 1.77 ± 0.341
SerPro: 3.54 ± 2.066
SerGln: 2.95 ± 0.201
SerArg: 3.245 ± 0.452
SerSer: 3.835 ± 0.03
SerThr: 3.54 ± 0.682
SerVal: 5.015 ± 0.572
SerTrp: 1.475 ± 0.13
SerTyr: 1.18 ± 0.381
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.31 ± 0.361
ThrCys: 0.59 ± 0.04
ThrAsp: 2.655 ± 1.796
ThrGlu: 2.655 ± 0.973
ThrPhe: 1.77 ± 0.582
ThrGly: 6.49 ± 0.903
ThrHis: 1.18 ± 0.381
ThrIle: 3.835 ± 1.815
ThrLys: 1.475 ± 0.792
ThrLeu: 5.015 ± 0.572
ThrMet: 1.475 ± 0.245
ThrAsn: 2.655 ± 0.05
ThrPro: 4.13 ± 0.18
ThrGln: 1.77 ± 0.12
ThrArg: 4.425 ± 0.391
ThrSer: 2.36 ± 0.301
ThrThr: 5.015 ± 0.572
ThrVal: 3.54 ± 0.682
ThrTrp: 1.18 ± 0.542
ThrTyr: 1.18 ± 0.542
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.72 ± 0.602
ValCys: 0.59 ± 0.04
ValAsp: 4.425 ± 0.391
ValGlu: 4.13 ± 1.103
ValPhe: 3.245 ± 1.394
ValGly: 5.015 ± 0.111
ValHis: 1.18 ± 0.542
ValIle: 2.655 ± 0.873
ValLys: 2.36 ± 0.301
ValLeu: 8.26 ± 1.283
ValMet: 0.59 ± 0.386
ValAsn: 2.36 ± 0.161
ValPro: 4.72 ± 0.14
ValGln: 1.77 ± 0.12
ValArg: 4.72 ± 0.602
ValSer: 4.72 ± 0.321
ValThr: 4.13 ± 0.642
ValVal: 2.95 ± 0.261
ValTrp: 1.18 ± 0.381
ValTyr: 2.065 ± 0.09
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.18 ± 0.381
TrpCys: 0.59 ± 0.04
TrpAsp: 0.59 ± 0.502
TrpGlu: 0.885 ± 0.17
TrpPhe: 0.0 ± 0.0
TrpGly: 0.295 ± 0.251
TrpHis: 0.295 ± 0.211
TrpIle: 0.59 ± 0.04
TrpLys: 0.0 ± 0.0
TrpLeu: 2.655 ± 0.05
TrpMet: 0.0 ± 0.0
TrpAsn: 0.295 ± 0.251
TrpPro: 0.885 ± 0.291
TrpGln: 1.18 ± 0.542
TrpArg: 2.36 ± 0.161
TrpSer: 1.475 ± 0.592
TrpThr: 0.295 ± 0.251
TrpVal: 1.77 ± 0.341
TrpTrp: 0.295 ± 0.251
TrpTyr: 1.475 ± 0.592
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.425 ± 0.532
TyrCys: 0.295 ± 0.211
TyrAsp: 2.95 ± 0.261
TyrGlu: 2.065 ± 0.09
TyrPhe: 0.59 ± 0.421
TyrGly: 1.77 ± 0.12
TyrHis: 2.065 ± 0.552
TyrIle: 1.18 ± 0.381
TyrLys: 0.59 ± 0.04
TyrLeu: 3.54 ± 0.702
TyrMet: 0.885 ± 0.17
TyrAsn: 1.475 ± 0.792
TyrPro: 2.36 ± 0.762
TyrGln: 0.59 ± 0.502
TyrArg: 3.245 ± 0.01
TyrSer: 2.36 ± 0.161
TyrThr: 1.77 ± 1.043
TyrVal: 2.36 ± 1.545
TyrTrp: 0.295 ± 0.251
TyrTyr: 2.36 ± 0.161
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3391 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
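For readers unfamiliar with the idea, a protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values serves as the error. This is a minimal sketch under stated assumptions, not the database's actual code: the function names, the fixed seed, the one-letter input sequences, and the use of the population standard deviation as the reported error are all hypothetical choices.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs in all sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_sd(proteins, pair, n_rounds=100, seed=0):
    """Spread of a dipeptide frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (not real viral proteins).
toy_proteins = ["AAGAAG", "GGGAGG"]
print(bootstrap_sd(toy_proteins, "AA"))
```

With only two proteins, as on this page, each resample is one of very few possible combinations, so the resulting error estimates are necessarily coarse.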

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski