Amino acid dipepetide frequency for Planaria asexual strain-specific virus-like element type 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.107AlaAla: 1.107 ± 0.618
1.107AlaCys: 1.107 ± 0.676
2.215AlaAsp: 2.215 ± 0.668
2.769AlaGlu: 2.769 ± 0.688
3.322AlaPhe: 3.322 ± 0.978
4.983AlaGly: 4.983 ± 1.743
2.769AlaHis: 2.769 ± 0.532
3.876AlaIle: 3.876 ± 1.014
3.322AlaLys: 3.322 ± 1.531
2.769AlaLeu: 2.769 ± 1.104
0.554AlaMet: 0.554 ± 0.645
2.215AlaAsn: 2.215 ± 0.627
2.769AlaPro: 2.769 ± 1.214
0.554AlaGln: 0.554 ± 0.568
1.107AlaArg: 1.107 ± 0.571
2.769AlaSer: 2.769 ± 0.953
1.661AlaThr: 1.661 ± 0.609
1.661AlaVal: 1.661 ± 0.735
0.554AlaTrp: 0.554 ± 0.499
1.107AlaTyr: 1.107 ± 0.779
0.0AlaXaa: 0.0 ± 0.0
Cys
0.554CysAla: 0.554 ± 0.645
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.554CysGlu: 0.554 ± 0.499
0.0CysPhe: 0.0 ± 0.0
0.554CysGly: 0.554 ± 0.645
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.661CysLys: 1.661 ± 0.867
1.661CysLeu: 1.661 ± 1.663
0.554CysMet: 0.554 ± 0.567
0.554CysAsn: 0.554 ± 0.645
1.107CysPro: 1.107 ± 0.522
0.554CysGln: 0.554 ± 0.554
2.215CysArg: 2.215 ± 0.767
0.0CysSer: 0.0 ± 0.0
1.107CysThr: 1.107 ± 1.135
1.661CysVal: 1.661 ± 0.586
0.0CysTrp: 0.0 ± 0.0
0.554CysTyr: 0.554 ± 0.499
0.0CysXaa: 0.0 ± 0.0
Asp
1.107AspAla: 1.107 ± 0.552
0.554AspCys: 0.554 ± 0.568
1.661AspAsp: 1.661 ± 0.7
3.322AspGlu: 3.322 ± 0.737
1.661AspPhe: 1.661 ± 1.044
3.322AspGly: 3.322 ± 1.403
0.554AspHis: 0.554 ± 0.645
3.322AspIle: 3.322 ± 1.027
4.43AspLys: 4.43 ± 1.703
3.322AspLeu: 3.322 ± 2.338
1.661AspMet: 1.661 ± 0.597
3.322AspAsn: 3.322 ± 1.412
1.661AspPro: 1.661 ± 0.7
2.215AspGln: 2.215 ± 1.029
0.554AspArg: 0.554 ± 0.39
1.661AspSer: 1.661 ± 1.343
2.769AspThr: 2.769 ± 0.884
4.43AspVal: 4.43 ± 1.924
0.554AspTrp: 0.554 ± 0.583
2.769AspTyr: 2.769 ± 1.374
0.0AspXaa: 0.0 ± 0.0
Glu
2.215GluAla: 2.215 ± 1.115
2.215GluCys: 2.215 ± 1.008
3.876GluAsp: 3.876 ± 0.666
4.43GluGlu: 4.43 ± 2.562
3.876GluPhe: 3.876 ± 2.166
3.322GluGly: 3.322 ± 1.347
0.554GluHis: 0.554 ± 0.499
3.322GluIle: 3.322 ± 1.194
5.537GluLys: 5.537 ± 1.352
2.769GluLeu: 2.769 ± 0.532
1.661GluMet: 1.661 ± 0.7
3.322GluAsn: 3.322 ± 1.33
0.554GluPro: 0.554 ± 0.583
2.769GluGln: 2.769 ± 0.933
4.983GluArg: 4.983 ± 2.184
2.769GluSer: 2.769 ± 0.928
2.769GluThr: 2.769 ± 1.053
2.215GluVal: 2.215 ± 0.681
1.107GluTrp: 1.107 ± 0.571
1.661GluTyr: 1.661 ± 0.916
0.0GluXaa: 0.0 ± 0.0
Phe
1.661PheAla: 1.661 ± 1.144
0.554PheCys: 0.554 ± 0.645
1.661PheAsp: 1.661 ± 0.984
3.876PheGlu: 3.876 ± 1.742
2.215PhePhe: 2.215 ± 1.237
2.769PheGly: 2.769 ± 1.053
0.0PheHis: 0.0 ± 0.0
3.322PheIle: 3.322 ± 0.543
5.537PheLys: 5.537 ± 1.226
1.661PheLeu: 1.661 ± 0.766
2.215PheMet: 2.215 ± 0.652
2.215PheAsn: 2.215 ± 1.206
3.322PhePro: 3.322 ± 0.358
3.322PheGln: 3.322 ± 1.67
2.769PheArg: 2.769 ± 1.194
1.661PheSer: 1.661 ± 1.343
2.215PheThr: 2.215 ± 1.388
5.537PheVal: 5.537 ± 2.967
1.107PheTrp: 1.107 ± 0.82
0.554PheTyr: 0.554 ± 0.499
0.0PheXaa: 0.0 ± 0.0
Gly
2.769GlyAla: 2.769 ± 0.985
0.0GlyCys: 0.0 ± 0.0
1.107GlyAsp: 1.107 ± 0.552
1.661GlyGlu: 1.661 ± 1.07
2.769GlyPhe: 2.769 ± 2.224
3.876GlyGly: 3.876 ± 1.531
0.554GlyHis: 0.554 ± 0.39
3.876GlyIle: 3.876 ± 2.214
5.537GlyLys: 5.537 ± 2.159
4.983GlyLeu: 4.983 ± 0.545
1.107GlyMet: 1.107 ± 0.676
0.554GlyAsn: 0.554 ± 0.645
3.322GlyPro: 3.322 ± 1.789
3.322GlyGln: 3.322 ± 1.673
2.769GlyArg: 2.769 ± 1.023
3.322GlySer: 3.322 ± 1.722
4.983GlyThr: 4.983 ± 1.982
7.198GlyVal: 7.198 ± 1.108
1.661GlyTrp: 1.661 ± 0.586
0.554GlyTyr: 0.554 ± 0.554
0.0GlyXaa: 0.0 ± 0.0
His
1.107HisAla: 1.107 ± 0.715
0.0HisCys: 0.0 ± 0.0
0.554HisAsp: 0.554 ± 0.568
0.0HisGlu: 0.0 ± 0.0
2.215HisPhe: 2.215 ± 0.681
1.661HisGly: 1.661 ± 0.609
1.107HisHis: 1.107 ± 0.558
2.215HisIle: 2.215 ± 1.115
1.661HisLys: 1.661 ± 0.609
2.215HisLeu: 2.215 ± 0.861
1.661HisMet: 1.661 ± 1.063
2.769HisAsn: 2.769 ± 1.009
0.0HisPro: 0.0 ± 0.0
2.215HisGln: 2.215 ± 1.242
1.107HisArg: 1.107 ± 0.618
2.769HisSer: 2.769 ± 1.895
0.554HisThr: 0.554 ± 0.39
1.661HisVal: 1.661 ± 0.858
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.215IleAla: 2.215 ± 1.068
2.215IleCys: 2.215 ± 0.988
0.554IleAsp: 0.554 ± 0.39
3.876IleGlu: 3.876 ± 0.637
1.661IlePhe: 1.661 ± 0.621
2.769IleGly: 2.769 ± 1.053
1.661IleHis: 1.661 ± 1.169
5.537IleIle: 5.537 ± 2.117
4.983IleLys: 4.983 ± 0.94
9.413IleLeu: 9.413 ± 2.635
1.661IleMet: 1.661 ± 0.75
4.43IleAsn: 4.43 ± 0.646
6.091IlePro: 6.091 ± 1.978
5.537IleGln: 5.537 ± 0.779
3.876IleArg: 3.876 ± 1.032
4.983IleSer: 4.983 ± 1.564
2.769IleThr: 2.769 ± 0.551
1.661IleVal: 1.661 ± 0.766
0.0IleTrp: 0.0 ± 0.0
2.215IleTyr: 2.215 ± 0.953
0.0IleXaa: 0.0 ± 0.0
Lys
3.322LysAla: 3.322 ± 0.827
0.0LysCys: 0.0 ± 0.0
4.983LysAsp: 4.983 ± 1.676
2.769LysGlu: 2.769 ± 1.117
4.43LysPhe: 4.43 ± 1.907
2.215LysGly: 2.215 ± 1.16
2.769LysHis: 2.769 ± 1.05
6.645LysIle: 6.645 ± 1.936
3.876LysLys: 3.876 ± 1.192
4.983LysLeu: 4.983 ± 1.457
2.215LysMet: 2.215 ± 0.668
5.537LysAsn: 5.537 ± 1.81
4.43LysPro: 4.43 ± 1.989
3.322LysGln: 3.322 ± 0.887
4.43LysArg: 4.43 ± 1.828
5.537LysSer: 5.537 ± 0.977
6.091LysThr: 6.091 ± 1.554
1.661LysVal: 1.661 ± 0.638
1.107LysTrp: 1.107 ± 0.676
2.769LysTyr: 2.769 ± 0.688
0.0LysXaa: 0.0 ± 0.0
Leu
2.215LeuAla: 2.215 ± 0.681
0.554LeuCys: 0.554 ± 0.645
5.537LeuAsp: 5.537 ± 2.521
3.876LeuGlu: 3.876 ± 2.148
3.322LeuPhe: 3.322 ± 1.864
4.983LeuGly: 4.983 ± 1.535
2.215LeuHis: 2.215 ± 1.352
6.645LeuIle: 6.645 ± 2.722
3.322LeuLys: 3.322 ± 1.027
6.091LeuLeu: 6.091 ± 2.428
3.322LeuMet: 3.322 ± 1.691
4.983LeuAsn: 4.983 ± 0.989
3.876LeuPro: 3.876 ± 1.647
3.876LeuGln: 3.876 ± 2.61
6.645LeuArg: 6.645 ± 1.508
6.645LeuSer: 6.645 ± 1.508
4.43LeuThr: 4.43 ± 1.367
2.215LeuVal: 2.215 ± 0.817
0.0LeuTrp: 0.0 ± 0.0
4.43LeuTyr: 4.43 ± 0.851
0.0LeuXaa: 0.0 ± 0.0
Met
1.107MetAla: 1.107 ± 0.618
0.0MetCys: 0.0 ± 0.0
2.215MetAsp: 2.215 ± 1.528
2.769MetGlu: 2.769 ± 1.2
0.554MetPhe: 0.554 ± 0.499
1.661MetGly: 1.661 ± 0.621
1.661MetHis: 1.661 ± 0.603
0.0MetIle: 0.0 ± 0.0
2.215MetLys: 2.215 ± 0.731
3.876MetLeu: 3.876 ± 2.943
0.554MetMet: 0.554 ± 0.499
2.215MetAsn: 2.215 ± 1.236
1.661MetPro: 1.661 ± 0.916
1.661MetGln: 1.661 ± 0.708
1.661MetArg: 1.661 ± 1.07
1.661MetSer: 1.661 ± 0.609
2.769MetThr: 2.769 ± 1.252
3.322MetVal: 3.322 ± 1.363
0.0MetTrp: 0.0 ± 0.0
0.554MetTyr: 0.554 ± 0.568
0.0MetXaa: 0.0 ± 0.0
Asn
2.215AsnAla: 2.215 ± 1.528
2.215AsnCys: 2.215 ± 1.905
2.769AsnAsp: 2.769 ± 1.603
3.876AsnGlu: 3.876 ± 1.192
4.983AsnPhe: 4.983 ± 0.833
2.215AsnGly: 2.215 ± 0.767
0.0AsnHis: 0.0 ± 0.0
5.537AsnIle: 5.537 ± 2.455
5.537AsnLys: 5.537 ± 1.354
8.306AsnLeu: 8.306 ± 1.054
1.661AsnMet: 1.661 ± 0.698
4.43AsnAsn: 4.43 ± 1.715
3.876AsnPro: 3.876 ± 2.214
1.661AsnGln: 1.661 ± 0.785
1.661AsnArg: 1.661 ± 0.757
3.876AsnSer: 3.876 ± 1.091
2.215AsnThr: 2.215 ± 1.16
4.43AsnVal: 4.43 ± 2.456
0.0AsnTrp: 0.0 ± 0.0
1.661AsnTyr: 1.661 ± 0.808
0.0AsnXaa: 0.0 ± 0.0
Pro
3.322ProAla: 3.322 ± 1.271
0.0ProCys: 0.0 ± 0.0
0.554ProAsp: 0.554 ± 0.645
2.215ProGlu: 2.215 ± 1.029
1.661ProPhe: 1.661 ± 0.708
2.769ProGly: 2.769 ± 1.422
2.769ProHis: 2.769 ± 1.004
3.322ProIle: 3.322 ± 1.471
3.322ProLys: 3.322 ± 2.338
6.091ProLeu: 6.091 ± 1.672
1.107ProMet: 1.107 ± 0.558
3.876ProAsn: 3.876 ± 1.3
6.091ProPro: 6.091 ± 2.628
3.876ProGln: 3.876 ± 2.157
6.645ProArg: 6.645 ± 1.997
5.537ProSer: 5.537 ± 1.665
5.537ProThr: 5.537 ± 1.693
2.215ProVal: 2.215 ± 0.82
2.215ProTrp: 2.215 ± 1.322
2.215ProTyr: 2.215 ± 1.195
0.0ProXaa: 0.0 ± 0.0
Gln
4.43GlnAla: 4.43 ± 1.177
0.554GlnCys: 0.554 ± 0.554
3.322GlnAsp: 3.322 ± 0.666
3.322GlnGlu: 3.322 ± 0.892
1.107GlnPhe: 1.107 ± 0.558
0.554GlnGly: 0.554 ± 0.645
0.554GlnHis: 0.554 ± 0.39
4.983GlnIle: 4.983 ± 1.348
4.983GlnLys: 4.983 ± 1.495
2.769GlnLeu: 2.769 ± 1.646
1.107GlnMet: 1.107 ± 1.166
2.769GlnAsn: 2.769 ± 0.741
6.645GlnPro: 6.645 ± 1.503
1.107GlnGln: 1.107 ± 0.558
2.215GlnArg: 2.215 ± 1.242
0.0GlnSer: 0.0 ± 0.0
2.215GlnThr: 2.215 ± 1.16
2.215GlnVal: 2.215 ± 1.16
0.0GlnTrp: 0.0 ± 0.0
3.876GlnTyr: 3.876 ± 1.689
0.0GlnXaa: 0.0 ± 0.0
Arg
3.876ArgAla: 3.876 ± 1.468
0.554ArgCys: 0.554 ± 0.39
2.215ArgAsp: 2.215 ± 1.413
4.983ArgGlu: 4.983 ± 1.183
2.769ArgPhe: 2.769 ± 1.334
1.661ArgGly: 1.661 ± 0.823
2.215ArgHis: 2.215 ± 1.19
3.322ArgIle: 3.322 ± 0.943
1.661ArgLys: 1.661 ± 0.796
4.983ArgLeu: 4.983 ± 1.538
3.322ArgMet: 3.322 ± 1.714
4.43ArgAsn: 4.43 ± 1.989
3.876ArgPro: 3.876 ± 0.863
1.661ArgGln: 1.661 ± 0.757
4.43ArgArg: 4.43 ± 1.297
3.322ArgSer: 3.322 ± 1.127
2.215ArgThr: 2.215 ± 1.055
2.215ArgVal: 2.215 ± 1.142
0.554ArgTrp: 0.554 ± 0.568
3.876ArgTyr: 3.876 ± 1.492
0.0ArgXaa: 0.0 ± 0.0
Ser
2.769SerAla: 2.769 ± 1.208
1.107SerCys: 1.107 ± 0.558
1.661SerAsp: 1.661 ± 0.839
1.107SerGlu: 1.107 ± 0.571
2.215SerPhe: 2.215 ± 0.731
4.43SerGly: 4.43 ± 1.674
2.215SerHis: 2.215 ± 1.239
2.769SerIle: 2.769 ± 1.088
2.769SerLys: 2.769 ± 1.523
4.43SerLeu: 4.43 ± 1.825
3.322SerMet: 3.322 ± 2.013
3.876SerAsn: 3.876 ± 0.527
3.876SerPro: 3.876 ± 1.561
2.215SerGln: 2.215 ± 0.607
1.107SerArg: 1.107 ± 0.618
4.983SerSer: 4.983 ± 1.197
4.43SerThr: 4.43 ± 0.715
4.43SerVal: 4.43 ± 1.481
0.554SerTrp: 0.554 ± 0.568
3.876SerTyr: 3.876 ± 1.216
0.0SerXaa: 0.0 ± 0.0
Thr
2.769ThrAla: 2.769 ± 1.103
0.554ThrCys: 0.554 ± 0.568
4.983ThrAsp: 4.983 ± 0.833
2.769ThrGlu: 2.769 ± 1.02
3.322ThrPhe: 3.322 ± 1.903
3.322ThrGly: 3.322 ± 1.396
1.107ThrHis: 1.107 ± 0.694
2.769ThrIle: 2.769 ± 1.02
6.091ThrLys: 6.091 ± 2.06
1.661ThrLeu: 1.661 ± 0.785
1.661ThrMet: 1.661 ± 0.858
3.876ThrAsn: 3.876 ± 1.076
7.198ThrPro: 7.198 ± 3.409
3.876ThrGln: 3.876 ± 1.152
2.215ThrArg: 2.215 ± 0.956
2.769ThrSer: 2.769 ± 1.411
3.876ThrThr: 3.876 ± 0.666
3.876ThrVal: 3.876 ± 1.177
0.554ThrTrp: 0.554 ± 0.568
1.107ThrTyr: 1.107 ± 0.552
0.0ThrXaa: 0.0 ± 0.0
Val
1.107ValAla: 1.107 ± 0.571
0.554ValCys: 0.554 ± 0.499
2.769ValAsp: 2.769 ± 1.26
4.983ValGlu: 4.983 ± 1.502
3.322ValPhe: 3.322 ± 1.707
6.091ValGly: 6.091 ± 2.058
2.215ValHis: 2.215 ± 0.861
3.876ValIle: 3.876 ± 2.145
3.322ValLys: 3.322 ± 1.181
4.43ValLeu: 4.43 ± 1.244
1.661ValMet: 1.661 ± 0.708
3.876ValAsn: 3.876 ± 1.481
2.769ValPro: 2.769 ± 1.234
3.876ValGln: 3.876 ± 1.008
2.215ValArg: 2.215 ± 0.731
2.769ValSer: 2.769 ± 1.107
4.43ValThr: 4.43 ± 1.434
2.769ValVal: 2.769 ± 1.069
0.554ValTrp: 0.554 ± 0.499
1.661ValTyr: 1.661 ± 1.153
0.0ValXaa: 0.0 ± 0.0
Trp
1.107TrpAla: 1.107 ± 0.676
0.0TrpCys: 0.0 ± 0.0
0.554TrpAsp: 0.554 ± 0.499
0.0TrpGlu: 0.0 ± 0.0
0.554TrpPhe: 0.554 ± 0.568
1.107TrpGly: 1.107 ± 0.558
0.0TrpHis: 0.0 ± 0.0
0.554TrpIle: 0.554 ± 0.583
1.661TrpLys: 1.661 ± 1.039
0.554TrpLeu: 0.554 ± 0.645
0.0TrpMet: 0.0 ± 0.0
1.107TrpAsn: 1.107 ± 0.715
1.107TrpPro: 1.107 ± 1.135
0.0TrpGln: 0.0 ± 0.0
0.554TrpArg: 0.554 ± 0.568
0.554TrpSer: 0.554 ± 0.39
0.0TrpThr: 0.0 ± 0.0
1.107TrpVal: 1.107 ± 1.135
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.215TyrAla: 2.215 ± 0.693
0.554TyrCys: 0.554 ± 0.568
1.661TyrAsp: 1.661 ± 1.241
2.769TyrGlu: 2.769 ± 1.43
2.215TyrPhe: 2.215 ± 0.817
1.661TyrGly: 1.661 ± 1.179
0.554TyrHis: 0.554 ± 0.583
2.215TyrIle: 2.215 ± 0.612
1.661TyrLys: 1.661 ± 1.039
2.215TyrLeu: 2.215 ± 1.831
0.554TyrMet: 0.554 ± 0.696
2.769TyrAsn: 2.769 ± 0.622
1.107TyrPro: 1.107 ± 0.552
1.107TyrGln: 1.107 ± 0.916
4.983TyrArg: 4.983 ± 1.434
0.554TyrSer: 0.554 ± 0.39
3.322TyrThr: 3.322 ± 1.194
2.769TyrVal: 2.769 ± 0.722
0.0TyrTrp: 0.0 ± 0.0
2.215TyrTyr: 2.215 ± 0.984
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1807 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski