-
Notifications
You must be signed in to change notification settings - Fork 0
/
rannismain2021.tex
496 lines (429 loc) · 20.3 KB
/
rannismain2021.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
% -*- coding: utf-8 -*-
\documentclass{rannis}
\usepackage{xargs}
\usepackage{ifthen}
\input{patternmacros}
\usepackage{todonotes}
\usepackage{amsmath}
\usepackage{amsthm}
\usepackage{subcaption}
\newcommand{\motheralg}{\textsf{Combex}}
\newcommand{\explofmothername}{\underline{Comb}inatorial \underline{Ex}ploration}
\newcommand{\tilealg}{\textsf{PermScope}}
\newcommand{\partialg}{\textsf{PartiScope}}
\newcommand{\bisc}{\textsf{BiSC}}
\newcommand{\struct}{\textsf{Struct}}
\newcommand{\yearofappl}{2021}
\newcommand{\yearofapplp}{2022}
\newcommand{\yearofapplpp}{2023}
\newcommand{\yearofappls}{21}
\newcommand{\yearofapplsp}{22}
\newcommand{\yearofapplspp}{23}
\newcommand{\disjoint}{\sqcup}
\newcommand{\eqg}{\cong}
% mathcal letters
\newcommand{\mc}[1]{\mathcal{#1}}
\newcommand{\mb}[1]{\mathbb{#1}}
\newcommand{\cA}{{\mathcal{A}}}
\newcommand{\cB}{{\mathcal{B}}}
\newcommand{\cC}{{\mathcal{C}}}
\newcommand{\cD}{{\mathcal{D}}}
\newcommand{\cE}{{\mathcal{E}}}
\newcommand{\cI}{{\mathcal{I}}}
\newcommand{\cS}{{\mathcal{S}}}
\newcommand{\perms}{\mathcal{S}}
\newcommand{\gclass}{\mathcal{G}}
\newcommand{\Av}{\mathrm{Av}}
\newcommand{\Co}{\mathrm{Co}}
% For tikz pictures
\usepackage{tikz}
\usetikzlibrary{matrix,arrows}
\usetikzlibrary{positioning}
\usetikzlibrary{fit}
\usetikzlibrary{patterns}
\newcommand{\imopattern}[6]{ % mesh pattern with white and black dots overlapping
\raisebox{0.6ex}{
\begin{tikzpicture}[scale=0.35, baseline=(current bounding box.center), #1]
\foreach \x/\y in {#6}
\fill[pattern=north east lines] (\x,\y) rectangle +(1,1);
\draw (0.01,0.01) grid (#2+0.99,#2+0.99);
\foreach \x/\y in {#4}
\draw[fill=white] (\x,\y) circle (6pt);
\foreach \x/\y in {#5}
\draw[fill=white] (\x,\y) circle (14pt);
\foreach \x/\y in {#3}
\filldraw (\x,\y) circle (6pt);
\end{tikzpicture}
}
}
% Some environments
\newtheorem{theorem}{Theorem}
\newtheorem{conjecture}[theorem]{Conjecture}
\newtheorem*{conjecture*}{Conjecture}
\theoremstyle{definition}
\newtheorem{example}[theorem]{Example}
\newtheorem{definition}[theorem]{Definition}
\newenvironment{packed_item}{
\begin{itemize}
\setlength{\itemsep}{-1pt}
\setlength{\parskip}{-1pt}
\setlength{\parsep}{-1pt}
}{\end{itemize}}
\newenvironment{packed_enum}{
\begin{enumerate}
\setlength{\itemsep}{-1pt}
\setlength{\parskip}{-1pt}
\setlength{\parsep}{-1pt}
}{\end{enumerate}}
\usepackage[utf8]{inputenc}
% Language and input.
\usepackage[icelandic,british]{babel}
\usepackage{wrapfig}
\usepackage{graphicx}
\title{Extending the Combex framework}
\author{\'Emile Nadeau}
\begin{document}
\newcommand{\lastpageref}{\pageref{LastPageOfDescription}}
\thispagestyle{empty}
\maketitle
\newpage
\section*{Guidelines}\label{guidelines}
\noindent
Please note that \emph{all proposals and appendices} should be in
English.\\
\noindent
The text in angle brackets (<text>) in this document is provided for guidance
purposes only, and should be overwritten, by applicant, with the \mbox{requested information}.
Please insert appropriate material (i.e.\ text, pictures and/or tables)
and add the principal investigator's name to the header and the
project's name to the footer.\\
\noindent
The detailed project description should provide the following
information, and be divided into the following sections (the order and
titles should not be changed):\\
{\it
\begin{enumerate}
\def\labelenumi{\Alph{enumi}.}
\setlength\itemsep{-0.5em}
\item
\begin{quote}
Specific aims of the project, research questions/hypotheses,
feasibility, originality and impact
\end{quote}
\item
\begin{quote}
Present state of knowledge in the field
\end{quote}
\item
\begin{quote}
Research plan (time and work plan, present status of project,
methodology and milestones) and deliverables
\end{quote}
\item
\begin{quote}
Management and co-operation (domestic/foreign)
\end{quote}
\item
\begin{quote}
Proposed publication of results and data storage (including open
access policy)
\end{quote}
\item
\begin{quote}
Contribution of doctoral and master's degree students to the
project\\
\end{quote}
\end{enumerate}
}
\noindent
The project description should not exceed \textbf{5 pages} plus
front page and guidelines (1.5 line spacing, 12 point Times/Times New
Roman, or similar, 2.5 cm side margins). The \mbox{bibliography} is submitted
in a separate file with no length limits. Convert the file to pdf before submitting. Please note:
\begin{itemize}
\setlength\itemsep{-0.5em}
\item[$\rightarrow$]
\emph{\textbf{Incomplete proposals will be rejected}}
\item[$\rightarrow$]
\emph{\textbf{Proposals which exceed length limitations will be
rejected without review}}
\item[$\rightarrow$]
\emph{\textbf{Corrections or amendments after the previously announced
deadline will be rejected}}
\item[$\rightarrow$]
\emph{\textbf{The clarity and the overall quality of the presentation
are taken into consideration when proposals are reviewed}}
\item[$\rightarrow$]
\emph{\textbf{Applications that are not submitted using the most
recent application form available will be rejected}}\\
\end{itemize}
\noindent
For further information please refer to the \emph{\textbf{Icelandic
Research Fund Handbook 2021.}}\\
\emph{\textbf{Please do not delete, overwrite, or amend this Guideline
page in your submission\\
}}
\newpage
\spacing{1}
\section{Specific aims of the project, research questions/hypotheses, feasibility, originality and impact}
\spacing{1.5}
\emph{Combinatorial exploration} is an experimental approach that
rigorously derives structural results about mathematical objects.
When a human has discovered the structure of an object there
are several tools, often automatic, which allow various properties of the object
to be computed. However, the steps from the problem statement and the original
object to the structure is often ad-hoc.
A framework is being developed at Reykjavik University to automate and formalize
these steps.
A beta version of the framework, called
\motheralg{} (\explofmothername) exists. It has mainly been applied in
the field of permutation patterns and is called in that context the
\tilealg{} algorithm. This algorithm has been able
to discover new theorems and rediscover several results
spanning dozens of papers in the research literature.
We propose to integrate techniques from machine learning to enhance the beta,
as well as improving the algorithm to find more general descriptions
of mathematical objects.
The outcome of this proposal will be
publications in journals and presentations at international
conferences. The implementation of
these theoretical algorithms will be made available free and open source and
will be accompanied by a comprehensive manual.
\subsection*{The \motheralg{} framework}
\motheralg{} performs combinatorial
exploration in two main stages. First is the expansion stage, where \emph{strategies} are
applied to the objects of interest, resulting in a universe of rules
relating the objects. The second stage is to check if there exists
a \emph{specification} for the original object of interest
within this set of rules. The term \emph{specification} has a technical meaning,
but we can think of it as a system of equations describing the structure of an
object. If a specification exists in the universe \motheralg{} is guaranteed to
find it.
If no specifications are found, \motheralg{} re-enters the expansion stage,
enlarging the universe, until searching again, \emph{etc}.
We will show below how \motheralg{} has been applied to the field of permutation
patterns, but ask the reader to remember that the framework can be applied in
other fields.
As is to be expected with a beta version there is room for improvement. In
particular, even though the
framework is guaranteed to find a specification in the universe if
one exists, there are cases where the universe contains enough information to
describe the original object, without any specification existing for it.
There is no existing theory for this phenomenon but we say
that the universe contains a \emph{combinatorial system} for the object.
Even if some combinatorial systems have been found by hand~\cite{BeanPhd},
there is currently no way to find them automatically.
The proposer will develop heuristics and algorithms to find combinatorial systems.
\subsection*{The \tilealg{} algorithm}
A \emph{permutation} is a reordering of the integers $1, 2, \dotsc, n$
for some $n$. Although
these are simple objects, they have deep connections to other fields. Famous
examples include Schubert varieties~\cite{MR1051089}, posets~\cite{MR2652101},
water waves~\cite{MR2813307}, genome rearrangements~\cite{MR2518996}, planar maps~\cite{CKS09}
meanders~\cite{fukuda:042202}, and sorting operators~\cite{MR0445948}.
A permutation $\pi$ \emph{contains} the permutation $p$ if certain technical
conditions are satisfied. We only give an example here: The permutation $\pi = 526413$
contains $p = 132$, in the subsequences $264$, $263$ and $243$; see
Figure~\ref{fig:perm526413} where we have additionally circled the occurrences of $p$.
\begin{figure}[htbp]
\begin{center}
\imopattern{scale=0.7}{ 6 }{ 1/5, 2/2, 3/6, 4/4, 5/1, 6/3 }
{}{ 2/2, 3/6, 4/4 }{}\qquad
\imopattern{scale=0.7}{ 6 }{ 1/5, 2/2, 3/6, 4/4, 5/1, 6/3 }
{}{ 2/2, 3/6, 6/3 }{}\qquad
\imopattern{scale=0.7}{ 6 }{ 1/5, 2/2, 3/6, 4/4, 5/1, 6/3 }
{}{ 2/2, 4/4, 6/3 }{}
\caption{The permutation $526413$ and three occurrences of the $132$}%
\label{fig:perm526413}
\end{center}
\end{figure}
\noindent
The same permutation \emph{avoids} $123$, as it has no
increasing subsequence of length three. In this context we refer to $p$ as a
\emph{pattern}. The main object of study in this field is
the (usually infinite) set of permutations that avoid a given pattern, denoted $\Av(p)$,
and called a \emph{permutation class}.
Domain-specific knowledge from the field of permutation
patterns was added to \motheralg{} to create the
\tilealg{} algorithm.
When \tilealg{} succeeds it produces a specification in the form of a tree,
shown in Figure~\ref{fig:123_1432}, for the permutations avoiding the patterns $123$
and $1432$, first enumerated by West~\cite{West:1996cy}.
\begin{figure}[ht]
\centering
\begin{tikzpicture}[scale = 0.7, vertex/.style = {rectangle,draw = black!50,fill = gray!5,thick}, auto]
\node [vertex] (root) at (0,0) {$\cB$};
\node [vertex] (empty) at (2,-2) {$\cE$};
\node [vertex] (nempty) at (2,0) {$\metapatt{0.4}{1/1}{1/v,1/h}{}{1/1/p}[0/1,1/1][0/0/1/1/$\cD$,1/0/2/1/$\cB$]$};
\node [vertex] (nemptyl) at (5,0) {$\metapatt{0.4}{1/1}{1/v,1/h}{}{1/1/p}[0/1,1/1,0/0][1/0/2/1/$\cB$]$};
\node [vertex] (nemptyr) at (5,-2.5) {$\metapatt{0.4}{2/2}{1/v,2/v,1/h,2/h}{}{1/1/p,2/2/p}[0/2,1/2,2/2,0/0,1/0,1/1][0/1/1/2/$\cD$,2/1/3/2/$\cD$,2/0/3/1/$\cB$]$};
\node [vertex] (nemptyrl) at (9,-3) {$\metapatt{0.4}{2/2}{1/v,2/v,1/h,2/h}{}{1/1/p,2/2/p}[0/2,1/2,2/2,0/0,1/0,1/1,2/1][0/1/1/2/$\cD$,2/0/3/1/$\cB$]$};
\node [vertex] (nemptyrr) at (9,0) {$\metapatt{0.4}{4/3}{1/v,2/v,3/v,4/v,1/h,2/h,3/h}{}{2/1/p,3/3/p,4/2/p}[0/3,1/3,2/3,3/3,0/0,1/0,0/1,1/2,2/1,2/2,3/1,3/2,2/0,4/1,4/2,4/3][1/1/2/2/$\cD$,0/2/1/3/$\cD$,3/0/4/1/$\cD$,4/0/5/1/$\cB$]$};
\node [vertex] (factorl) at (13,.2) {$\metapatt{0.4}{3/3}{1/v,2/v,3/v,1/h,2/h,3/h}{}{2/1/p,3/3/p}[0/3,1/3,2/3,3/3,0/0,1/0,0/1,1/2,2/1,2/2,3/0,3/1,3/2,2/0][1/1/2/2/$\cD$,0/2/1/3/$\cD$]$};
\node [vertex] (factorr) at (13.5,-2.5) {$\metapatt{0.4}{1/1}{1/v,1/h}{}{1/1/p}[0/1,1/1][0/0/1/1/$\cD$,1/0/2/1/$\cB$]$};
\draw [-,semithick] (root) to (empty);
\draw [-,semithick] (root) to (nempty);
\draw [-,semithick] (nempty) to (nemptyl);
\draw [-,semithick] (nempty) to (nemptyr);
\draw [-,semithick] (nemptyr) to (nemptyrl);
\draw [-,semithick] (nemptyr) to (nemptyrr);
\draw [-,semithick] (nemptyrr) to (factorl);
\draw [-,semithick] (nemptyrr) to (factorr);
\end{tikzpicture}
\caption{The structure of $\Av(123, 1432)$ in terms of simpler substructures}%
\label{fig:123_1432}
\end{figure}
So far, the \tilealg{} algorithm has proved to be a excellent
tool, providing structural and enumerative results that together subsume over
a dozen of existing research papers.
The proposer will undertake the creation
of a generalization of the \tilealg{} algorithm that describes the structure of permutation
classes in a finer way by accounting for permutation statistics.
The study of permutation statistics is an active area of research in
combinatorics~\cite{bukata2019statisitics, dokos2012statistics,
babson2000vincular,branden2011meshpattern}.
and similar research is being done more widely on
statistics of other combinatorial objects~\cite{findstat, MR4011551, MR4109929}.
One important example of a
permutation statistic is the inversion number. An \emph{inversion} is a pair of
entries in a permutation such that the leftmost entry of the pair has value
greater than the rightmost entry on the pair. The \emph{inversion number} of a
permutation is the number of pairs of entries it has that form an inversion;
as such, the
inversion number is a function from the set of all permutations to $\mathbb{N}$.
For example, the inversions of the permutation $526413$
(see Figure~\ref{fig:perm526413}) are $52, 54,51,53,21,64,61,63,41$ and $43$.
Hence, the inversion count of $526413$ is $10$.
More generally, a \emph{permutation statistic} is any function from the set of
all permutations to $\mathbb{N}$.
\subsection*{Machine learning and heuristics}
The \motheralg{} beta is a naive breadth-first search algorithm. The
general objectives which the algorithm is trying to accomplish have clear
analogies with other tree-search paradigms. For instance
\emph{iterative deepening}~\cite{KORF198597,russell2002artificial},
i.e., pursuing a search deeper at promising nodes
before returning to a generally breadth-first approach. Since there is no
existing expert knowledge in the field to determine how to evaluate what
constitutes a ``promising'' node this approach would need to be coupled with
machine learning of such evaluation functions based on the beta's
experience over a wide variety of training examples, both successful and
unsuccessful.
\subsection*{Feasibility, originality, and impact}
Given a
specification for a mathematical object there are methods, often automatic, for
understanding the object structurally and enumeratively.
To find a
specification for an object, mathematicians have often
used ad-hoc techniques and the theory has not been developed as much.
\motheralg{} and the theory of combinatorial exploration will bridge this gap
and automate the process of going from an object to a
specification. Rather than rediscovering similar methods and repeating the same
ideas, \motheralg{} will push mathematical developments in new directions.
The beta version has been very successful and found
specifications for classes that had entire papers devoted to them, including but
not limited to~\cite{inflationsofgridclasses, smoothclasses, le-wilfclasses,
dbis, jaysgridclasses}.
Despite these successes, numerous interesting cases still remain, in particular,
$\Av(1324)$, the biggest open problem in the field, against which every new
proof method is tested.
If we are successful in finding a structure
and generating function for unenumerated classes, in particular
$\Av(1324)$, it would be considered a major breakthrough. Even tighter bounds
on the enumeration of this set are worthy of publication.
\\
\spacing{1}
\section{Present state of knowledge in the field}
\spacing{1.5}
There exist specialized algorithms for enumerating permutation classes, such as
enumeration schemes of \mbox{Zeilberger}~\cite{MR1682929} with a generalization
that accounts for statistics of \mbox{Baxter}~\cite{baxter2014scheme_statistics};
regular insertion encoding of \mbox{Albert}, \mbox{Linton}, and \mbox{Ru\v{s}kuc}~\cite{MR2176523};
substitution decomposition of classes with finitely many simple permutations
of \mbox{Bassino} et al.~\cite{subdecompimplement}; and
guessing a disjoint union of grid classes of Bean, Gudmundsson, and Ulfarsson~\cite{algstruct, structpaper}.
These algorithms are deterministic and either succeed or fail on their inputs.
The \motheralg{} framework is different in that it generates a universe of true
statements about the object under investigation, and from this universe one can
often extract enough information to reveal the structure of the object. What we
are proposing in this application is strengthening the framework in such a way that
it can reveal the structure of the object with a smaller universe, therefore making
it applicable to more problems.
\\
\spacing{1}
\section{Research plan (time and work plan, methodology, milestones, present status of project, etc.) and deliverables. Refer to more detailed description of milestones and deliverables in part 3 in the IRF electronic application system. Explain if consents and/or permits are needed}
\spacing{1.5}
\noindent \textbf{Time and work plan.}
The work packets are detailed under milestones below, as well as when they will
be attempted.
\noindent\textbf{Methodology.}
Some methods used will be traditional, coming from combinatorics, algorithms and
computer science. We will also use newer tools, such as generalized grid classes
and mesh patterns. We will also introduce new strategies, and
apply methods from computer science, such as probabilistic search and machine
learning.
\noindent \textbf{Milestones.}
\begin{packed_enum}
\item[WP1] Develop algorithm to find combinatorial systems automatically.
(Jan-Dec '\yearofappls)
\item[WP2] Implement iterative deepening search in the framework. (Jan-Jun '\yearofapplsp)
\item[WP3] Extend the \tilealg{} algorithm to find specifications and
combinatorial systems that describe permutation statistics in classes.
(Jul-Dec '\yearofapplsp)
\end{packed_enum}
\noindent\textbf{Present status of the project.} As we noted in the subsection on
feasibility, originality, and impact of our work, our beta has been successful
with several cases. As can be expected from a beta, there are many
obvious avenues for improvements, such as
searching in a more clever manner, looking for combinatorial systems and
finding specifications that describe statistics.
\noindent\textbf{Deliverables.}
Results obtained will be turned into papers and talks at conferences, as is
customary with mathematical research. We hope that these results will lead to
more work by others in the area. The developed code base will be made publicly
available for others to use and extend.
\\
\spacing{1}
\section{Management and co-operation (domestic/foreign)}
\spacing{1.5}
The proposer has a master's degree in Mathematics and Informatics from
Universit\'e du Qu\'ebec \`a Montr\'eal. He has demonstrated great research
potential by completing original research during his undergraduate and master
programs. The student already has two peer-reviewed publications and a third publication
devoted to permutation patterns under review.
He is also a
good programmer interested in algorithms
The proposer will be advised by Henning Ulfarsson, an assistant professor
at Reykjavik University. Ulfarsson leads a research group in the field and has
a large network of collaborators.
He has solved two long-standing open
problems in the field: the description of local complete intersection Schubert
varieties, with Woo~\cite{UW11}, and the description of West-$3$-stack-sortable
permutations~\cite{MR2971010}. He co-created the
\bisc{}~\cite{BiSC} and \struct{} algorithms~\cite{algstruct, structpaper}.
He has advised many students on projects in the theory of permutations,
e.g.~\cite{WilfOfShort, parti, MagnussonMSC, MurrayMSC, TomasMSc, BeanPhd}.
The PhD student will also collaborate with Jay Pantone who is an
assistant professor in the
Department of Mathematical and Statistical Sciences at Marquette University.
Pantone was involved in the developments of
the beta version of \motheralg{} and is an expert in analytic combinatorics and
permutations patterns.
\\
\spacing{1}
\section{Proposed publication of results and data storage (including adherence to open access policy)}
\spacing{1.5}
New results will be published in top-tier refereed open access
international journals in discrete mathematics and related fields, as well as
disseminated at seminars and conferences.
% \\%
\label{LastPageOfDescription}
\clearpage
\setcounter{page}{1}
\renewcommand{\lastpageref}{\pageref{LastPage}}%
\bibliographystyle{abbrv}
\bibliography{rannis2021}
\end{document}
%%% Local Variables:
%%% mode: latex
%%% TeX-master: t
%%% End: