@k.sukumar I have checked your document on my side and your code successfully remove page break from your document. Please make sure you have attached the correct file.
Hi @alexey.noskov,
with this code in my document if any other sections have existed they are also removed. but i want only the first-page section has to be removed . can you help me with this issue? the above code you send to me for above document i have attached to you.QC28738_Output_O-3.2.A.1 Facilities and Equipment-Pfizer Grange Castle.zip (43.3 KB)
@k.sukumar In the attached document there is a section break at the very beginning of the document, i.e. there is a section with empty body. However, this section still have header/footer, which, I suppose, you would like to keep. In this case you can use code like the following:
Document doc = new Document(@"C:\Temp\in.doc");
while (doc.Sections.Count > 0 && String.IsNullOrEmpty(doc.FirstSection.Body.ToString(SaveFormat.Text).Trim()))
{
doc.FirstSection.AppendContent(doc.Sections[1]);
doc.Sections[1].Remove();
}
doc.Save(@"C:\Temp\out.doc");
Hi @awais.hafeez,
Actually, I am trying to read existing citations in the document but I am unable read them can you help to read that citation and add it to a list.
I have attached a document in that when you go to the last page you will find reference page there are 20 references. when you double click numbers like 1,2,3 it will directly route to the endnotes of that number.
i want to read those particular citations can you help me to read those citations.
we have code on how to create citations .
// Open a document containing bibliographical sources that we can find in
// Microsoft Word via References -> Citations & Bibliography -> Manage Sources.
Document doc = new Document(MyDir + "Bibliography.docx");
DocumentBuilder builder = new DocumentBuilder(doc);
builder.Write("Text to be cited with one source.");
// Create a citation with just the page number and the author of the referenced book.
FieldCitation fieldCitation = (FieldCitation)builder.InsertField(FieldType.FieldCitation, true);
// We refer to sources using their tag names.
fieldCitation.SourceTag = "Book1";
fieldCitation.PageNumber = "85";
fieldCitation.SuppressAuthor = false;
fieldCitation.SuppressTitle = true;
fieldCitation.SuppressYear = true;
Assert.AreEqual(" CITATION Book1 \\p 85 \\t \\y", fieldCitation.GetFieldCode());
// Create a more detailed citation which cites two sources.
builder.InsertParagraph();
builder.Write("Text to be cited with two sources.");
fieldCitation = (FieldCitation)builder.InsertField(FieldType.FieldCitation, true);
fieldCitation.SourceTag = "Book1";
fieldCitation.AnotherSourceTag = "Book2";
fieldCitation.FormatLanguageId = "en-US";
fieldCitation.PageNumber = "19";
fieldCitation.Prefix = "Prefix ";
fieldCitation.Suffix = " Suffix";
fieldCitation.SuppressAuthor = false;
fieldCitation.SuppressTitle = false;
fieldCitation.SuppressYear = false;
fieldCitation.VolumeNumber = "VII";
Assert.AreEqual(" CITATION Book1 \\m Book2 \\l en-US \\p 19 \\f \"Prefix \" \\s \" Suffix\" \\v VII", fieldCitation.GetFieldCode());
// We can use a BIBLIOGRAPHY field to display all the sources within the document.
builder.InsertBreak(BreakType.PageBreak);
FieldBibliography fieldBibliography = (FieldBibliography)builder.InsertField(FieldType.FieldBibliography, true);
fieldBibliography.FormatLanguageId = "1124";
Assert.AreEqual(" BIBLIOGRAPHY \\l 1124", fieldBibliography.GetFieldCode());
doc.UpdateFields();
doc.Save(ArtifactsDir + "Field.CITATION.docx");
@k.sukumar It looks like you have missed to attach your document. Could you please attach it here for our reference. We will check it and provide you more information.
hi @awais.hafeez,
this is the document regarding the issue
Source_PSUR (PBRER Format) Word Template.docx (319.3 KB)
@k.sukumar What you are asking about are footnotes, you can use code like this to read them:
Document doc = new Document(@"C:\Temp\in.docx");
NodeCollection footnotes = doc.GetChildNodes(NodeType.Footnote, true);
foreach (Footnote note in footnotes)
{
if (note.FootnoteType == FootnoteType.Endnote)
Console.WriteLine(note.ToString(SaveFormat.Text));
}
Hi @alexey.noskov,
thanks for the help but unable read that footnote …
But I have tried this code to get citations with this code it worked
class Program
{
static void Main(string[] args)
{
Document doc = new Document("E:\\ABC citatinos.docx");
bool status = false;
bool bookmarkEx = false;
LayoutCollector layout = new LayoutCollector(doc);
List<int> lst = new List<int>();
List<object> objlst = new List<object>();
foreach (Section sct in doc.Sections)
{
foreach (Paragraph pr in sct.Body.GetChildNodes(NodeType.Paragraph, true))
{
foreach (Field field in pr.Range.Fields)
{
if (field.Type == FieldType.FieldCitation)
{
FieldCitation citation = (FieldCitation)field;
if (citation.Suffix != null)
{
bookmarkEx = true;
status = true;
}
if (status == true && bookmarkEx)
{
if (layout.GetStartPageIndex(field.Start) != 0)
lst.Add(layout.GetStartPageIndex(field.Start));
}
}
}
}
}
}
}
}
ABC citatinos.docx (15.5 KB)
But I have an issue reading the works site below text. I am able to read works site above text by above code .which i have pasted will you help me to read below works site citations.
ABC citatinos.docx (15.5 KB)
This is the document for my exact issue.
After work site iam unable to read below citations. will you help me read
@k.sukumar The filed below Workd Cited
is not CITATION
, but BIBLIOGRAPHY
field. Please see the screenshot:
You can press Alt+F9
in MS Word to see fields codes.
So you should add one more condition in your code:
foreach (Field field in pr.Range.Fields)
{
if (field.Type == FieldType.FieldCitation)
{
// Process CITATION field
}
if (field.Type == FieldType.FieldBibliography)
{
// Process BIBLIOGRAPHY field.
}
}
Hi awais I need help regarding space after colon in the document … I want to put only single space after a colon if have multiple space it should be removed and put only single space I have used regular expression but not getting the output
@awais.hafeez
This is code i have used
LayoutCollector layout = new LayoutCollector(doc);
foreach (Paragraph pr in doc.GetChildNodes(NodeType.Paragraph, true))
{
Regex regstart = new Regex("([a-zA-Z]+\\:(\\s{2,})?)");
//Regex regstart2 = new Regex("([a-zA-Z]+\\.)");
if (regstart.IsMatch(pr.Range.Text))
{
flag = true;
if (layout.GetStartPageIndex(pr) != 0)
lst.Add(layout.GetStartPageIndex(pr));
}
Regex regstart2 = new Regex("([a-zA-Z]+\\:\\s)");
if (regstart2.IsMatch(pr.Range.Text))
{
flag = false;
}
}
if (flag == false)
{
rObj.QC_Result = "Passed";
rObj.Comments = "Contains only single space after colon.";
}
else
{
List<int> lst2 = lst.Distinct().ToList();
if (lst2.Count > 0)
{
lst2.Sort();
Pagenumber = string.Join(", ", lst2.ToArray());
lst2.Sort();
rObj.QC_Result = "Failed";
rObj.Comments = "not Contains single space after colon in Page Numbers: " + Pagenumber;
}
}
@k.sukumar You can easily achieve this using code like the following:
Document doc = new Document(@"C:\Temp\in.docx");
FindReplaceOptions options = new FindReplaceOptions();
options.UseSubstitutions = true;
doc.Range.Replace(new Regex("([a-zA-Z]+\\:)\\s{2,}"), "$1 ", options);
doc.Save(@"C:\Temp\out.docx");